Doctor of Medicine and Doctor of Philosophy (MDPhD)
Epigenomic Dysregulation in Synovial Sarcoma
G+PS regularly provides virtual sessions that focus on admission requirements and procedures and tips how to improve your application.
I’m extremely grateful to have such a brilliant, caring, and enthusiastic supervisor. He has been supportive and motivating at every step of the PhD process. He is not only a role model as a scientist but also as a leader. He motivates his students and directs them on a path to better themselves.
Epigenetic modifications including reversible and non-uniform chemical marks to chromatin support activation and silencing of gene transcription. Alterations in normal epigenetic states are associated with transformation across a wide range of cancer types including myeloid malignancies. To understand the role of epigenetic regulation of normal human hematopoietic progenitors and its dysfunction in myeloid transformation, I developed a low-cell input chromatin immunoprecipitation method that, when combined with an analytical framework enables a simultaneous assessment of chromatin accessibility and histone modification state. This method enabled a comparative analysis of the epigenomic states of normal and malignant human blood cell compartments. Application of this methodology to highly purified, phenotypically defined subsets of primitive and terminally differentiating normal human cord blood cells showed that multiple human hematopoietic progenitor phenotypes display a common H3K27me3 signature. This signature includes many large organized H3K27me3 domains co-marked by H3K9me3 also found in the mature lymphoid cells in cord blood (CB) but not in co-isolated monocytes or erythroblasts. These results indicate a marked difference in the epigenomic changes primitive human neonatal hematopoietic cells undergo when they initiate terminal differentiation of the lymphoid and myeloid lineages. Further evidence that this differential H3K27me3 contraction directly impacts hematopoietic differentiation was obtained by manipulating H3K27me3 regulators in cell line models of inducible neutrophil differentiation in vitro.These methodologies were then used to explore epigenomic dysfunction found in the leukemic cells obtained from patients presenting with acute myeloid leukemia (AML) whose blasts differed in their content of neomorphic isocitrate dehydrogenase (IDH) mutations. Comparison of the methylation landscape in the AML cells with and without IDH mutations revealed a higher fractional DNA methylation level at active enhancers in the IDH mutant cells. However, there was no significant difference in global occupancy of histone modifications between the leukemic cells from the two patient groups.Collectively, these findings reveal previously unknown relationships of epigenetic modifications in normal and malignant human blood cells.
Structural variants (SVs) contribute greater diversity at the nucleotide level between two human genomes than any other form of genetic variation and are three-fold more likely to correlate in genome-wide association studies (GWAS) than single nucleotide variants (SNVs). Using short-read, high-throughput sequencing technologies to uncover such variation has proven to be troublesome and the methods to detect SVs depend on indirect inferences. However, while larger (>5kb) copy number variations (CNVs) could be characterized using read-depth-based algorithms, this approach often fails for smaller and balanced events. Another fundamental problem for detection of SVs from short-read sequencing is inherent to the predominant data type and typical SV detection algorithm that is effective in unique sequences often fails within complex genomic regions, which have been proven to be highly enriched for SVs. In addition, most SV discovery methods do not indicate the haplotype-origin for a given SV and require parental sequencing for this information. For a more complete description and interpretation of human genomic information in relation to phenotypes such as e.g. cancer predisposition and response to therapies, it will, therefore, be necessary to arrange sequence data into parental haplotypes and ascertain polymorphic inversions with respect to such haplotypes. All this can be achieved using Strand-seq. Strand-seq complements other sequencing approaches by providing crucial information about the genetic make-up of individuals that cannot be obtained in any other way. To make Strand-seq available for human studies worldwide is an immense challenge. Library construction, as well as data analysis, needs to be further developed, integrated and made user-friendly to allow accurate and rapid interpretation of results. Here we present a custom bioinformatics pipeline for analyzing Strand-seq data that streamlines the workflow of raw sequence read alignment, putative variant calling, variant call refinement and haplotype assembly by integrating current available Strand-seq specific tools. In addition, relevant metric data are compiled and visualized, ensuring and reinforcing the potential of Strand-seq as a robust sequencing method for uncovering clinically significant SVs and the assembly of WGH without additional parental genomic data.
Neomorphic mutations in isocitrate dehydrogenase 1/2 (IDH1/2) and inactivating mutations in ten eleven translocation dioxygenase 2 (TET2) are frequent and mutually exclusive events in de novo acute myeloid leukemia (AML). IDH1/2 mutations drive epigenomic dysfunction through production of the oncometabolite R-2-hydroxyglutarate (R-2HG), which inhibits the ability of TET2 to oxidize 5-methylcytosine (5mC) to 5-hydroxymethylcytosine (5hmC), and causes DNA hypermethylation. Vitamin C (vitC) induces epigenetic reprogramming and cellular differentiation through TET re-activation in a murine leukemia model with mutant IDH1ᴿ¹³²ᴴ. Although TET mutations in AML occur exclusively in the TET2 gene, TET2 and TET3 are expressed at similar levels in de novo AML blasts. The role of TET3 in the regulation of myeloid activation and its role in AML is currently unknown. Based on the functional redundancy among TET enzymes, I hypothesize that the vitC-induced re-establishment of methylation homeostasis and differentiation in IDH1 mutant cells is also mediated through TET3. To delineate the contributions of TET2 and TET3 to vitC-induced reprogramming, we inactivated TET2 (TET2KO) and TET3 (TET3KO) individually or in combination (DKO) in HOXA9/IDH1ᴿ¹³²ᴴ (R132H) cells using CRISPR/Cas9. After 15 and 72 hours of vitC treatment, we sequenced total RNA and 5hmC/5mC-immunoprecipitated DNA. We showed that IDH1ᴿ¹³²ᴴ inhibition of TET is incomplete and that loss of either TET2 or TET3 down-regulates a set of 136 genes related to myeloid activation. A subset of the TET2/3-dependent genes can be rescued with vitC activation of the remaining TET, from which we identified PU.1, CEBPE, and RUNX1 as putative transcription factors.We identified a hypermethylation phenotype at enhancers in the context of the IDH1ᴿ¹³²ᴴ, which can be reversed through vitC treatment in a TET2- and TET3- dependent mechanism. Pathway analyses of nearby genes and genes up-regulated by vitC suggest a common myeloid differentiation pathway that both TET2 and TET3 can activate. To verify this, TET2KO and TET3KO but not DKO cells showed reduced proliferation and increased levels of myeloid differentiation markers from vitC treatment. These findings support my hypothesis and suggest a model in which vitC activates both TET2 and TET3 to reprogram the enhancer landscape of IDH1ᴿ¹³²ᴴ leukemic cells to drive differentiation.
Increasing evidence of functional and transcriptional heterogeneity in phenotypically similar single-cells has prompted interest in protocols for obtaining parallel methylome data. Despite appreciable advancements in experimental protocols for single-cell DNA methylation measurements, methods for analyzing the resulting data are still immature. To address the challenge of stochastic data loss associated with single cell measurements, current strategies average methylation in windows or region sets. However previous studies have demonstrated that single CpGs are functional and our analysis of single cell methylation measurements revealed a rapid decay in concordance neighbouring CpG states beyond 1kb. To leverage the information content of individual CpGs in the context of single cell methylation measurements we developed an analytical strategy for deriving single-cell DNA methylation states using individual CpGs, which we term PDclust. We validated PDclust on existing datasets and on data we generated from single index-sorted murine and human hematopoietic stem cells (HSCs) that are highly enriched in functionally defined stem cells. Using PDClust, we identified epigenetically distinct subpopulations within these HSC populations. Strikingly, human cord blood derived HSC populations were separable by donor specific methylation states whereas more differentiated hematopoietic cells separated solely by cell type. Interestingly, removal of methylation sites near genetic variants did not impact this separation, suggesting that these epigenetic states may be a consequence of environmental differences. Finally, through protocol optimization and deep sequencing we generated one of the most comprehensive sets of single cell methylome profiles (20% of CpGs on average) and from these were able to generate genomewide profiles from as little as 6 epigenetically related HSCs to derive subtype-specific regulatory states.