Paul Pavlidis

Professor

Faculty of Medicine

Research Classification

Neurosciences, medical and physiological and health aspects

Engineering and technology

Research Interests

genomics

Bioinformatics

cellular and molecular neuroscience

Genetics

disorders of the nervous system

single-cell genomics

Computational Biology

Gene regulation

Affiliations to Research Centres, Institutes & Clusters

Centre for Brain Health

Dynamic Brain Circuits in Health and Disease Research Excellence Cluster

Michael Smith Laboratories

Research Options

I am available and interested in collaborations (e.g. clusters, grants).

I am interested in and conduct interdisciplinary research.

I am interested in working with undergraduate students on research projects.

Graduate Student Supervision

Doctoral Student Supervision

Dissertations completed in 2010 or later are listed below. Please note that there is a 6-12 month delay to add the latest dissertations.

Mining of differential expression across thousands of conditions (2022)

Differential expression (DE) analysis is performed to identify genes associated with a phenotype based on changes in RNA expression levels. The result of various bioinformatics analyses is a hit list of genes that requires further interpretation to identify the functions of these genes and prioritize the genes for further study; there is currently a lack of objective metrics for gene prioritization. The ease of generating transcriptomic data has resulted in the accumulation of massive amounts of data in repositories (e.g. NCBI GEO). In my thesis, I investigate means of harnessing this archived data for interpreting hit lists. First, I describe the development of Gemma, a large corpus containing over 10,000 curated and reprocessed datasets made suitable for data mining. I contributed by establishing the curation guidelines for using ontology concepts during dataset annotation, and characterizing Gemma’s features. Next, I describe the evaluation of Connectivity Map (CMap), a hit list interpretation framework designed for in silico repositioning of previously approved drugs for treating human diseases. Through a series of analyses, I demonstrated that drug repositioning results between two versions of CMap are discordant, and that this discordance is caused by low reproducibility of DE profiles both between and within each CMap version. This demonstrates the importance of high-quality data and careful evaluation of hit list interpretation frameworks. Finally, in a collaboration, we showed that there are huge differences in how often genes are differentially expressed (“DE prior”) across a large corpus of human datasets. We proposed that the prior could be used to facilitate hit list interpretation, identifying genes that are more specifically DE in a studied phenotype. I expanded this work further by examining variables that may influence the DE prior such as microarray platform gene coverage; I found the DE prior robust to these variables. I also demonstrate that given enough data, context-specific (e.g. tissue) or topic-specific DE priors can be developed for topic-specific applications. My work contributes to our knowledge of patterns of gene differential expression and their utility in addressing questions related to gene function in human health and disease.

Prioritizing genes with functionally distinct splice isoforms (2021)

Most mammalian genes generate multiple transcripts via splicing, and we do not know the function of most splice variants. Currently, there is a debate about how many splice variants are likely nonfunctional or “noisy” transcripts. My thesis explores the claim that alternative splicing vastly increases the genome’s functional diversity in the context of noisy splicing, and in doing so attempts to identify candidate cases for which alternative splicing is likely to be of consequence. To ground computational analyses of genes with multiple splice variants in experimental data, the field needs a corpus of genes that have experimental evidence of functionally distinct splice isoforms (FDSIs). We curated the literature for 743 genes and found that ~5% had literature evidence of FDSIs. This suggests that the claim that alternative splicing vastly increases genomic functional diversity is extrapolated from a few key genes. Next, I developed a pipeline to identify candidate genes with FDSIs using long-read RNA-seq data. The output of my pipeline is a computationally prioritized list of candidate genes likely to have FDSIs based on features such as expression, conservation, functional domains, and coding potential. From an initial set of 6,799 genes with multiple splice variants, I prioritized 79 candidate genes. While I had limited long-read data, my work aids in establishing guidelines for high-throughput prioritization of genes with FDSIs for future study. With our collaborators, I investigated a specific application of my pipeline to the voltage-gated calcium channel gene Cacna1e. Using novel long-read data, I established a set of 2,110 splice variants for Cacna1e. Based on properties of the channel, I determined that at most 154 splice variants are likely to encode a functional channel. My results highlighted the amount of potential noise produced by one gene’s expression. Through my investigation, I added to the growing body of literature in support of noisy splicing.
I also provided the field with a list of interesting genes with multiple splice variants. This includes a gold standard set of genes from the experimental literature, and a novel set of prioritized genes. Both sets of genes will be useful for future studies of gene function.

The interpretation of gene coexpression in systems biology (2020)

One of the key features of transcriptomic data is the similarity of expression patterns among groups of genes, referred to as coexpression. It has been shown that coexpressed genes tend to share similar functions. Based on this, a common assumption is that gene coexpression is a result of transcriptional regulation and therefore, regulatory relationships could be inferred from coexpression. However, success in inferring such relationships has been limited and there are questions about the source and interpretation of coexpression. Here I explore coexpression as an observed signal from the data, examine its source and assess its relevance for inferring regulatory relationships. In Chapter 2, I studied differential coexpression, which refers to the alteration of gene coexpression between biological conditions. It is commonly assumed that differential coexpression can reveal rewiring of transcription regulatory networks, specifically among the genes that maintain their average expression level between the conditions. However, I show that to a large extent and in contrast to this common assumption, differential coexpression is more parsimoniously explained by changes in average expression levels. This finding demonstrates limitations for inference of regulatory rewiring from coexpression and poses questions for the underlying causes of the observed coexpression. In Chapter 3, I studied cellular composition variation among bulk tissue samples as a source of variance and of the observed coexpression. I found that for most genes, differences in expression levels across cell types account for a large fraction of their variance and as a result genes with similar cell-type expression profiles appear to be coexpressed. Finally, I showed that this coexpression dominates the underlying intra-cell-type coexpression and also has the two prominent features of coexpression in the bulk tissue: reproducibility and biological relevance.
Through my studies, I was able to provide an explanation for much of the observed coexpression in the bulk tissue and shed light on its resolution and limitations for inference of regulatory relationships. I also studied coexpression in single-nucleus data and showed that some of the coexpression observed in it is likely attributable to transcriptional regulation, which could be a subject for future studies.

Computational analysis of ribonucleic acid basepairs in RNA structure and RNA-RNA interactions (2016)

Ribonucleic acids (RNA) are an essential part of cellular function, transcribed from DNA and translated into protein. Rather than a passive informational medium, RNA can also be highly functional and regulatory. Certain RNAs fold into specific structures that give them enzymatic properties, while others bind to specific targets to guide regulatory processes. With the advent of next-generation sequencing, a large number of novel non-coding RNAs have been discovered through whole-transcriptome sequencing. Many efforts have been made to study the structure and binding partners of these novel RNAs, in order to determine their function and roles. This work begins with a description of my R package R4RNA for manipulating RNA basepair data, the building blocks of RNA structure and RNA binding. The package deals with the input/output and manipulation of RNA basepair and sequence data, along with statistical and visualization methods for evaluation, interpretation and presentation. We also describe R-chie, a visualization tool and web server built on R4RNA that visualizes complex RNA basepairs in conjunction with sequence alignments. We then conduct the largest known evaluation of RNA-RNA interaction methods to date, running state-of-the-art tools on curated experimentally validated datasets. We end with a review of cotranscriptional RNA basepair formation, summarizing biological, theoretical and computational methods for the process, and future directions for improving classical methods in RNA structure prediction. All content chapters of this thesis have been peer-reviewed and published. The work on R4RNA has led to two publications, with the package used to great visual effect by various publications and also adopted by the RNA structure database Rfam. My assessment of RNA-RNA interaction is at present the only published evaluation of its kind, and will hopefully become a benchmark for future tool development and a guide to selecting appropriate tools and algorithms.
Our published review on RNA cotranscriptional folding has been well received, being the first review specifically on this topic.

Generation of Truncated Proteoforms in Proteolytic Networks: Modeling and Prediction in the Protease Web (2016)

Primarily controlled by gene expression and fine-tuned by translation and degradation rates, protein activity is governed by a plethora of post-translational modifications such as phosphorylation and glycosylation, which generate a diversity of protein species and thereby control complex biological phenotypes. Processing by proteases is a particular modification leading to the irreversible generation of stable protein truncations. Proteolytic processing is well understood in examples such as signal- or propeptide removal, and recent analyses consistently identify >50% of N-terminal peptides mapping inside the protein sequence as predicted by genomics, indicating an important regulatory role of proteases. All proteins undergo protease cleavage as part of processing or degradation, a second biological process controlled by proteases. Proteases are involved in numerous pathologies and commonly considered as drug targets. However, protease research and drug development are complicated, in part due to widespread crosstalk between proteases. Proteases regulate other proteases through direct cleavage or cleavage of protease inhibitors in a complex network of protease interactions, the protease web. Yet, a comprehensive analysis of the protease web has not been performed, hampering assignment of proteases to clear biological roles, their direct substrates, and protease inhibitor drug targeting. A second problem in the identification of protein processing is the potential confound between protein termini generated by protease processing, alternative splicing, and alternative translation. In this thesis, I computationally analyzed large and diverse datasets of protease interactions and protein truncations to gain insight into complex proteolytic processes and to guide biochemical follow-up experiments.
Analyzing protease cleavage, alternative splicing and alternative translation data incorporated into our database TopFIND, I found that protease cleavage and alternative translation likely generate most protein truncations. Combining protease cleavage and inhibition data in a graph model of the protease web, I demonstrated extensive protease crosstalk and then predicted and validated a proteolytic pathway. Finally, investigating strategies for the prediction of protease inhibition, I predicted hundreds of protease-inhibitor interactions, and validated inhibition of kallikrein-5 by serpin B12. This work thus generated predictions for biochemical follow-up as well as important insights into the regulation of biological systems through proteases.

Bioinformatics for neuroanatomical connectivity (2012)

Neuroscience research is increasingly dependent on bringing together large amounts of data collected at the molecular, anatomical, functional and behavioural levels. This data is disseminated in scientific articles and large online databases. I utilized these large resources to study the wiring diagram of the brain or ‘connectome’. The aims of this thesis were to automatically collect large amounts of connectivity knowledge and to characterize relationships between connectivity and gene expression in the rodent brain. To extract the knowledge embedded in the neuroscience literature I created the first corpus of neuroscience abstracts annotated for brain regions and their connections. These connections describe long distance or macroconnectivity between brain regions. The collection of over 1,300 abstracts allowed accurate training of machine learning classifiers that mark brain region mentions (76% recall at 81% precision) and neuroanatomical connections between regions (50% sentence level recall at 70% precision). By automatically extracting connectivity statements from the Journal of Comparative Neurology I generated a literature based connectome of over 28,000 connections. Evaluations revealed that a large number of brain region descriptions are not found in existing lexicons. To address this challenge I developed novel methods that allow mapping of brain region terms to enclosing structures. To further study the connectome I moved from scientific articles to large online databases. By employing resources for gene expression and connectivity I showed that patterns of gene expression correlate with connectivity. First, two spatially anti-correlated patterns of mouse brain gene expression were identified. These signatures are associated with differences in expression of neuronal and oligodendrocyte markers, suggesting they reflect regional differences in cellular populations. 
Expression level of these genes is correlated with connectivity degree, with regions expressing the neuron-enriched pattern having more incoming and outgoing connections with other regions. Finally, relationships between profiles of gene expression and connectivity were tested. Specifically, I showed that brain regions with similar expression profiles tend to have similar connectivity profiles. Further, optimized sets of connectivity linked genes are associated with neuronal development, axon guidance and autistic spectrum disorder. This demonstration of text mining and large scale analysis provides new foundations for neuroinformatics.

Meta-analysis of expression profiling data in the postmortem human brain (2012)

Schizophrenia is a severe psychiatric illness for which the precise etiology remains unknown. Studies using postmortem human brain have become increasingly important in schizophrenia research, providing an opportunity to directly investigate the diseased brain tissue. Gene expression profiling technologies have been used by a number of groups to explore the postmortem human brain and seek genes which show changes in expression correlated with schizophrenia. While this has been a valuable means of generating hypotheses, there is a general lack of consensus in the findings across studies. Expression profiling of postmortem human brain tissue is difficult due to the effect of various factors that can confound the data. The first aim of this thesis was to use control postmortem human cortex for identification of expression changes associated with several factors, specifically: age, sex, brain pH and postmortem interval. I conducted a meta-analysis across the control arm of eleven microarray datasets (representing over 400 subjects), and identified a signature of genes associated with each factor. These genes provide critical information towards the identification of problematic genes when investigating postmortem human brain in schizophrenia and other neuropsychiatric illnesses. The second aim of this thesis was to evaluate gene expression patterns in the prefrontal cortex associated with schizophrenia by exploring two methods of analysis: differential expression and coexpression. Seven schizophrenia microarray studies of prefrontal cortex were combined for a total of 153 subjects with schizophrenia and 153 healthy controls. Meta-analysis was conducted with careful consideration for the effects of covariates, revealing a robust list of 98 differentially expressed ‘schizophrenia genes’. 
Using the same seven schizophrenia datasets, coexpression networks were generated for control and schizophrenia cohorts within each dataset and then combined across studies using a rank aggregation approach. Topological properties of our ‘schizophrenia genes’ were evaluated in the context of each network, highlighting differences in correlation structure of these genes in the control and schizophrenia brain. Together these results converge towards a general conclusion, emphasizing that the integration of postmortem human brain expression profiling data improves statistical power and is particularly useful in detecting subtle yet consistent changes in expression associated with schizophrenia.

Master's Student Supervision

Theses completed in 2010 or later are listed below. Please note that there is a 6-12 month delay to add the latest theses.

Investigations into transcriptomic engram cells (2024)

Identification of cell type marker genes of the brain and their use in estimation of cell type proportions (2022)

Cellular composition variation drives coexpression-based gene function prediction (2021)

Large-scale mining of differential expression data for insight into gene function (2021)

Single-cell analytics for phospho flow cytometry reveals dynamic interactions between molecular pathways (2021)

Quantitative analysis of large single-cell measures acquired by phospho flow cytometry typically involves establishing inclusion gate thresholds and combining measures from accepted cells into a single median metric. Though this analysis method is simple, it overlooks the heterogeneity of cell populations and there could be information missing from the single-cell level. Here, we have formulated approaches that can recognize the heterogeneity and extract additional information involving dose-response and interactions between multiple molecules from phospho flow cytometry datasets. Using phospho flow multiplexed sampling of cell physical features, and primary antibodies against protein markers, including GAPDH as a protein expression control, HA tag as an exogenous gene/variant transfection measurement, and 8 antibodies detecting the activation (phosphorylation) states of 8 proteins within conserved molecular pathways, two panels of phospho-specific antibodies were used simultaneously for multiplexed measures in the same cells. Our approach involves single-cell standardization, fitting loess regression, identifying linear domains in dose-response plots, building linear mixed-effects models, and multi-dimensional analyses to detect interactions between phosphorylated protein markers. We demonstrate the utility of this approach by expressing wild-type and 5 variants (4A, D268E, Y138L, P38H, G129E) of PTEN on 8 markers of molecular pathways downstream of PTEN, and we also expressed RHEB WT testing its impact on markers in the shared associated pathways. We succeeded in differentiating subtypes of PTEN loss-of-function variants and were able to predict that PTEN P38H is a loss-of-lipid-phosphatase-function variant. We were also able to infer that pAKT, p4EBP1, pS6, and pCREB are all downstream targets of PTEN regulation while pAKT is between PTEN and p4EBP1, pS6, or pCREB. 
In conclusion, our results demonstrate dose response and molecular pathway interactions unavailable from reducing population data to single values, and our approach shows strong promise in variant function measurement and molecular signaling pathway inference.

An investigation into the utility of guilt by association machine learning algorithms for the prioritization of autism spectrum disorder candidate risk genes (2020)

An analysis of genetic variants associated with autism spectrum disorder (2018)

Mega-analysis of gene expression patterns across tissues in human and mouse (2018)

Exploring sources of variability in electrophysiology data of mammalian neurons (2017)

Meta-analysis of gene expression in mouse models of neurodegenerative disorders (2017)

There is intense interest in understanding the molecular mechanisms that contribute to neurodegenerative disorders (NDs), which involve complex interplays of genetic and environmental factors. Catching early events involved in disease initiation requires investigation of pre-symptomatic brain samples. It is difficult to capture early molecular events using post-mortem human brain samples since these samples represent the late phase of the disorder with progressive brain damage and neurodegeneration. Disease mouse models are developed to study disease progression and pathophysiology. Here, I focus on two of the most studied NDs: Alzheimer’s disease (AD) and Huntington’s disease (HD). Mouse models developed for the same disease (AD or HD) often share similar phenotypes mimicking human disease symptoms, which suggests potential common underlying mechanisms of disease initiation and progression across mouse models of the same disease. Investigation of gene expression profiles of pre-symptomatic animals from different mouse models may shed light on the mechanisms occurring in the early disease phase. Gene expression profiling analyses have been performed on mouse models, and some of these studies investigate the molecular changes in the pre-symptomatic phase of AD and HD, respectively. However, their findings have not reached a clear consensus. To identify shared molecular changes across mouse models, I conducted a systematic meta-analysis of gene expression in mouse models of AD and HD, consisting of 369 gene expression profiles from 23 independent studies. The goal of this project is to identify transcriptional alterations shared among different mouse models of each disease, especially changes during the early disease phase that may be linked to disease-causing mechanisms, and potential common cross-disease changes.
For both of the disorders, the results showed subtle but biologically interpretable changes shared across mouse models in the early disease phase that may contribute to the early disease progression: dysregulation of genes involved in cholesterol biosynthesis and complement system in AD mouse models and genes encoding mitochondrial respiratory chain complexes in HD mouse models. Cross-disease similarities in the late phase suggested that different brain regions may share mechanisms in response to neuronal loss and toxic protein aggregates.

A Study of Methods for Learning Phylogenies of Cancer Cell Populations from Binary Single Nucleotide Variant Profiles (2015)

Identification and exploration of gene product annotation instability and its impact in current usages (2014)

Proteins are macromolecules responsible for a wide range of activities in the structure and function of cells. Their activities have been described in different contexts as a means to elucidate their "function". These descriptions have been captured across biological databases in a standardized format called Gene Ontology Annotations (GOA), to disseminate the knowledge and extrapolate the information to other proteins whose function is still unknown. Furthermore, the annotations are used to analyse and interpret data from high-throughput studies and also as a benchmark for the assessment of protein function prediction algorithms. Constant changes occur in GOA that can potentially impact such usages, but only limited effort has been put into exploring their instability, or into assessing the impact that these changes have on reproducibility or interpretation of previous analyses. In the present work, I performed the most comprehensive analysis of the annotation instability for 14 representative model organisms (E. coli, fruit fly, mouse, etc.). The results showed important instability patterns that were species-specific. As such information would be of use to the community to trace the instability of annotations of their interest, a web-based visualization tool was built to track these changes on a protein, functional term and species-specific basis. Additionally, we identified artifacts in the annotation data that can be attributed to curation patterns. We propose that such artifacts be considered for a more accurate assessment of function prediction algorithms. Furthermore, the impact that changes in the annotations have on common settings like gene set enrichment analyses was also explored. In particular, 2,000 datasets were used to assess the robustness of enrichment results over time. On average, the results would display a 60% similarity after only 2 years.
However, cases were found where the similarity would drop 80% within the same year, demonstrating the impact that the instability has on such applications. In conclusion, the results of this work will prove useful for those who use the annotations to interpret their studies, allowing them to assess their reliability on a case-by-case basis.

Meta-analysis of Human Methylomes Reveals Stably Methylated Sequences Surrounding CpG Islands Associated with High Gene Expression (2014)

Cell type marker enrichment across brain regions and experimental conditions (2013)

Characterization of gene expression patterns in wild Pacific salmon (2013)

Meta-analysis of gene expression in individuals with autism spectrum disorders (2013)

Wide-scale comparison of transcriptome data and the role of microRNA in major depression and suicide (2011)

The first chapter of this thesis addresses a common problem in genomics experiments: interpreting a resulting "hit list" of interesting genes. We present work on an approach for summarizing and exploring "hit lists" that makes use of the large amount of gene expression data in public repositories such as the Gene Expression Omnibus. We compare the query list with datasets that we have analyzed for differential expression of genes. Studies that have similarities to the given hit list yield additional insights, help contextualize studies, and serve as a basis for future meta-analysis. A conceptually similar problem that we addressed is the classification or clustering of datasets based on patterns of differential expression. Both problems required a method for determining distances between datasets based on rankings of genes. We tested and benchmarked several methods using manually annotated datasets. The method that performed best according to our evaluation process is based on Kendall's Tau top-k distance. We investigated potential sources of confounds, finding that the largest challenge may be posed by the high prevalence of certain gene expression patterns. These highly prevalent patterns tended to dominate search results. Nonetheless, we demonstrated the effectiveness of this approach in a case study. In the second chapter, we investigated the role of microRNAs in the context of major depression and suicide. We profiled microRNA and messenger RNA levels in post-mortem prefrontal cortex and hippocampus brain tissue of depressed suicides, suicides, and controls. In the prefrontal cortex, we found miR-1202 to be down-regulated in suicides versus controls, and LCT (lactase enzyme) was up-regulated in suicides or depressed suicides compared to controls. The former result was independently confirmed using quantitative PCR. 
While further study is needed, our results have the potential to provide insight into molecular changes in the brains of depressed and suicidal individuals.

Evaluating Coexpression Analysis for Gene Function Prediction (2010)

Microarray expression data sets vary in size, data quality and other features, but most methods for selecting coexpressed gene pairs use a ‘one size fits all’ approach. There have been many different procedures for selecting coexpressed gene pairs of high functional similarity from an expression dataset. However, it is not clear which procedure performs best as there are few studies reporting comparisons of these approaches. The goal of this thesis is to develop a set of “best practices” in order to select coexpression links of high functional similarity from an expression dataset, along with methods for identifying datasets likely to yield poor information. With these goals, we hope to improve the quality of gene function predictions produced by coexpression analysis. Using 80 human expression datasets we examined the impact of different thresholds, correlation metrics, expression data filtering and transformation procedures on performance in functional prediction. We also investigated the relationship between data quality and other features of expression datasets and their performance in functional prediction. We used the annotations of the Gene Ontology as a primary metric to measure similarity in gene function, and employed additional functional metrics for validation. Our results show that several dataset features have a greater influence on the performance in functional prediction than others. Expression datasets which produce coexpressed gene pairs of poor functional quality can be identified by a similar set of data features. Some procedures used in coexpression analysis have a negligible effect on the quality of functional predictions while others are essential to achieving the best performance in the algorithm. We also find that some procedures interact greatly with features of expression datasets and that these interactions increase the number of high quality coexpressed gene pairs retrieved through coexpression analysis.
This thesis uncovers important information on the many intrinsic and extrinsic factors that influence the performance in functional prediction of coexpression analysis. The information summarized here will help guide future studies using coexpression analysis and improve the quality of gene function predictions.

Membership Status

Member of G+PS

Program Affiliations

Neuroscience

Bioinformatics

Genome Science and Technology

Academic Unit(s)

Department of Psychiatry

Michael Smith Laboratories
