Steven Hallam


Relevant Degree Programs

Affiliations to Research Centres, Institutes & Clusters


Postdoctoral Fellows

Graduate Student Supervision

Doctoral Student Supervision (Jan 2008 - Nov 2020)
Machine learning methods for metabolic pathway inference from genomic sequence information (2020)

Metabolic pathway prediction within and between cells from genomic sequence information is an integral problem in biology linking genotype to phenotype. This is a prerequisite to both understanding fundamental life processes and ultimately engineering these processes for specific biotechnological applications. A pathway prediction problem exists because we have limited knowledge of the reactions and pathways operating in cells even in model organisms like Escherichia coli where the majority of protein functions are determined. Consequently, over the past decades several computational tools were developed to automate the reconstruction of pathways given enzymes obtained from genomes. Unfortunately, with an ever-increasing rate in the content and diversity of publicly available genomics and metagenomics datasets, those algorithms, to this date, experience more prominent and complex problems. These include incapability of systemically solving meta-level noise, neglecting pathway interactions, not considering vagueness associated with enzymes, and inadequate to scale to heterogeneous genomic datasets. In an attempt to resolve the aforementioned problems, this thesis examines multiple pathway prediction models given a list of enzymes based on multi-label learning approaches. Specifically, it first introduces mlLGPR that encodes manually designed enzyme and pathway properties to reconstruct pathways. Then, it proposes triUMPF, a more advanced model, that characterizes interactions among pathways and enzymes, jointly, with community detection from enzyme and pathway networks to improve the precision of predictions. This requires pathway2vec, a novel representation learning model, to automatically generate features aiding triUMPF’s prediction process. Next, the thesis presents leADS that subselects more impacted examples from a dataset to increase the pathway sensitivity performance. This model may rely on reMap, a novel relabeling algorithm, that incorporates the bag concept which is composed of correlated pathways to articulate missing pathways from data. Finally, all these models are integrated into a unified framework, mltS, to achieve the desired balance between sensitivity and precision outputs while assigning a confidence score to each model. The applicability of these models to recover pathways at the individual, population, and community levels of organization were examined against the traditional inference algorithms using benchmark datasets, where all the proposed models demonstrated accurate predictions and outperformed the previous approaches.

View record

A multi-omics perspective on microbial mediated methane oxidation in the Saanich Inlet water column (2019)

Microbial communities play an integral role in the biogeochemical cycling of carbon, nitrogen and sulfur throughout the biosphere. These communities interact, forming metabolic networks that change and adapt in response to availability of electron donors and acceptors. Oxygen minimum zones (OMZs) are regions of the ocean where oxygen (O₂) is naturally depleted. In OMZs microbial communities use alternative terminal electron acceptors such as nitrate, sulfate and carbon dioxide, resulting in fixed nitrogen loss and production of greenhouse gases including methane (CH₄). In this thesis, I explored microbial community structure, dynamics and metabolic interactions as they relate to CH₄ cycling in Saanich Inlet, a seasonally anoxic fjord on the coast of British Columbia Canada that serves as a model ecosystem for studying microbial processes in OMZs. Leveraging decadal time series observations in Saanich Inlet, I developed a geochemical dataset consisting of nutrient and gas measurements, coupled with multiomic (DNA, RNA and protein) sequence information to chart microbial community structure and dynamics along defined redox gradients. I conducted methods optimization comparing in situ and on-ship samplingparadigms and used correlation analysis to infer putative microbial interaction networks in relation to water column CH4 oxidation. Methanotrophic bacteria in Saanich Inlet were identified associated with three uncultivated Gammaproteobacteria clades termed OPU1, OPU3 and symbiont-related that partitioned in the water column during periods of prolonged stratification. Water column distribution of the OPU3 clade was found to correlate with nitrite (NO₂-). Based on these results, I conducted incubations with labelled CH₄ and NO₂- to test this correlation and constrain potential metabolic interactions between methanotrophs and other one-carbon utilizing microorganisms under low O₂ conditions. Using multi-omic information derived from these incubations I confirmed the role of OPU3 in coupling CH₄ oxidation to NO₂- reduction and uncovered potential metabolic interactions between OPU3 and other co-occurring microorganisms including Methylophilales,Planctomycetes and Bacteroidetes. Evidence for a communal function in CH₄ oxidation expandsthe role of OPU3 in the global carbon budget and provides a conceptual foundation for the development of numerical models to predict CH₄ flux from OMZs as they expand throughout the global ocean.

View record

A multi-omics approach to microbial nitrogen and sulfur cycling in the oxygen starved ocean (2018)

Microbial communities mediate biogeochemical processes of Carbon (C), Nitrogen (N) and Sulfur (S) cycling in the ocean on global scales. Oxygen (O₂) availability is a key driver in these processes and shapes microbial community structure and metabolisms. As O₂ decreases, microbes utilize alternative terminal electron acceptors, nitrate (NO₃–), nitrite, sulfate and carbon dioxide, depleting biologically available nitrogen and producing greenhouse gases nitrous oxide (N₂O) and methane (CH₄). Marine oxygen minimum zones (OMZs) are areas of O₂-depletion (O₂
View record

Harnessing natural diversity for the discovery of glycoside hydrolases and design of new glycosynthases (2018)

Plant biomass offers a sustainable source for energy and materials and an alternative to fossil fuels. However, the industrial scale production or biorefining of fermentable sugars from plant biomass is currently limited by the lack of cost effective and efficient biocatalysts. Microbes, the earth's master chemists - employing biocatalytic solutions to harvest energy, and transform this energy into useful molecules - offer a potential solution to this problem. However, a majority of microbes remain uncultured, limiting our access to the genetic potential encoded within their genomes. This has spurred the development of culture independent methods, termed metagenomics. In this thesis I harnessed high-throughput functional metagenomic screening to discover biomass deconstructing biocatalysts from uncultured microbial communities. Towards this goal, twenty-two clone libraries containing DNA sourced from diverse microbial communities inhabiting terrestrial and aquatic ecosystems were screened with 4-methylumbelliferyl cellobioside to detect glycoside hydrolase activity. This revealed 178 active clones containing glycoside hydrolases, often in gene clusters. This set of active clones was consolidated and further characterized through sequencing and rapid, plate-based, biochemical assays. Additionally, libraries sourced from beaver fecal and gut microbiomes were screened with four fluorogenic probes (6-chloro-4-methylumbelliferyl derivatives of cellobiose, xylobiose, xylose and mannose) for glycoside hydrolase activity. This revealed a total of 247 active fosmid-harbouring clones, that encoded many polysaccharide-degrading genes and gene cassettes. Specific candidate genes from the fecal library were sub-cloned, and the resulting purified enzymes were shown to be involved in synergistic degradation of arabinoxylan oligomers. The clone libraries that were generated through functional metagenomic screening were then employed to reveal the promiscuity of glycoside hydrolases towards unnatural azido- and aminoglycosides. Promiscuous enzymes identified from metagenomic and synthetic clone libraries were then used as a starting point for the generation of new glycosynthases capable of incorporating modified glucosides and galactosides. The resulting set of eight new glycosynthases are capable of synthesizing di- and trisaccharides, glycolipids and inhibitors such as 2,4-dinitrophenyl 4'-amino-2,4'-dideoxy-2-fluoro-cellobioside. Taken together this work has exploited the power of functional metagenomics to reveal new modes of biocatalysis and develop new synthetic tools.

View record

Functional metagenomic screening for glycoside hydrolases (2017)

Limitations on the cultivation of a majority of naturally occurring microbes have spurred the rise of culture-independent methods for the investigation of environmental microbial communities, a field known as metagenomics. This thesis addresses both functional and informatic approaches to metagenomics with the aim of improving our knowledge of carbohydrate degradation. A high throughput functional metagenomic screen was developed and applied to over 350,000 fosmid clones to search for glycoside hydrolases (GHs) in metagenomic libraries. Screening yielded 798 fosmid clones capable of hydrolyzing a model sugar compound, and the genes responsible were subcloned and biochemically characterized for pH and temperature stability, and substrate specificity. The combination of functional and in silico methods developed were used in a longitudinal study of the beaver (Castor canadensis) digestive tract, in order to gain insight into the sequential degradation of biomass. A linear model was used to identify enrichment of endo-acting versus exo-acting GH families at five locations throughout the digestive tract. The discovery of high numbers of GH43 family genes on functionally identified fosmids resulted in their combination with all other known GH43 genes in order to create subfamily classifications that provide finer resolution of enzyme activities. This classification system resulted in an improved ability to assign functional characteristics to enzymes identified through informatic studies. Of the 37 subfamilies created, only 22 contained a characterized enzyme. Fosmids identified earlier in this work harboured genes from four uncharacterized GH43 subfamilies, and future characterization efforts will further our understanding of the GH43 family. Altogether, the developed methods provide a framework for future studies of biomass degradation and improve the power of both functional and in silico metagenomics.

View record

Unearthing the influence of soil organic matter removal on population structure and metabolism of soil microbial communities (2017)

Microorganisms are the stewards and creators of Earth's ecosystems, driving planetary nutrient and energy cycles. As such, the interactions and metabolic processes of microbial communities have emerged as a fundamental area of scientific research. Through the use of multi-omic (e.g.; metagenomics, metatranscriptomics, proteomics) sequence information, it is possible to reconstruct the compositional, regulatory, and distributed metabolic processes connecting microbial community members. This dissertation develops an interpretative framework for the joint analysis of compositional network patterns and metabolic pathway reconstruction using metagenomic and metatranscriptomic data from soils collected from lodgepole pine forests 13 years post-harvesting and soil organic matter removal and adjacent undisturbed lodgepole pine forests and Interior Douglas-fir forests at two Long Term Soil Productivity (LTSP) sites located near Kamloops, and Williams Lake B.C., Canada. Further, this dissertation serves to improve the accuracy with which environmental sequence data are analyzed by leveraging the work of statisticians to overcome known biases in canonical approaches to data normalization. Finally, this work provides a systematic approach to the functional annotation of unassembled data, and extends an existing diversity index to accept data types common in studies involving uncultivated microbial annotation. Together the data indicated spatiotemporal variation in, and forest harvesting impact on, metabolic interactions and genomic potential for plant biomass degradation and carbon cycling. However, redundant metabolic capacity combined with genetic variation within the microbial community ensures natural and anthropogenically-induced environmental change had disparate effects across community members thereby moderating the consequences of localized extinctions or niche space reduction, and guarding against the loss of metabolic functions within the soil ecosystem. Indeed, environmental change can result in the reshuffling of trophic relationships and information exchange (e.g.; H+, metabolites, and horizontal gene transfer) allowing new and novel interactions between organisms to form and increase the community's ability to tolerate disturbance. This work represents an important step in understanding how environmental changes impact microbial communities and ecosystem function within the soil milieu. Ultimately, data and findings from this dissertation can be integrated with future analyses of biogeochemical parameter information and thermodynamic principles to enable time variable forecasts of microbial adaptive response to environmental change.

View record

Illuminating Phylogeny and Function of Microbial Dark Matter in Sakinaw Lake (2015)

Microorganisms are the most abundant and diverse forms of life on Earth. Interconnected microbial communities drive matter and energy transformations integral to ecosystem functions and services through distributed metabolic networks innovated over 3.5 billion years of evolution. To effectively harness this metabolic potential it is necessary to chart uncultivated microbial community structure and function. Cultivation-independent studies indicate that over half of the microbial diversity on Earth belongs to uncultivated candidate divisions, also known as microbial dark matter (MDM). Here, I illuminate MDM structure and function in meromictic Sakinaw Lake on the Sunshine Coast of British Columbia Canada. Sakinaw Lake water column conditions quantified over 8 field campaigns were intimately associated with unprecedented abundance and diversity of MDM. Using network analysis and single cell genomics, co-occurrence patterns between MDM including OP9/JS-1, OP8 and methanogenic Archaea were linked with potential to perform syntrophic acetate oxidation, an important process in anaerobic digestion of organic matter in natural and engineered ecosystems. Single-cell and metagenome analysis revealed previously unrecognized nitrate reduction potential in candidate division OP3 and uncovered a novel archaeal lineage. In addition, numerous Fe-S oxidoreductases associated with MDM in Sakinaw Lake indicate the potential to couple sulfur oxidation to iron reduction. Taken together, my work establishes Sakinaw Lake as a natural laboratory in which to explore MDM structure and function, shines a spotlight on known and novel interactions and metabolic capabilities among these most enigmatic microorganisms, and points to potential biotechnological innovations based on cooperative interactions between MDM populations.

View record

MetaPathways: a molecular pipeline for the analysis of environmental sequence information (2015)

The lack of cultivated reference strains for the majority of naturally occurring microorganisms has lead to the development of plurality sequencing methods and the field of metagenomics, offering a glimpse into the genomes of this so-called 'microbial dark matter' (MDM). An explosion of sequencing initiatives has followed, attempting to capture and extract biological meaning from MDM across a wide range of ecosystems from deep-sea vents and polar seas to waste-water bioreactors and human beings. Current analytic approaches focus on taxonomic structure and metabolic potential through a combination of phylogenetic anchor screening of the small subunit ribosomal RNA gene (SSU or 16S rRNA) and general sequence searches using homology-based inference. Though much has been learned about microbial diversity and metabolic potential within natural and engineered ecosystems using these approaches, they are insufficient to resolve the ecological relationships that couple nutrient and energy flow between community members - ultimately translating into ecosystem functions and services. This shortcoming arises from a combination of data-intensive challenges presented by environmental sequence information that span processing, integration, and interpretation steps, and a general lack of robust statistical and analytical methods to directly address these problems.This dissertation addresses some of these shortcomings through the development of a modular analytical pipeline, MetaPathways, allowing for the large-scale and systematic processing and integration of many forms of environmental sequence information. MetaPathways is built to scale, comparing hundreds of metagenomic samples through the efficient use of data structures, grid compute models, and interactive data query. Moreover, it attempts to bring functional analysis back to the metabolic map through the creation of environmental pathway/genome databases (ePGDBs), adopting the Pathway Tools software for metabolic pathway prediction on the MetaCyc encyclopedia of genes and genomes. ePGDBs and the pathway-centric approach are validated to provide known and novel insights into community structure and function. Finally, novel taxonomic and metabolic methods supporting the pathway-centric model are derived and demonstrated, and enhance Pathway Tools as a framework for engineering microbial communities and consortia.

View record

Molecular mechanisms of host-symbiont recognition in a highly specific sponge-archaeal symbiosis (2014)

Evolution of multicellular eukaryotes is intimately associated with microbial interactions resulting in diversification and niche expansion. This long history of co-evolution is evident in metabolic interdependence, and reliance of animal (i.e. metazoan) ecosystems on their microbiota for healthy development and function. Specific recognition between interacting partners is essential for establishing and successfully maintaining interspecies associations, and involves host immunity and symbiont-encoded factors. Sponges represent the most deeply branching animal phylum with the potential to shed new light on the evolution of innate immunity and host-microbe interactions within the metazoa. Marine sponges harbour diverse microbial communities that contribute to higher order ecosystem functions including primary production and nutrient cycling. However, molecular mechanisms mediating symbiont recognition and host immune signalling in sponge symbioses are unknown. This knowledge gap stems from the fact that most sponge-associated microbes remain uncultivated and no sponge host/symbiont culture systems exist. In this thesis, I used cultivation-independent approaches including environmental genomics, transcriptomics, and proteomics in combination with homology modeling and community composition profiling to identify molecular determinants of sponge symbiosis in the sponge Dragmacidon mexicanum. Community composition profiling indicated that D. mexicanum is a high microbial abundance sponge harbouring a specific microbial community dominated by the Thaumarchaeaote Cenarchaeum symbiosum. Comparative genomics and gene expression profiling identified potential symbiont-encoded proteins including serine protease inhibitors (serpins) with the potential to mediate host-microbe interactions that were not found in closely related free-living Thaumarchaeaota, consistent with C. symbiosum’s adaptation to a symbiotic lifestyle. Biochemical assays were subsequently used to characterize serpin activity and infer function. Immunity determinants previously unreported in sponges were identified, enabling near-complete reconstruction of innate immune signalling pathways and partial adaptive immunity pathways. Thus, this work expanded the known complexity of sponge immune signalling and suggests a more ancient origin of certain pathways than previously recognized. The composition of sponge innate immunity may reflect the complex nature of sponge-associated microbiota, which likely acquired adaptive features to thrive in the host milieu. Taken together, this thesis provides novel insights into the evolution of host-microbial recognition, archaeal adaptations to a symbiotic lifestyle, and molecular interactions between archaea and eukaryotic cells.

View record

Microbial community structure and ecology of Marine Group A Bacteria in the oxygen minimum zone of the Northeast subarctic Pacific Ocean (2013)

Oxygen minimum zones (OMZs) are intrinsic water column features that arise when the respiratory oxygen (O2) demand during microbial remineralization of organic matter exceeds O2 supply rates in poorly ventilated regions of the ocean. Microbial processes play a key role in mediating biogeochemical cycling of nutrients and radiatively active trace gases in OMZs. Specific roles of individual microbial groups and the ecological interactions among groups that drive OMZ biogeochemistry on a global scale, however, remain poorly constrained. This dissertation focuses on describing microbial community structure in the world’s largest and least studied OMZ, located in the Northeast subarctic Pacific Ocean (NESAP), with a specific emphasis on characterizing the ecology of Marine Group A, an uncultivated candidate phylum of bacteria found to be prevalent in this region. To begin, I performed a survey of microbial community structure in the NESAP at two time points and over a range of depths based on traditional ecological analyses. I applied techniques derived from network theory to identify co-occurrence patterns among microbial groups within the NESAP and determined that MGA bacteria most frequently co-occurred with other MGA bacteria, suggesting that intra-phylum interactions may play a role in governing microbial processes in this region. Through analysis of small subunit ribosomal rRNA (SSU rRNA) gene sequences affiliated with MGA, I identified 8 novel subgroups and established the phylogeny and population structure of both novel and previously detected MGA subgroups. Finally, I provided first insights into the metabolic capacity of this little-known candidate phylum through investigations of metagenomic data obtained from NESAP waters. Analysis of large-insert genomic DNA fragments derived from MGA revealed protein-coding genes associated with adaptation to oxygen deficiency and sulfur-based energy metabolism. These observations may implicate MGA bacteria in the cryptic sulfur cycle, recently discovered to play a central role in biogeochemical cycling within OMZs. This work describes the first survey of microbial community structure in the NESAP OMZ and the first application of co-occurrence networks to study the ecology of deep ocean microbial communities, in addition to the first analysis of the diversity, population structure, and metabolic capacity of the enigmatic bacterial lineage MGA.

View record

Master's Student Supervision (2010 - 2018)
Precise correlation and metagenomic binning uncovers fine microbial community structure (2017)

Bacteria and Archaea represent the invisible majority of living things on Earth with an estimated numerical abundance exceeding 10^30 cells. This estimate surpasses the number of grains of sand on Earth and stars in the known universe. Interdependent microbial communities drive fluxes of matter and energy underlying biogeochemical processes, and provide essential ecosystem functions and services that help create the operating conditions for life. Despite their abundance and functional imperative, the vast majority of microorganisms remain uncultivated in laboratory settings, and therefore remain extremely difficult to study. Recent advances in high-throughput sequencing are opening a multi-omic (DNA and RNA) window to the structure and function of microbial communities providing new insights into coupled biogeochemical cycling and the metabolic problem solving power of otherwise uncultivated microbial dark matter (MDM). These technological advances have created bottlenecks with respect to information processing, and innovative bioinformatics solutions are required to analyze immense biological data sets. This is particularly apparent when dealing with metagenome assembly, population genome binning, and network analysis.This work investigates combined use of single-cell amplifed genomes (SAGs) and metagenomes to more precisely construct population genome bins and evaluates the use of covariance matrix regularization methods to identify putative metabolic interdependencies at the population and community levels of organization. Applying dimensional reduction with principal components and a Gaussian mixture model to k-mer statistics from SAGs and metagenomes is shown to bin more precisely, and has been implemented as a novel pipeline, SAG Extrapolator (SAGEX). Also, correlation networks derived from small subunit ribosomal RNA gene sequences are shown to be more precisely inferred through regularization with factor analysis models applied via Gaussian copula. SAGEX and regularized correlation are applied toward 368 SAGs and 91 metagenomes, postulating populations’ metabolic capabilities via binning, and constraining interpretations via correlation. The application describes coupled biogeochemical cycling in low-oxygen waters. Use of SAGEX leverages SAGs’ deep taxonomic descriptions and metagenomes’ breadth, produces precise population genome bins, and enables metabolic reconstruction and analysis of population dynamics over time. Regularizing correlation networks overcomes a known analytic bottleneck based in precision limitations.

View record

A metagenomic approach to study flavin mediated public good dynamics in hydrocarbon resource environments (2015)

Emerging lines of evidence indicate that microbes form distributed networks of metabolite exchange based in part on public goods. These networks have the potential to drive the evolution of microbial lineages, and contribute to essential functions and services in natural and engineered ecosystems. However, experimental systems in which to evolve and perturb public good dynamics remain poorly constrained. Here, a functional metagenomic screening approach was used to recover abundant biosynthetic gene clusters with the potential to mediate microbial interactions in the environment. Specifically, 29 gene clusters involved in the production of riboflavin sourced from diverse microbial donor genotypes were recovered by functional screening from two fosmid libraries constructed from methanogenic communities enriched on hydrocarbons. Active clones were sequenced and riboflavin encoding gene cassettes were verified using cluster subcloning. Focusing on observed relationships between mobile genetic elements, metabolite secretion patterns and gene frequency distributions, a role for riboflavin as a public good in hydrocarbon-enriched environments was posited. Overall, this work suggests that secreted riboflavin may have versatile and unrecognized roles in microbial hydrocarbon transformation with potential to modulate microbial community dynamics in hydrocarbon resource environments

View record

Exploring microbial community structure and resilience through visualization and microbial co-occurrence networks (2015)

Cultivation independent microbial ecology research relies on high throughput sequencing technologies and analytical methods to resolve the infinite diversity of microbial life on Earth. Microorganisms live in communities driven by genetic and metabolic processes as well as symbiotic relationships. Interconnected communities of microorganisms provide essential functions in natural and human engineered ecosystems. Modelling the community as an inter-connected system can give insight into the community's functional characteristics related to the biogeochemical processes it performs. Network science resolves associations between elements of structure to notions of function in a system and has been successfully applied to the study of microbial communities and other complex biological systems. Microbial co-occurrence networks are inferred from community composition data to resolve structural patterns related to ecological properties such as community resilience to disturbance and keystone species. However, the interpretation of global and local network properties from an ecological standpoint remains difficult due to the complexity of these systems creating a need for quantitative analytical methods and visualization techniques for co-occurrence networks. This thesis tackles the visualization and analytical challenges of modelling microbial community structure from a network science approach. First, Hive Panel Explorer, an interactive visualization tool, is developed to permit data driven exploration of topological and data association patterns in complex systems. The effectiveness of Hive Panel Explorer is validated by resolving known and novel patterns in a model biological network, the C. elegans connectome. Second, network structural robustness analysis methods are applied to study microbial communities from timber harvested forest soils from a North American longterm soil productivity study. Analyzing these geographically dispersed soils reveals biogeographic patterns of diversity and enables the discovery of conserved organizing principles shaping microbial community structure. The capacity of robustness analysis to identify key microbial community members as well as model shifts in community structure due to environmental change is demonstrated. Finally, this work provides insight into the relationship between microbes and their ecosystem, and characterizing this relationship can help us understand the organization of microbial communities, survey microbial diversity and harness its potential.

View record

Exploring microbial controls on methane cycling using ML TreeMap (2013)

Biological methane (CH₄) production, or methanogenesis, plays a crucial role in the global carbon cycle. Biologically generated CH₄ can be emitted into the atmosphere, where it acts as greenhouse gas twenty five times more potent than carbon dioxide (CO₂), stored as “methane ice” (clathrates or hydrates) in marine sediments along continental margins, or oxidized under aerobic or anaerobic conditions by microbial agents effectively limiting atmospheric flux. Methanogenesis is orchestrated by a group of obligate anaerobic archaea within the phylum Euryarchaeota, known as the methanogens that produce CH₄ as the end product of their energy metabolism. Three methanogenic pathways have been described including the hydrogenotrophic, methylotrophic and aceticlastic. Although differing in their use of electron donors, all three pathways converge on a terminal step catalyzed by the heterohexameric methyl-coenzyme M reductase (MCR). Over the last decade cultivation-independent studies have identified anaerobic methane-oxidizing archaea (ANME-1, 2 and 3) related methanogens that appear to run one or more canonical methanogenic pathways in reverse including the terminal step catalyzed by MCR. The three genes encoding MCR subunits, mcrA, mcrB and mcrG possess phylogenetic resolution similar to that of the small subunit ribosomal RNA gene making them useful functional markers for detection and differentiation of methanogenic and methane-oxidation pathways in natural and human engineered ecosystems. Here I introduce an automated and culture-independent method for monitoring the taxonomic structure and genomic potential of methane-cycling environments that leverages and improves upon an existing software package called MLTreeMap. MLTreeMap is a user-extensible software framework that automates maximum likelihood analysis to recover phylogenetic or functional marker genes from environmental sequence data. I first describe the taxonomic structure and pathway representation of methane-cycling environments on a global scale based on the identification of MCRA alleles. I then chart both the metabolic potential and gene expression of marine sediments supporting the anaerobic oxidation of CH₄ using a series of reference trees representing near complete methane-cycling pathways.

View record


If this is your researcher profile you can log in to the Faculty & Staff portal to update your details and provide recruitment preferences.


Explore our wide range of course-based and research-based program options!