An open and integrative data science system
Introducing the National Microbiome Data Collaborative (NMDC), an initiative to empower the research community to harness microbiome data exploration and discovery through a collaborative integrative data science ecosystem.
Join our vision
Subscribe to be the first to know about the latest news and developments.
Advancing microbiome science, together
Founded to support the long-term advancement of microbiome science, the NMDC is building an agile, integrated data ecosystem. Our scientific mission is to provide comprehensive discovery of and access to multi-omics microbiome data. The long-term vision of the NMDC is to support microbiome data exploration through a sustainable data discovery portal that promotes open science and shared-ownership.
Supporting a collaborative data ecosystem
The NMDC seeks to connect and engage the microbiome community by unlocking new possibilities with microbiome data science. Focusing on broad and inclusive activities we aim to empower participation in data curation, discovery, and analytical processes using active partnerships among the research community. This regular and continuous engagement ensures that development aligns and adapts to the current and future needs of the research community.
Building community-supported microbiome data science capabilities
Map existing ontologies and standard vocabularies to the rich contextual metadata used to describe sample collection and processing.
Use ontology mapping tools and curation resources to enable automated annotation of standardized metadata to adhere to the FAIR principles.
Develop microbiome workflows for metagenome, metatranscriptome, metaproteome, and metabolomics data processing leveraging high performance compute (HPC) systems
Generate and integrate interoperable and reusable microbiome data from pilot data providers
Iteratively develop a graphical web-based interface that streamlines search, data exploration, and discovery.
Provide access to FAIR multidisciplinary data and standardized, reproducible data products for comparative analyses.
Building a world where data are FAIR
Over the past few decades, microbiome data have grown exponentially. However, the sheer amount of data available presents a significant bottleneck for analysis and interpretation. The NMDC makes these data findable, accessible, interoperable, and reusable (FAIR) by:
Ensure all data registered within NMDC are human and machine readable
Identify data sets that are available, including any authentication and authorization requirements
Provide provenance, metadata, and uniformly processed data, we are lowering the barriers to making data interoperable
Enable download of data, data products, and workflows for external reprocessing