One Codex Releases New, Largest Ever Database

Today at the Advances in Genome Biology & Technology (AGBT) conference, we are excited to unveil the largest searchable database of microbial genomes. Curated from its larger collection of hundreds of thousands of genomes, the latest One Codex Database includes >80,000 genomes and provides unprecedented sensitivity and specificity for metagenomic applications.

The latest One Codex Database enables:

  • Highly sensitive identification of microbes in complex samples
  • Precise quantification of microbial abundances from whole genome sequencing (WGS) data
  • Community-wide characterization of complex microbial samples, including the human microbiome

Spanning the tree of life

Our latest release includes over 80,000 genomes, representing more than 43,000 distinct species and 69,000 strains across all microbial domains. We’ve increased coverage of all branches of the tree of life, with nearly 1,700 fungal genomes, another 1,700 archaeal genomes, 27,000 viruses, and more than 53,000 bacteria. Collectively, this database provides the most comprehensive snapshot of complex microbial samples.

For example, the new database classified 52% of an extremely complex environmental sample, 18% more than our 2017 database. This also represents a 3-fold increase in the proportion of classified reads over the NCBI RefSeq collection.

Get started today

You can run your samples against our new reference database today. All new samples uploaded will automatically be analyzed against the latest reference database (see our getting started guide). You may also re-analyze any previously uploaded samples on our jobs page.

Using metagenomic shotgun sequencing paired with our classification pipeline and comprehensive database, you can now get a closer, more accurate look at your microbiome data than ever before. We’re excited to continue expanding our reference collection and enabling more comprehensive, specific analyses of shotgun microbiome data as the field continues to rapidly evolve (and more genomes are sequenced!).

Please feel free to reach out to us by email or on Twitter @onecodex if you have any questions or we can help with your data.

— The One Codex Team

← Back to the One Codex blog Academic Study Data Shows One Codex’s Superior Performance for Metagenomic Analysis →