Logo des Robert Koch-InstitutLogo des Robert Koch-Institut
Publikationsserver des Robert Koch-Institutsedoc
de|en
Publikation anzeigen 
  • edoc Startseite
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • Publikation anzeigen
  • edoc Startseite
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • Publikation anzeigen
JavaScript is disabled for your browser. Some features of this site may not work without it.
Gesamter edoc-ServerBereiche & SammlungenTitelAutorSchlagwortDiese SammlungTitelAutorSchlagwort
PublizierenEinloggenRegistrierenHilfe
StatistikNutzungsstatistik
Gesamter edoc-ServerBereiche & SammlungenTitelAutorSchlagwortDiese SammlungTitelAutorSchlagwort
PublizierenEinloggenRegistrierenHilfe
StatistikNutzungsstatistik
Publikation anzeigen 
  • edoc Startseite
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • Publikation anzeigen
  • edoc Startseite
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • Publikation anzeigen
2020-07-13Zeitschriftenartikel
ganon: precise metagenomics classification against large and up-to-date sets of reference sequences
Piro, Victor C.
Dadi, Temesgen H.
Seiler, Enrico
Reinert, Knut
Renard, Bernhard Y.
Motivation The exponential growth of assembled genome sequences greatly benefits metagenomics studies. However, currently available methods struggle to manage the increasing amount of sequences and their frequent updates. Indexing the current RefSeq can take days and hundreds of GB of memory on large servers. Few methods address these issues thus far, and even though many can theoretically handle large amounts of references, time/memory requirements are prohibitive in practice. As a result, many studies that require sequence classification use often outdated and almost never truly up-to-date indices. Results Motivated by those limitations, we created ganon, a k-mer-based read classification tool that uses Interleaved Bloom Filters in conjunction with a taxonomic clustering and a k-mer counting/filtering scheme. Ganon provides an efficient method for indexing references, keeping them updated. It requires <55 min to index the complete RefSeq of bacteria, archaea, fungi and viruses. The tool can further keep these indices up-to-date in a fraction of the time necessary to create them. Ganon makes it possible to query against very large reference sets and therefore it classifies significantly more reads and identifies more species than similar methods. When classifying a high-complexity CAMI challenge dataset against complete genomes from RefSeq, ganon shows strongly increased precision with equal or better sensitivity compared with state-of-the-art tools. With the same dataset against the complete RefSeq, ganon improved the F1-score by 65% at the genus level. It supports taxonomy- and assembly-level classification, multiple indices and hierarchical classification.
Dateien zu dieser Publikation
Thumbnail
Piro-2020-ganon_ precise metagenomics classifi.pdf — PDF — 1.069 Mb
MD5: b5e193dd8cfe0c86688c9e1eb4a5510d
Zitieren
BibTeX
EndNote
RIS
(CC BY 3.0 DE) Namensnennung 3.0 Deutschland(CC BY 3.0 DE) Namensnennung 3.0 Deutschland
Zur Langanzeige
Nutzungsbedingungen Impressum Leitlinien Datenschutzerklärung Kontakt

Das Robert Koch-Institut ist ein Bundesinstitut im

Geschäftsbereich des Bundesministeriums für Gesundheit

© Robert Koch Institut

Alle Rechte vorbehalten, soweit nicht ausdrücklich anders vermerkt.