Logo of Robert Koch InstituteLogo of Robert Koch Institute
Publication Server of Robert Koch Instituteedoc
de|en
View Item 
  • edoc-Server Home
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • View Item
  • edoc-Server Home
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.
All of edoc-ServerCommunity & CollectionTitleAuthorSubjectThis CollectionTitleAuthorSubject
PublishLoginRegisterHelp
StatisticsView Usage Statistics
All of edoc-ServerCommunity & CollectionTitleAuthorSubjectThis CollectionTitleAuthorSubject
PublishLoginRegisterHelp
StatisticsView Usage Statistics
View Item 
  • edoc-Server Home
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • View Item
  • edoc-Server Home
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • View Item
2017-01-04Zeitschriftenartikel DOI: 10.1038/srep39194
PaPrBaG: A machine learning approach for the detection of novel pathogens from NGS data
Deneke, Carlus
Rentzsch, Robert
Renard, Bernhard Y.
The reliable detection of novel bacterial pathogens from next-generation sequencing data is a key challenge for microbial diagnostics. Current computational tools usually rely on sequence similarity and often fail to detect novel species when closely related genomes are unavailable or missing from the reference database. Here we present the machine learning based approach PaPrBaG (Pathogenicity Prediction for Bacterial Genomes). PaPrBaG overcomes genetic divergence by training on a wide range of species with known pathogenicity phenotype. To that end we compiled a comprehensive list of pathogenic and non-pathogenic bacteria with human host, using various genome metadata in conjunction with a rule-based protocol. A detailed comparative study reveals that PaPrBaG has several advantages over sequence similarity approaches. Most importantly, it always provides a prediction whereas other approaches discard a large number of sequencing reads with low similarity to currently known reference genomes. Furthermore, PaPrBaG remains reliable even at very low genomic coverages. CombiningPaPrBaG with existing approaches further improves prediction results.
Files in this item
Thumbnail
25fDpe5t1iHbE.pdf — Adobe PDF — 597.4 Kb
MD5: 86b156db03b2f9667d3d22326154e67e
Cite
BibTeX
EndNote
RIS
No license information
Details
Terms of Use Imprint Policy Data Privacy Statement Contact

The Robert Koch Institute is a Federal Institute

within the portfolio of the Federal Ministry of Health

© Robert Koch Institute

All rights reserved unless explicitly granted.

 
DOI
10.1038/srep39194
Permanent URL
https://doi.org/10.1038/srep39194
HTML
<a href="https://doi.org/10.1038/srep39194">https://doi.org/10.1038/srep39194</a>