Logo des Robert Koch-InstitutLogo des Robert Koch-Institut
Publikationsserver des Robert Koch-Institutsedoc
de|en
Publikation anzeigen 
  • edoc Startseite
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • Publikation anzeigen
  • edoc Startseite
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • Publikation anzeigen
JavaScript is disabled for your browser. Some features of this site may not work without it.
Gesamter edoc-ServerBereiche & SammlungenTitelAutorSchlagwortDiese SammlungTitelAutorSchlagwort
PublizierenEinloggenRegistrierenHilfe
StatistikNutzungsstatistik
Gesamter edoc-ServerBereiche & SammlungenTitelAutorSchlagwortDiese SammlungTitelAutorSchlagwort
PublizierenEinloggenRegistrierenHilfe
StatistikNutzungsstatistik
Publikation anzeigen 
  • edoc Startseite
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • Publikation anzeigen
  • edoc Startseite
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • Publikation anzeigen
2018-01-15Zeitschriftenartikel DOI: 10.1186/s12864-017-4401-3
seq-seq-pan: building a computational pan-genome data structure on whole genome alignment
Jandrasits, Christine
Dabrowski, Piotr Wojtek
Fuchs, Stephan
Renard, Bernhard Y.
Background: The increasing application of next generation sequencing technologies has led to the availability of thousands of reference genomes, often providing multiple genomes for the same or closely related species. The current approach to represent a species or a population with a single reference sequence and a set of variations cannot represent their full diversity and introduces bias towards the chosen reference. There is a need for the representation of multiple sequences in a composite way that is compatible with existing data sources for annotation and suitable for established sequence analysis methods. At the same time, this representation needs to be easily accessible and extendable to account for the constant change of available genomes. Results: We introduce seq-seq-pan, a framework that provides methods for adding or removing new genomes from a set of aligned genomes and uses these to construct a whole genome alignment. Throughout the sequential workflow the alignment is optimized for generating a representative linear presentation of the aligned set of genomes, that enables its usage for annotation and in downstream analyses. Conclusions: By providing dynamic updates and optimized processing, our approach enables the usage of whole genome alignment in the field of pan-genomics. In addition, the sequential workflow can be used as a fast alternative to existing whole genome aligners for aligning closely related genomes. seq-seq-pan is freely available at https://gitlab.com/rki_bioinformatics
Dateien zu dieser Publikation
Thumbnail
24umfISTjJDYs.pdf — PDF — 745.4 Kb
MD5: ffacfdea3f3961acc0360a3fef9ceee5
Zitieren
BibTeX
EndNote
RIS
Keine Lizenzangabe
Zur Langanzeige
Nutzungsbedingungen Impressum Leitlinien Datenschutzerklärung Kontakt

Das Robert Koch-Institut ist ein Bundesinstitut im

Geschäftsbereich des Bundesministeriums für Gesundheit

© Robert Koch Institut

Alle Rechte vorbehalten, soweit nicht ausdrücklich anders vermerkt.

 
DOI
10.1186/s12864-017-4401-3
Permanent URL
https://doi.org/10.1186/s12864-017-4401-3
HTML
<a href="https://doi.org/10.1186/s12864-017-4401-3">https://doi.org/10.1186/s12864-017-4401-3</a>