Logo des Robert Koch-InstitutLogo des Robert Koch-Institut
Publikationsserver des Robert Koch-Institutsedoc
de|en
Publikation anzeigen 
  • edoc Startseite
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • Publikation anzeigen
  • edoc Startseite
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • Publikation anzeigen
JavaScript is disabled for your browser. Some features of this site may not work without it.
Gesamter edoc-ServerBereiche & SammlungenTitelAutorSchlagwortDiese SammlungTitelAutorSchlagwort
PublizierenEinloggenRegistrierenHilfe
StatistikNutzungsstatistik
Gesamter edoc-ServerBereiche & SammlungenTitelAutorSchlagwortDiese SammlungTitelAutorSchlagwort
PublizierenEinloggenRegistrierenHilfe
StatistikNutzungsstatistik
Publikation anzeigen 
  • edoc Startseite
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • Publikation anzeigen
  • edoc Startseite
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • Publikation anzeigen
2024-05-28Zeitschriftenartikel
Sequencing accuracy and systematic errors of nanopore direct RNA sequencing
Liu-Wei, Wang
van der Toorn, Wiep
Bohn, Patrick
Hölzer, Martin
Smyth, Redmond P.
von Kleist, Max
Background: Direct RNA sequencing (dRNA-seq) on the Oxford Nanopore Technologies (ONT) platforms can produce reads covering up to full-length gene transcripts, while containing decipherable information about RNA base modifications and poly-A tail lengths. Although many published studies have been expanding the potential of dRNA-seq, its sequencing accuracy and error patterns remain understudied. Results: We present the first comprehensive evaluation of sequencing accuracy and characterisation of systematic errors in dRNA-seq data from diverse organisms and synthetic in vitro transcribed RNAs. We found that for sequencing kits SQK-RNA001 and SQK-RNA002, the median read accuracy ranged from 87% to 92% across species, and deletions significantly outnumbered mismatches and insertions. Due to their high abundance in the transcriptome, heteropolymers and short homopolymers were the major contributors to the overall sequencing errors. We also observed systematic biases across all species at the levels of single nucleotides and motifs. In general, cytosine/uracil-rich regions were more likely to be erroneous than guanines and adenines. By examining raw signal data, we identified the underlying signal-level features potentially associated with the error patterns and their dependency on sequence contexts. While read quality scores can be used to approximate error rates at base and read levels, failure to detect DNA adapters may be a source of errors and data loss. By comparing distinct basecallers, we reason that some sequencing errors are attributable to signal insufficiency rather than algorithmic (basecalling) artefacts. Lastly, we generated dRNA-seq data using the latest SQK-RNA004 sequencing kit released at the end of 2023 and found that although the overall read accuracy increased, the systematic errors remain largely identical compared to the previous kits. Conclusions: As the first systematic investigation of dRNA-seq errors, this study offers a comprehensive overview of reproducible error patterns across diverse datasets, identifies potential signal-level insufficiency, and lays the foundation for error correction methods.
Dateien zu dieser Publikation
Thumbnail
s12864-024-10440-w.pdf — PDF — 3.093 Mb
MD5: dd3daea64bd26626b89ca0295b7c8e8b
Zitieren
BibTeX
EndNote
RIS
(CC BY 3.0 DE) Namensnennung 3.0 Deutschland(CC BY 3.0 DE) Namensnennung 3.0 Deutschland
Zur Langanzeige

Verwandte Publikationen

Anzeige der Publikationen mit ähnlichem Titel, Autor, Urheber und Thema.

  • 2015-02-06Zeitschriftenartikel
    Isolation and Functional Characterization of the Novel Clostridium botulinum Neurotoxin A8 Subtype 
    Kull, Skadi; Schulz, Melanie; Strotmeier, Jasmin Weisemann née; Kirchner, Sebastian; Schreiber, Tanja; Bollenbach, Alexander; Dabrowski, Piotr Wojtek; Nitsche, Andreas; Kalb, Suzanne R.; Dorner, Martin; Barr, John R.; Rummel, Andreas; Dorner, Brigitte
    Botulism is a severe neurological disease caused by the complex family of botulinum neurotoxins (BoNT). Based on the different serotypes known today, a classification of serotype variants termed subtypes has been proposed ...
  • 2007-05-25Zeitschriftenartikel
    FepA- and TonB-dependent bacteriophage H8: receptor binding and genomic sequence. 
    Rabsch, Wolfgang; Ma, Li; Wiley, Graham; Najar, Fares Z.; Kaserer, Wallace; Schuerch, Daniel W.; Klebba, Joseph E.; Roe, Bruce A.; Gomez, Jenny A. Laverde; Schallmey, Marcus; Newton, Salete M. C.; Klebba, Phillip E.
    H8 is derived from a collection of Salmonella enterica serotype Enteritidis bacteriophage. Its morphology and genomic structure closely resemble those of bacteriophage T5 in the family Siphoviridae. H8 infected S. enterica ...
  • 2013-10-23Zeitschriftenartikel
    Cytomegalovirus expresses the chemokine homologue vXCL1 capable of attracting XCR1+CD4- dendritic cells 
    Geyer, Henriette; Hartung, Evelyn; Mages, Hans Werner; Weise, Christoph; Belužić, Robert; Vugrek, Oliver; Jonjić, Stipan; Kroczek, Richard; Voigt, Sebastian
    Cytomegaloviruses (CMV) have developed various strategies to escape the immune system of the host. One strategy involves the expression of virus-encoded chemokines to modulate the host chemokine network. We have identified ...
Nutzungsbedingungen Impressum Leitlinien Datenschutzerklärung Kontakt

Das Robert Koch-Institut ist ein Bundesinstitut im

Geschäftsbereich des Bundesministeriums für Gesundheit

© Robert Koch Institut

Alle Rechte vorbehalten, soweit nicht ausdrücklich anders vermerkt.