Logo of Robert Koch InstituteLogo of Robert Koch Institute
Publication Server of Robert Koch Instituteedoc
de|en
View Item 
  • edoc-Server Home
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • View Item
  • edoc-Server Home
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.
All of edoc-ServerCommunity & CollectionTitleAuthorSubjectThis CollectionTitleAuthorSubject
PublishLoginRegisterHelp
StatisticsView Usage Statistics
All of edoc-ServerCommunity & CollectionTitleAuthorSubjectThis CollectionTitleAuthorSubject
PublishLoginRegisterHelp
StatisticsView Usage Statistics
View Item 
  • edoc-Server Home
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • View Item
  • edoc-Server Home
  • Artikel in Fachzeitschriften
  • Artikel in Fachzeitschriften
  • View Item
2024-05-28Zeitschriftenartikel
Sequencing accuracy and systematic errors of nanopore direct RNA sequencing
Liu-Wei, Wang
van der Toorn, Wiep
Bohn, Patrick
Hölzer, Martin
Smyth, Redmond P.
von Kleist, Max
Background: Direct RNA sequencing (dRNA-seq) on the Oxford Nanopore Technologies (ONT) platforms can produce reads covering up to full-length gene transcripts, while containing decipherable information about RNA base modifications and poly-A tail lengths. Although many published studies have been expanding the potential of dRNA-seq, its sequencing accuracy and error patterns remain understudied. Results: We present the first comprehensive evaluation of sequencing accuracy and characterisation of systematic errors in dRNA-seq data from diverse organisms and synthetic in vitro transcribed RNAs. We found that for sequencing kits SQK-RNA001 and SQK-RNA002, the median read accuracy ranged from 87% to 92% across species, and deletions significantly outnumbered mismatches and insertions. Due to their high abundance in the transcriptome, heteropolymers and short homopolymers were the major contributors to the overall sequencing errors. We also observed systematic biases across all species at the levels of single nucleotides and motifs. In general, cytosine/uracil-rich regions were more likely to be erroneous than guanines and adenines. By examining raw signal data, we identified the underlying signal-level features potentially associated with the error patterns and their dependency on sequence contexts. While read quality scores can be used to approximate error rates at base and read levels, failure to detect DNA adapters may be a source of errors and data loss. By comparing distinct basecallers, we reason that some sequencing errors are attributable to signal insufficiency rather than algorithmic (basecalling) artefacts. Lastly, we generated dRNA-seq data using the latest SQK-RNA004 sequencing kit released at the end of 2023 and found that although the overall read accuracy increased, the systematic errors remain largely identical compared to the previous kits. Conclusions: As the first systematic investigation of dRNA-seq errors, this study offers a comprehensive overview of reproducible error patterns across diverse datasets, identifies potential signal-level insufficiency, and lays the foundation for error correction methods.
Files in this item
Thumbnail
s12864-024-10440-w.pdf — Adobe PDF — 3.093 Mb
MD5: dd3daea64bd26626b89ca0295b7c8e8b
Cite
BibTeX
EndNote
RIS
(CC BY 3.0 DE) Namensnennung 3.0 Deutschland(CC BY 3.0 DE) Namensnennung 3.0 Deutschland
Details

Related Items

Show related Items with similar Title, Author, Creator or Subject.

  • 2015-02-06Zeitschriftenartikel
    Isolation and Functional Characterization of the Novel Clostridium botulinum Neurotoxin A8 Subtype 
    Kull, Skadi; Schulz, Melanie; Strotmeier, Jasmin Weisemann née; Kirchner, Sebastian; Schreiber, Tanja; Bollenbach, Alexander; Dabrowski, Piotr Wojtek; Nitsche, Andreas; Kalb, Suzanne R.; Dorner, Martin; Barr, John R.; Rummel, Andreas; Dorner, Brigitte
    Botulism is a severe neurological disease caused by the complex family of botulinum neurotoxins (BoNT). Based on the different serotypes known today, a classification of serotype variants termed subtypes has been proposed ...
  • 2007-05-25Zeitschriftenartikel
    FepA- and TonB-dependent bacteriophage H8: receptor binding and genomic sequence. 
    Rabsch, Wolfgang; Ma, Li; Wiley, Graham; Najar, Fares Z.; Kaserer, Wallace; Schuerch, Daniel W.; Klebba, Joseph E.; Roe, Bruce A.; Gomez, Jenny A. Laverde; Schallmey, Marcus; Newton, Salete M. C.; Klebba, Phillip E.
    H8 is derived from a collection of Salmonella enterica serotype Enteritidis bacteriophage. Its morphology and genomic structure closely resemble those of bacteriophage T5 in the family Siphoviridae. H8 infected S. enterica ...
  • 2013-10-23Zeitschriftenartikel
    Cytomegalovirus expresses the chemokine homologue vXCL1 capable of attracting XCR1+CD4- dendritic cells 
    Geyer, Henriette; Hartung, Evelyn; Mages, Hans Werner; Weise, Christoph; Belužić, Robert; Vugrek, Oliver; Jonjić, Stipan; Kroczek, Richard; Voigt, Sebastian
    Cytomegaloviruses (CMV) have developed various strategies to escape the immune system of the host. One strategy involves the expression of virus-encoded chemokines to modulate the host chemokine network. We have identified ...
Terms of Use Imprint Policy Data Privacy Statement Contact

The Robert Koch Institute is a Federal Institute

within the portfolio of the Federal Ministry of Health

© Robert Koch Institute

All rights reserved unless explicitly granted.