2020-04-24Zeitschriftenartikel
Targeted domain assembly for fast functional profiling of metagenomic datasets with S3A
dc.contributor.author | David, Laurent | |
dc.contributor.author | Vicedomini, Riccardo | |
dc.contributor.author | Richard, Hugues | |
dc.contributor.author | Carbone, Alessandro | |
dc.date.accessioned | 2024-02-08T10:44:16Z | |
dc.date.available | 2024-02-08T10:44:16Z | |
dc.date.issued | 2020-04-24 | none |
dc.identifier.other | 10.1093/bioinformatics/btaa272 | |
dc.identifier.uri | http://edoc.rki.de/176904/11486 | |
dc.description.abstract | Motivation The understanding of the ever-increasing number of metagenomic sequences accumulating in our databases demands for approaches that rapidly ‘explore’ the content of multiple and/or large metagenomic datasets with respect to specific domain targets, avoiding full domain annotation and full assembly. Results S3A is a fast and accurate domain-targeted assembler designed for a rapid functional profiling. It is based on a novel construction and a fast traversal of the Overlap-Layout-Consensus graph, designed to reconstruct coding regions from domain annotated metagenomic sequence reads. S3A relies on high-quality domain annotation to efficiently assemble metagenomic sequences and on the design of a new confidence measure for a fast evaluation of overlapping reads. Its implementation is highly generic and can be applied to any arbitrary type of annotation. On simulated data, S3A achieves a level of accuracy similar to that of classical metagenomics assembly tools while permitting to conduct a faster and sensitive profiling on domains of interest. When studying a few dozens of functional domains—a typical scenario—S3A is up to an order of magnitude faster than general purpose metagenomic assemblers, thus enabling the analysis of a larger number of datasets in the same amount of time. S3A opens new avenues to the fast exploration of the rapidly increasing number of metagenomic datasets displaying an ever-increasing size. | eng |
dc.language.iso | eng | none |
dc.publisher | Robert Koch-Institut | |
dc.rights | (CC BY-NC 3.0 DE) Namensnennung - Nicht kommerziell 3.0 Deutschland | ger |
dc.rights.uri | http://creativecommons.org/licenses/by-nc/3.0/de/ | |
dc.subject | algorithms | eng |
dc.subject | metagenome | eng |
dc.subject | metagenomics | eng |
dc.subject | sequence analysis | eng |
dc.subject | software | eng |
dc.subject.ddc | 610 Medizin und Gesundheit | none |
dc.title | Targeted domain assembly for fast functional profiling of metagenomic datasets with S3A | none |
dc.type | article | |
dc.identifier.urn | urn:nbn:de:0257-176904/11486-2 | |
dc.type.version | publishedVersion | none |
local.edoc.container-title | Bioinformatics | none |
local.edoc.container-issn | 1367-4811 | none |
local.edoc.pages | 7 | none |
local.edoc.type-name | Zeitschriftenartikel | |
local.edoc.container-type | periodical | |
local.edoc.container-type-name | Zeitschrift | |
local.edoc.container-url | https://academic.oup.com/bioinformatics | none |
local.edoc.container-publisher-name | Oxford University Press | none |
local.edoc.container-volume | 36 | none |
local.edoc.container-issue | 13 | none |
local.edoc.container-reportyear | 2020 | none |
local.edoc.container-firstpage | 3975 | none |
local.edoc.container-lastpage | 3981 | none |
dc.description.version | Peer Reviewed | none |