Autor: |
Rei Ck, Kalisvar Marimuthu, Soldo Jp, Jia Yu Koh, Denis Bertrand, Barnaby Edward Young, Kalathiappan M, Timothy Barkham, Muthiah S, Mile Šikić, Dvornicic M, Tek No, Chenhao Li, Hui Qi Ng A, Niranjan Nagarajan, James C. Shaw |
Jazyk: |
angličtina |
Rok vydání: |
2018 |
Předmět: |
|
DOI: |
10.1101/456905 |
Popis: |
The analysis of information rich whole-metagenome datasets acquired from complex microbial communities is often restricted by the fragmented nature of assembly from short-read sequencing. The availability of long-reads from third-generation sequencing technologies (e.g. PacBio or Oxford Nanopore) can help improve assembly quality in principle, but high error rates and low throughput have limited their application in metagenomics. In this work, we describe the first hybrid metagenomic assembler which combines the advantages of short and long-read technologies, providing an order of magnitude improvement in contiguity compared to short read assemblies, and high base-pair level accuracy. The proposed approach (OPERA-MS) integrates a novel assembly-based metagenome clustering technique with an exact scaffolding algorithm that can efficiently assemble repeat rich sequences. Based on evaluations with defined in vitro communities and virtual gut microbiomes, we show that it is possible to assemble near complete genomes from metagenomes with as little as 9× long read coverage, thus enabling high quality assembly of lowly abundant species (80 putative plasmid/phage sequences and a 263kbp jumbo phage. Our results highlight that high-quality hybrid assemblies provide an unprecedented view of the gut resistome in these patients, including strain dynamics and identification of novel plasmid sequences. |
Databáze: |
OpenAIRE |
Externí odkaz: |
|