A de novo DNA Sequencing and Variant Calling Algorithm for Nanopores
Authors: | Tamas Szalay, Jene Andrew Golovchenko |
---|---|
Language: | English |
Year of publication: | 2015 |
Subject: |
0303 health sciences; M13 bacteriophage; biology; Pipeline (computing); Statistical model; 010402 general chemistry; biology.organism_classification; 01 natural sciences; Genome; DNA sequencing; 0104 chemical sciences; 03 medical and health sciences; Nanopore; Nanopore sequencing; Error detection and correction; Algorithm; 030304 developmental biology |
DOI: | 10.1101/019448 |
Description: | The single-molecule accuracy of nanopore sequencing has been an area of rapid academic and commercial advancement, but remains insufficient for the de novo analysis of genomes. We introduce here a novel algorithm for the error correction of nanopore data, utilizing statistical models of the physical system in order to obtain high accuracy de novo sequences at a range of coverage depths. We demonstrate the technique by sequencing M13 bacteriophage DNA to 99% accuracy at moderate coverage as well as its use in an assembly pipeline by sequencing λ DNA at a range of coverages. We also show the algorithm’s ability to accurately classify sequence variants at far lower coverage than existing methods. |
Database: | OpenAIRE |