A universal SNP and small-indel variant caller using deep neural networks
Autor: | Scott Schwartz, Ryan Poplin, David Alexander, Thomas Colthurst, Alexander Ku, Jojo Dijamco, Dan Newburger, Nam Nguyen, Pegah T Afshar, Pi-Chuan Chang, Mark A. DePristo, Sam S Gross, Cory Y. McLean, Lizzie Dorfman |
---|---|
Rok vydání: | 2017 |
Předmět: |
0301 basic medicine
Genotype Sequence analysis Computer science DNA Mutational Analysis Biomedical Engineering Bioengineering Genomics Computational biology Applied Microbiology and Biotechnology Genome Convolutional neural network Polymorphism Single Nucleotide 03 medical and health sciences INDEL Mutation Animals Humans Indel Exome sequencing Mammals Artificial neural network Genome Human High-Throughput Nucleotide Sequencing Sequence Analysis DNA 030104 developmental biology Molecular Medicine Neural Networks Computer Software Biotechnology |
Zdroj: | Nature biotechnology. 36(10) |
ISSN: | 1546-1696 |
Popis: | Despite rapid advances in sequencing technologies, accurately calling genetic variants present in an individual genome from billions of short, errorful sequence reads remains challenging. Here we show that a deep convolutional neural network can call genetic variation in aligned next-generation sequencing read data by learning statistical relationships between images of read pileups around putative variant and true genotype calls. The approach, called DeepVariant, outperforms existing state-of-the-art tools. The learned model generalizes across genome builds and mammalian species, allowing nonhuman sequencing projects to benefit from the wealth of human ground-truth data. We further show that DeepVariant can learn to call variants in a variety of sequencing technologies and experimental designs, including deep whole genomes from 10X Genomics and Ion Ampliseq exomes, highlighting the benefits of using more automated and generalizable techniques for variant calling. |
Databáze: | OpenAIRE |
Externí odkaz: |