Using population data for assessing next-generation sequencing performance.

Autor: Houniet DT; Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK., Rahman TJ; Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK., Al Turki S; Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK., Hurles ME; Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK., Xu Y; Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK., Goodship J; Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK., Keavney B; Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK., Santibanez Koref M; Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
Jazyk: angličtina
Zdroj: Bioinformatics (Oxford, England) [Bioinformatics] 2015 Jan 01; Vol. 31 (1), pp. 56-61. Date of Electronic Publication: 2014 Sep 17.
DOI: 10.1093/bioinformatics/btu606
Abstrakt: Motivation: During the past 4 years, whole-exome sequencing has become a standard tool for finding rare variants causing Mendelian disorders. In that time, there has also been a proliferation of both sequencing platforms and approaches to analyse their output. This requires approaches to assess the performance of different methods. Traditionally, criteria such as comparison with microarray data or a number of known polymorphic sites have been used. Here we expand such approaches, developing a maximum likelihood framework and using it to estimate the sensitivity and specificity of whole-exome sequencing data.
Results: Using whole-exome sequencing data for a panel of 19 individuals, we show that estimated sensitivity and specificity are similar to those calculated using microarray data as a reference. We explore the effect of frequency misspecification arising from using an inappropriately selected population and find that, although the estimates are affected, the rankings across procedures remain the same.
Availability and Implementation: An implementation using Perl and R can be found at busso.ncl.ac.uk (Username: igm101; Password: Z1z1nts).
(© The Author 2014. Published by Oxford University Press.)
Databáze: MEDLINE