Efficient phasing and imputation of low-coverage sequencing data using large reference panels
Autor: | Simone Rubinacci, Diogo M. Ribeiro, Robin J. Hofmeister, Olivier Delaneau |
---|---|
Rok vydání: | 2021 |
Předmět: |
0303 health sciences
Sequencing data Biology computer.software_genre Phaser 03 medical and health sciences 0302 clinical medicine Proof of concept Genetics Leverage (statistics) Data mining Allele frequency Genotyping computer 030217 neurology & neurosurgery Imputation (genetics) 030304 developmental biology Genetic association |
Zdroj: | Nature Genetics. 53:120-126 |
ISSN: | 1546-1718 1061-4036 |
DOI: | 10.1038/s41588-020-00756-0 |
Popis: | Low-coverage whole-genome sequencing followed by imputation has been proposed as a cost-effective genotyping approach for disease and population genetics studies. However, its competitiveness against SNP arrays is undermined because current imputation methods are computationally expensive and unable to leverage large reference panels. Here, we describe a method, GLIMPSE, for phasing and imputation of low-coverage sequencing datasets from modern reference panels. We demonstrate its remarkable performance across different coverages and human populations. GLIMPSE achieves imputation of a genome for less than US$1 in computational cost, considerably outperforming other methods and improving imputation accuracy over the full allele frequency range. As a proof of concept, we show that 1× coverage enables effective gene expression association studies and outperforms dense SNP arrays in rare variant burden tests. Overall, this study illustrates the promising potential of low-coverage imputation and suggests a paradigm shift in the design of future genomic studies. GLIMPSE is a new method for haplotype phasing and genotype imputation of low-coverage sequencing datasets from large reference panels. GLIMPSE shows remarkable performance across different coverages and human populations. |
Databáze: | OpenAIRE |
Externí odkaz: |