Efficient phasing and imputation of low-coverage sequencing data using large reference panels

Autor: Simone Rubinacci, Diogo M. Ribeiro, Robin J. Hofmeister, Olivier Delaneau
Rok vydání: 2021
Předmět:
Zdroj: Nature Genetics. 53:120-126
ISSN: 1546-1718
1061-4036
DOI: 10.1038/s41588-020-00756-0
Popis: Low-coverage whole-genome sequencing followed by imputation has been proposed as a cost-effective genotyping approach for disease and population genetics studies. However, its competitiveness against SNP arrays is undermined because current imputation methods are computationally expensive and unable to leverage large reference panels. Here, we describe a method, GLIMPSE, for phasing and imputation of low-coverage sequencing datasets from modern reference panels. We demonstrate its remarkable performance across different coverages and human populations. GLIMPSE achieves imputation of a genome for less than US$1 in computational cost, considerably outperforming other methods and improving imputation accuracy over the full allele frequency range. As a proof of concept, we show that 1× coverage enables effective gene expression association studies and outperforms dense SNP arrays in rare variant burden tests. Overall, this study illustrates the promising potential of low-coverage imputation and suggests a paradigm shift in the design of future genomic studies. GLIMPSE is a new method for haplotype phasing and genotype imputation of low-coverage sequencing datasets from large reference panels. GLIMPSE shows remarkable performance across different coverages and human populations.
Databáze: OpenAIRE