Population Stratification at the Phenotypic Variance level and Implication for the Analysis of Whole Genome Sequencing Data from Multiple Studies
Autor: | Ethan M. Lange, Matthew P. Conomos, Tamar Sofer, Xiuwen Zheng, Kenneth Rice, O’Connell, J. C. Bis, Stephanie M. Gogarten, Cecilia A. Laurie, Jennifer A. Brody, Bruce M. Psaty, Adam A. Szpiro, Yan Gao, Timothy A. Thornton, L. A. Cupples |
---|---|
Rok vydání: | 2020 |
Předmět: |
Whole genome sequencing
0303 health sciences 030305 genetics & heredity Variance (accounting) Biology Population stratification Statistical power 3. Good health Term (time) 03 medical and health sciences Sample size determination Statistics False positive paradox Allele frequency 030304 developmental biology |
DOI: | 10.1101/2020.03.03.973420 |
Popis: | SummaryIn modern Whole Genome Sequencing (WGS) epidemiological studies, participant-level data from multiple studies are often pooled and results are obtained from a single analysis. We consider the impact of differential phenotype variances by study, which we term ‘variance stratification’. Unaccounted for, variance stratification can lead to both decreased statistical power, and increased false positives rates, depending on how allele frequencies, sample sizes, and phenotypic variances vary across the studies that are pooled. We describe a WGS-appropriate analysis approach, implemented in freely-available software, which allows study-specific variances and thereby improves performance in practice. We also illustrate the variance stratification problem, its solutions, and a corresponding diagnostic procedure in data from the Trans-Omics for Precision Medicine Whole Genome Sequencing Program (TOPMed), used in association tests for hemoglobin concentrations and BMI. |
Databáze: | OpenAIRE |
Externí odkaz: |