Efficient polygenic risk scores for biobank scale data by exploiting phenotypes from inferred relatives
Autor: | Julius H. J. van der Werf, Xuan Zhou, Jisu Shin, Jiuyong Li, Buu Truong, S. Hong Lee, Thuc Duy Le |
---|---|
Přispěvatelé: | Truong, Buu, Zhou, Xuan, Shin, Jisu, Li, Jiuyong, van der Werf, Julius HJ, Le, Thuc D, Lee, S Hong |
Jazyk: | angličtina |
Rok vydání: | 2020 |
Předmět: |
Male
0301 basic medicine Multifactorial Inheritance General Physics and Astronomy Genome-wide association study predictive medicine 0302 clinical medicine lcsh:Science Biological Specimen Banks Multidisciplinary Life style food and beverages Biobank Phenotype Pedigree Experimental models of disease population screening Scale (social sciences) Female experimental models of disease Genotype Science Predictive medicine Biology Polymorphism Single Nucleotide Risk Assessment Article General Biochemistry Genetics and Molecular Biology Population screening 03 medical and health sciences Humans Genetic Predisposition to Disease Life Style Family Health Models Genetic Genome Human fungi Reproducibility of Results General Chemistry United Kingdom 030104 developmental biology Sample size determination lcsh:Q Polygenic risk score 030217 neurology & neurosurgery Genome-Wide Association Study Demography |
Zdroj: | Nature Communications, Vol 11, Iss 1, Pp 1-11 (2020) Nature Communications |
ISSN: | 2041-1723 |
Popis: | Polygenic risk scores are emerging as a potentially powerful tool to predict future phenotypes of target individuals, typically using unrelated individuals, thereby devaluing information from relatives. Here, for 50 traits from the UK Biobank data, we show that a design of 5,000 individuals with first-degree relatives of target individuals can achieve a prediction accuracy similar to that of around 220,000 unrelated individuals (mean prediction accuracy = 0.26 vs. 0.24, mean fold-change = 1.06 (95% CI: 0.99-1.13), P-value = 0.08), despite a 44-fold difference in sample size. For lifestyle traits, the prediction accuracy with 5,000 individuals including first-degree relatives of target individuals is significantly higher than that with 220,000 unrelated individuals (mean prediction accuracy = 0.22 vs. 0.16, mean fold-change = 1.40 (1.17-1.62), P-value = 0.025). Our findings suggest that polygenic prediction integrating family information may help to accelerate precision health and clinical intervention. Genetic data from large cohorts of unrelated individuals can be used to create polygenic risk scores, which could be used to predict individual risk of developing a specific disease. Here the authors show that smaller cohorts of related individuals can provide similarly powerful predictive ability. |
Databáze: | OpenAIRE |
Externí odkaz: |