XPXP: improving polygenic prediction by cross-population and cross-phenotype analysis.
Autor: | Xiao J; Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China.; Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong SAR, China., Cai M; Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China.; Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong SAR, China., Hu X; Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China.; Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong SAR, China., Wan X; Shenzhen Research Institute of Big Data, Shenzhen 518172, China.; Pazhou Lab, Guangzhou 510330, China., Chen G; Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China., Yang C; Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China.; Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong SAR, China. |
---|---|
Jazyk: | angličtina |
Zdroj: | Bioinformatics (Oxford, England) [Bioinformatics] 2022 Mar 28; Vol. 38 (7), pp. 1947-1955. |
DOI: | 10.1093/bioinformatics/btac029 |
Abstrakt: | Motivation: As increasing sample sizes from genome-wide association studies (GWASs), polygenic risk scores (PRSs) have shown great potential in personalized medicine with disease risk prediction, prevention and treatment. However, the PRS constructed using European samples becomes less accurate when it is applied to individuals from non-European populations. It is an urgent task to improve the accuracy of PRSs in under-represented populations, such as African populations and East Asian populations. Results: In this article, we propose a cross-population and cross-phenotype (XPXP) method for construction of PRSs in under-represented populations. XPXP can construct accurate PRSs by leveraging biobank-scale datasets in European populations and multiple GWASs of genetically correlated phenotypes. XPXP also allows to incorporate population-specific and phenotype-specific effects, and thus further improves the accuracy of PRS. Through comprehensive simulation studies and real data analysis, we demonstrated that our XPXP outperformed existing PRS approaches. We showed that the height PRSs constructed by XPXP achieved 9% and 18% improvement over the runner-up method in terms of predicted R2 in East Asian and African populations, respectively. We also showed that XPXP substantially improved the stratification ability in identifying individuals at high genetic risk of type 2 diabetes. Availability and Implementation: The XPXP software and all analysis code are available at github.com/YangLabHKUST/XPXP. Supplementary Information: Supplementary data are available at Bioinformatics online. (© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.) |
Databáze: | MEDLINE |
Externí odkaz: |