LDpred-funct: incorporating functional priors improves polygenic prediction accuracy in UK Biobank and 23andMe data sets

Autor:	Samuel S. Kim, Furlotte N, Po-Ru Loh, Carla Marquez-Luna, Adam Auton, Alkes L. Price, Steven Gazal
Rok vydání:	2018
Předmět:	0303 health sciences Training set 030305 genetics & heredity Confounding Causal effect Heritability Biobank 03 medical and health sciences Statistics Prior probability Trait 030304 developmental biology Mathematics Coding (social sciences)
Popis:	Genetic variants in functional regions of the genome are enriched for complex trait heritability. Here, we introduce a new method for polygenic prediction, LDpred-funct, that leverages trait-specific functional priors to increase prediction accuracy. We fit priors using the recently developed baseline-LD model, which includes coding, conserved, regulatory and LD-related annotations. We analytically estimate posterior mean causal effect sizes and then use cross-validation to regularize these estimates, improving prediction accuracy for sparse architectures. LDpred-funct attained higher prediction accuracy than other polygenic prediction methods in simulations using real genotypes. We applied LDpred-funct to predict 21 highly heritable traits in the UK Biobank. We used association statistics from British-ancestry samples as training data (avgN=373K) and samples of other European ancestries as validation data (avgN=22K), to minimize confounding. LDpred-funct attained a +4.6% relative improvement in average prediction accuracy (avg predictionR2=0.144; highestR2=0.413 for height) compared to SBayesR (the best method that does not incorporate functional information). For height, meta-analyzing training data from UK Biobank and 23andMe cohorts (totalN=1107K; higher heritability in UK Biobank cohort) increased predictionR2to 0.431. Our results show that incorporating functional priors improves polygenic prediction accuracy, consistent with the functional architecture of complex traits.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::97d10e769390e009ebf322b3480e462f https://doi.org/10.1101/375337 Zobrazit plný text záznamu