Understanding admixture fractions: theory and estimation of gene-flow.

Autor: Liang M; Department of Integrative Biology, University of California, Valley Life Sciences Building, Berkeley, CA, 94720, USA.; Department of Statistics, University of California, Evans Hall, Berkeley, CA, 94720, USA., Shishkin M; International laboratory of statistical and computational genomics, Faculty of computer science, HSE University, Pokrovskiy Boulevard, 11, Moscow, Russian Federation, 109028., Shchur V; International laboratory of statistical and computational genomics, Faculty of computer science, HSE University, Pokrovskiy Boulevard, 11, Moscow, Russian Federation, 109028. vshchur@hse.ru., Nielsen R; Department of Integrative Biology, University of California, Valley Life Sciences Building, Berkeley, CA, 94720, USA. rasmus_nielsen@berkeley.edu.; Department of Statistics, University of California, Evans Hall, Berkeley, CA, 94720, USA. rasmus_nielsen@berkeley.edu.; Globe Institute, University of Copenhagen, Oester Voldgade 5-7, 1350, Copenhagen K, Denmark. rasmus_nielsen@berkeley.edu.
Jazyk: angličtina
Zdroj: Journal of mathematical biology [J Math Biol] 2024 Oct 04; Vol. 89 (5), pp. 47. Date of Electronic Publication: 2024 Oct 04.
DOI: 10.1007/s00285-024-02146-0
Abstrakt: Estimation of admixture proportions has become one of the most commonly used computational tools in population genomics. However, there is remarkably little population genetic theory on statistical properties of these variables. We develop theoretical results that can accurately predict means and variances of admixture proportions within a population using models with recombination and genetic drift. Based on established theory on measures of multilocus disequilibrium, we show that there is a set of recurrence relations that can be used to derive expectations for higher moments of the admixture proportions distribution. We obtain closed form solutions for some special cases. Using these results, we develop a method for estimating admixture parameters from estimated admixture proportions obtained from programs such as Structure or Admixture. We apply this method to HapMap 3 data and find that the population history of African Americans, as expected, is not best explained by a single admixture event between people of European and African ancestry. The model of constant gene flow starting at 8 generations and ending at 2 generations before present gives the best fit.
(© 2024. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.)
Databáze: MEDLINE