Coalescence times for three genes provide sufficient information to distinguish population structure from population size changes.

Autor: Grusea S; Institut de Mathématiques de Toulouse, Université de Toulouse, Institut National des Sciences Appliquées, 31077, Toulouse, France. grusea@insa-toulouse.fr., Rodríguez W; Institut de Mathématiques de Toulouse, Université de Toulouse, Institut National des Sciences Appliquées, 31077, Toulouse, France., Pinchon D; Institut de Mathématiques de Toulouse, Université de Toulouse, Institut National des Sciences Appliquées, 31077, Toulouse, France., Chikhi L; Laboratoire Évolution et Diversité Biologique (EDB UMR 5174), Université de Toulouse Midi-Pyrénées, CNRS, IRD, UPS, 118 route de Narbonne, Bât. 4R1, 31062, Toulouse Cedex 9, France.; Instituto Gulbenkian de Ciência, Rua da Quinta Grande, No. 6, 2780-156, Oeiras, Portugal., Boitard S; GenPhySE, Université de Toulouse, INRA, INPT, INP-ENVT, Castanet Tolosan, France., Mazet O; Institut de Mathématiques de Toulouse, Université de Toulouse, Institut National des Sciences Appliquées, 31077, Toulouse, France.
Jazyk: angličtina
Zdroj: Journal of mathematical biology [J Math Biol] 2019 Jan; Vol. 78 (1-2), pp. 189-224. Date of Electronic Publication: 2018 Jul 20.
DOI: 10.1007/s00285-018-1272-4
Abstrakt: The increasing amount of genomic data currently available is expanding the horizons of population genetics inference. A wide range of methods have been published allowing to detect and date major changes in population size during the history of species. At the same time, there has been an increasing recognition that population structure can generate genetic data similar to those generated under models of population size change. Recently, Mazet et al. (Heredity 116(4):362-371, 2016) introduced the idea that, for any model of population structure, it is always possible to find a panmictic model with a particular function of population size-change having an identical distribution of [Formula: see text] (the time of the first coalescence for a sample of size two). This implies that there is an identifiability problem between a panmictic and a structured model when we base our analysis only on [Formula: see text]. In this paper, based on an analytical study of the rate matrix of the ancestral lineage process, we obtain new theoretical results about the joint distribution of the coalescence times [Formula: see text] for a sample of three haploid genes in a n-island model with constant size. Even if, for any [Formula: see text], it is always possible to find a size-change scenario for a panmictic population such that the marginal distribution of [Formula: see text] is exactly the same as in a n-island model with constant population size, we show that the joint distribution of the coalescence times [Formula: see text] for a sample of three genes contains enough information to distinguish between a panmictic population and a n-island model of constant size.
Databáze: MEDLINE