Estimating admixture pedigrees of recent hybrids without a contiguous reference genome.

Autor: Garcia-Erill G; Department of Biology, University of Copenhagen, Copenhagen, Denmark., Hanghøj K; Department of Biology, University of Copenhagen, Copenhagen, Denmark., Heller R; Department of Biology, University of Copenhagen, Copenhagen, Denmark., Wiuf C; Department of Mathematical Sciences, University of Copenhagen, Copenhagen, Denmark., Albrechtsen A; Department of Biology, University of Copenhagen, Copenhagen, Denmark.
Jazyk: angličtina
Zdroj: Molecular ecology resources [Mol Ecol Resour] 2023 Oct; Vol. 23 (7), pp. 1604-1619. Date of Electronic Publication: 2023 Jul 03.
DOI: 10.1111/1755-0998.13830
Abstrakt: The genome of recently admixed individuals or hybrids has characteristic genetic patterns that can be used to learn about their recent admixture history. One of these are patterns of interancestry heterozygosity, which can be inferred from SNP data from either called genotypes or genotype likelihoods, without the need for information on genomic location. This makes them applicable to a wide range of data that are often used in evolutionary and conservation genomic studies, such as low-depth sequencing mapped to scaffolds and reduced representation sequencing. Here we implement maximum likelihood estimation of interancestry heterozygosity patterns using two complementary models. We furthermore develop apoh (Admixture Pedigrees of Hybrids), a software that uses estimates of paired ancestry proportions to detect recently admixed individuals or hybrids, and to suggest possible admixture pedigrees. It furthermore calculates several hybrid indices that make it easier to identify and rank possible admixture pedigrees that could give rise to the estimated patterns. We implemented apoh both as a command line tool and as a Graphical User Interface that allows the user to automatically and interactively explore, rank and visualize compatible recent admixture pedigrees, and calculate the different summary indices. We validate the performance of the method using admixed family trios from the 1000 Genomes Project. In addition, we show its applicability on identifying recent hybrids from RAD-seq data of Grant's gazelle (Nanger granti and Nanger petersii) and whole genome low-depth data of waterbuck (Kobus ellipsiprymnus) which shows complex admixture of up to four populations.
(© 2023 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd.)
Databáze: MEDLINE