Machine-Learning Prospects for Detecting Selection Signatures Using Population Genomics Data.

Autor: Kumar H; Divisions of Animal Genetics, ICAR-Indian Veterinary Research Institute, Izatnagar, India., Panigrahi M; Divisions of Animal Genetics, ICAR-Indian Veterinary Research Institute, Izatnagar, India., Panwar A; Divisions of Animal Genetics, ICAR-Indian Veterinary Research Institute, Izatnagar, India., Rajawat D; Divisions of Animal Genetics, ICAR-Indian Veterinary Research Institute, Izatnagar, India., Nayak SS; Divisions of Animal Genetics, ICAR-Indian Veterinary Research Institute, Izatnagar, India., Saravanan KA; Divisions of Animal Genetics, ICAR-Indian Veterinary Research Institute, Izatnagar, India., Kaisa K; Divisions of Animal Genetics, ICAR-Indian Veterinary Research Institute, Izatnagar, India., Parida S; Divisions of Pharmacology and Toxicology, ICAR-Indian Veterinary Research Institute, Izatnagar, India., Bhushan B; Divisions of Animal Genetics, ICAR-Indian Veterinary Research Institute, Izatnagar, India., Dutt T; Livestock Production and Management Section, ICAR-Indian Veterinary Research Institute, Izatnagar, India.
Jazyk: angličtina
Zdroj: Journal of computational biology : a journal of computational molecular cell biology [J Comput Biol] 2022 Sep; Vol. 29 (9), pp. 943-960. Date of Electronic Publication: 2022 May 30.
DOI: 10.1089/cmb.2021.0447
Abstrakt: Natural selection has been given a lot of attention because it relates to the adaptation of populations to their environments, both biotic and abiotic. An allele is selected when it is favored by natural selection. Consequently, the favored allele increases in frequency in the population and neighboring linked variation diminishes, causing so-called selective sweeps. A high-throughput genomic sequence allows one to disentangle the evolutionary forces at play in populations. With the development of high-throughput genome sequencing technologies, it has become easier to detect these selective sweeps/selection signatures. Various methods can be used to detect selective sweeps, from simple implementations using summary statistics to complex statistical approaches. One of the important problems of these statistical models is the potential to provide inaccurate results when their assumptions are violated. The use of machine learning (ML) in population genetics has been introduced as an alternative method of detecting selection by treating the problem of detecting selection signatures as a classification problem. Since the availability of population genomics data is increasing, researchers may incorporate ML into these statistical models to infer signatures of selection with higher predictive accuracy and better resolution. This article describes how ML can be used to aid in detecting and studying natural selection patterns using population genomic data.
Databáze: MEDLINE