Strategies for choosing core animals in the algorithm for proven and young and their impact on the accuracy of single-step genomic predictions in cattle

Autor:	A. Cesarani, M. Bermann, C. Dimauro, L. Degano, D. Vicario, D. Lourenco, N.P.P. Macciotta
Jazyk:	angličtina
Rok vydání:	2023
Předmět:	Genomic selection Key individuals Prediction accuracy Principal component analysis Relationship matrix Animal culture SF1-1100
Zdroj:	Animal, Vol 17, Iss 4, Pp 100766- (2023)
Druh dokumentu:	article
ISSN:	1751-7311
DOI:	10.1016/j.animal.2023.100766
Popis:	Nowadays, in some populations, the number of genotyped animals is too large to obtain the inverse of the genomic relationship matrix. The algorithm for proven and young animals (APY) can be used to overcome this problem. In the present work, different strategies for defining core animals in APY were tested using either simulated or real data. In particular, core definitions based on random choice or on the contribution to the genomic relationship matrix (GCONTR) calculated using Principal Component Analysis were tested. Core sizes able to explain 90, 95, 98, and 99% of the total variance of the genomic relationship matrix (G) were used. Analyzed phenotypes were three simulated traits for 3 000 individuals, and milkability records for 136 406 Italian Simmental cows. The number of genotypes was 4 100 for the simulated dataset, and 11 636 for the Simmental data, respectively. The GCONTR values in Simmental dataset were moderately correlated with the analyzed phenotype, and they showed a decreasing trend according to the year of birth of genotyped animals. The accuracy increased as the size of the core increased in both datasets. The inclusion in the core of animals with largest GCONTR values led to the lowest accuracies (0.50 and 0.71 for the simulated and Simmental datasets, respectively; average across traits and core sizes). On the contrary, the selection of animals with the lowest rank according to their contribution to the G provided slightly higher accuracies, especially in the simulated dataset (0.68 for the simulated dataset, and 0.76 for the Simmental data; average across traits and core sizes). In real data, particularly for larger sizes of core animals, the criteria of choice appear less important, confirming the results of earlier studies. Anyway, the inclusion in the core of animals with the lowest values of GCONTR led to increases in accuracy. These are preliminary results based on a small sample size that need to be confirmed on a larger number of genotypes.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/116a3d56cd53431fad3f4ea3284fff75 Zobrazit plný text záznamu View record in DOAJ