Population structure and genetic diversity of the Pee Dee cotton breeding program.

Autor: Billings GT; Clemson University, Pee Dee Research and Education Center, Florence, SC 29501, USA.; North Carolina State University, Crop Science Department, Raleigh, NC 27695, USA., Jones MA; Clemson University, Pee Dee Research and Education Center, Florence, SC 29501, USA., Rustgi S; Clemson University, Pee Dee Research and Education Center, Florence, SC 29501, USA., Hulse-Kemp AM; North Carolina State University, Crop Science Department, Raleigh, NC 27695, USA.; USDA-ARS, Genomics and Bioinformatics Research Unit, Raleigh, NC 27695, USA., Campbell BT; USDA-ARS, Coastal Plains, Soil, Water, and Plant Research Center, Florence, SC 29501, USA.
Jazyk: angličtina
Zdroj: G3 (Bethesda, Md.) [G3 (Bethesda)] 2021 Jul 14; Vol. 11 (7).
DOI: 10.1093/g3journal/jkab145
Abstrakt: Accelerated marker-assisted selection and genomic selection breeding systems require genotyping data to select the best parents for combining beneficial traits. Since 1935, the Pee Dee (PD) cotton germplasm enhancement program has developed an important genetic resource for upland cotton (Gossypium hirsutum L.), contributing alleles for improved fiber quality, agronomic performance, and genetic diversity. To date, a detailed genetic survey of the program's eight historical breeding cycles has yet to be undertaken. The objectives of this study were to evaluate genetic diversity across and within-breeding groups, examine population structure, and contextualize these findings relative to the global upland cotton gene pool. The CottonSNP63K array was used to identify 17,441 polymorphic markers in a panel of 114 diverse PD genotypes. A subset of 4597 markers was selected to decrease marker density bias. Identity-by-state pairwise distance varied substantially, ranging from 0.55 to 0.97. Pedigree-based estimates of relatedness were not very predictive of observed genetic similarities. Few rare alleles were present, with 99.1% of SNP alleles appearing within the first four breeding cycles. Population structure analysis with principal component analysis, discriminant analysis of principal components, fastSTRUCTURE, and a phylogenetic approach revealed an admixed population with moderate substructure. A small core collection (n < 20) captured 99% of the program's allelic diversity. Allele frequency analysis indicated potential selection signatures associated with stress resistance and fiber cell growth. The results of this study will steer future utilization of the program's germplasm resources and aid in combining program-specific beneficial alleles and maintaining genetic diversity.
(Published by Oxford University Press on behalf of Genetics Society of America 2021. This work is written by US Government employees and is in the public domain in the US.)
Databáze: MEDLINE