Minimum sample sizes for population genomics: an empirical study from an Amazonian plant species
Autor: | Alison G. Nazareno, Christopher W. Dick, Jordan B. Bemmels, Lúcia G. Lohmann |
---|---|
Rok vydání: | 2017 |
Předmět: |
0106 biological sciences
0301 basic medicine Genotyping Techniques Population Biology Polymorphism Single Nucleotide 010603 evolutionary biology 01 natural sciences Genome DNA sequencing Nucleotide diversity Population genomics 03 medical and health sciences Genetics education Ecology Evolution Behavior and Systematics education.field_of_study Genetic diversity Genetic Variation High-Throughput Nucleotide Sequencing 030104 developmental biology Sample size determination Genetic marker Evolutionary biology Sample Size Metagenomics Violaceae Biotechnology |
Zdroj: | Molecular Ecology Resources. 17:1136-1147 |
ISSN: | 1755-0998 1755-098X |
DOI: | 10.1111/1755-0998.12654 |
Popis: | High-throughput DNA sequencing facilitates the analysis of large portions of the genome in nonmodel organisms, ensuring high accuracy of population genetic parameters. However, empirical studies evaluating the appropriate sample size for these kinds of studies are still scarce. In this study, we use double-digest restriction-associated DNA sequencing (ddRADseq) to recover thousands of single nucleotide polymorphisms (SNPs) for two physically isolated populations of Amphirrhox longifolia (Violaceae), a nonmodel plant species for which no reference genome is available. We used resampling techniques to construct simulated populations with a random subset of individuals and SNPs to determine how many individuals and biallelic markers should be sampled for accurate estimates of intra- and interpopulation genetic diversity. We identified 3646 and 4900 polymorphic SNPs for the two populations of A. longifolia, respectively. Our simulations show that, overall, a sample size greater than eight individuals has little impact on estimates of genetic diversity within A. longifolia populations, when 1000 SNPs or higher are used. Our results also show that even at a very small sample size (i.e. two individuals), accurate estimates of FST can be obtained with a large number of SNPs (≥1500). These results highlight the potential of high-throughput genomic sequencing approaches to address questions related to evolutionary biology in nonmodel organisms. Furthermore, our findings also provide insights into the optimization of sampling strategies in the era of population genomics. |
Databáze: | OpenAIRE |
Externí odkaz: |