SimPrily: A Python framework to simplify high-throughput genomic simulations

Autor: Ariella L. Gladstein, Consuelo D. Quinto-Cortés, Julian L. Pistorius, David Christy, Logan Gantner, Blake L. Joyce
Jazyk: angličtina
Rok vydání: 2018
Předmět:
Zdroj: SoftwareX, Vol 7, Iss , Pp 335-340 (2018)
Druh dokumentu: article
ISSN: 2352-7110
DOI: 10.1016/j.softx.2018.09.003
Popis: Genomic simulations are an important technique used in population genetics to infer demographic history, test for regions under selection, and create datasets to validate software. However, running thousands of simulations and manipulating large loci can present computational challenges. We present SimPrily, a Python tool optimized for high throughput computing (HTC), which facilitates simulation of whole chromosomes. SimPrily can use prior distributions of parameters to run simulations, incorporate single nucleotide polymorphism array ascertainment bias into the simulated model, and calculate a variety of genomic summary statistics. We include with SimPrily high-throughput workflows that leverage free computing resources through the Open Science Grid and CyVerse Discovery Environment, allowing researchers to run thousands or millions of large-locus simulations with minimal or no prior command line knowledge. Keywords: Genomics, Coalescent simulation, High-throughput computing, Demographic history
Databáze: Directory of Open Access Journals