NGSANE: a lightweight production informatics framework for high-throughput data analysis.
Autor: | Buske FA; Cancer Epigenetics Program, Cancer Research Division, Kinghorn Cancer Centre, Garvan Institute of Medical Research, RNA Biology and Plasticity Laboratory, Garvan Institute of Medical Research, St Vincent's Clinical School, University of NSW, Sydney 2010, Australia and Division of Computational Informatics, CSIRO, Sydney 2113, Australia., French HJ, Smith MA, Clark SJ, Bauer DC |
---|---|
Jazyk: | angličtina |
Zdroj: | Bioinformatics (Oxford, England) [Bioinformatics] 2014 May 15; Vol. 30 (10), pp. 1471-2. Date of Electronic Publication: 2014 Jan 26. |
DOI: | 10.1093/bioinformatics/btu036 |
Abstrakt: | Summary: The initial steps in the analysis of next-generation sequencing data can be automated by way of software 'pipelines'. However, individual components depreciate rapidly because of the evolving technology and analysis methods, often rendering entire versions of production informatics pipelines obsolete. Constructing pipelines from Linux bash commands enables the use of hot swappable modular components as opposed to the more rigid program call wrapping by higher level languages, as implemented in comparable published pipelining systems. Here we present Next Generation Sequencing ANalysis for Enterprises (NGSANE), a Linux-based, high-performance-computing-enabled framework that minimizes overhead for set up and processing of new projects, yet maintains full flexibility of custom scripting when processing raw sequence data. Availability and Implementation: Ngsane is implemented in bash and publicly available under BSD (3-Clause) licence via GitHub at https://github.com/BauerLab/ngsane. Contact: Denis.Bauer@csiro.au Supplementary Information: Supplementary data are available at Bioinformatics online. |
Databáze: | MEDLINE |
Externí odkaz: |