OMA standalone: orthology inference among public and custom genomes and transcriptomes
Autor: | Adrian M. Altenhoff, Maximilian J. Telford, Bartlomiej Tomiczek, David Dylus, Natasha Glover, Steven Müller, Christophe Dessimoz, Daniel A. Dalquen, Magdalena Zarowiecki, Jeremy Levy, Alex Warwick Vesztrocy |
---|---|
Rok vydání: | 2019 |
Předmět: |
0106 biological sciences
Computer science Method Inference Context (language use) Computational biology Biology 010603 evolutionary biology 01 natural sciences Genome Transcriptome 03 medical and health sciences 0302 clinical medicine Phylogenetics Databases Genetic Genetics Gene family Animals Clade Gene Phylogeny Genetics (clinical) 030304 developmental biology 0303 health sciences Phylogenetic tree Invertebrates/classification Invertebrates/genetics Software Invertebrates Pipeline (software) Standalone program 030217 neurology & neurosurgery |
Zdroj: | Genome research, vol. 29, no. 7, pp. 1152-1163 Genome Research, 29 (7) |
ISSN: | 1549-5469 1088-9051 |
Popis: | Genomes and transcriptomes are now typically sequenced by individual laboratories but analyzing them often remains challenging. One essential step in many analyses lies in identifying orthologs—corresponding genes across multiple species—but this is far from trivial. The Orthologous MAtrix (OMA) database is a leading resource for identifying orthologs among publicly available, complete genomes. Here, we describe the OMA pipeline available as a standalone program for Linux and Mac. When run on a cluster, it has native support for the LSF, SGE, PBS Pro, and Slurm job schedulers and can scale up to thousands of parallel processes. Another key feature of OMA standalone is that users can combine their own data with existing public data by exporting genomes and precomputed alignments from the OMA database, which currently contains over 2100 complete genomes. We compare OMA standalone to other methods in the context of phylogenetic tree inference, by inferring a phylogeny of Lophotrochozoa, a challenging clade within the protostomes. We also discuss other potential applications of OMA standalone, including identifying gene families having undergone duplications/losses in specific clades, and identifying potential drug targets in nonmodel organisms. OMA standalone is available under the permissive open source Mozilla Public License Version 2.0. Genome Research, 29 (7) ISSN:1088-9051 ISSN:1549-5469 |
Databáze: | OpenAIRE |
Externí odkaz: |