Protein Sequence Annotation Tool (PSAT): a centralized web-based meta-server for high-throughput sequence annotations

Autor:	Elo Leung, Aldrin Montana, Carol L. Ecale Zhou, Jan Lorenz Soliman, Amy Huang, Eithon Cadag
Jazyk:	angličtina
Předmět:	0301 basic medicine Herbaspirillum Computer science Peptide Computational biology Genome Biochemistry 03 medical and health sciences Annotation Protein sequencing Sequence annotation Structural Biology Web application Gene Molecular Biology Sequence (medicine) chemistry.chemical_classification Internet 030102 biochemistry & molecular biology Computers business.industry Applied Mathematics Computational Biology High-Throughput Nucleotide Sequencing Molecular Sequence Annotation Genome project Computer Science Applications 030104 developmental biology Enzyme chemistry DNA microarray Water Microbiology business Software Genome Bacterial
Zdroj:	BMC Bioinformatics
ISSN:	1471-2105
DOI:	10.1186/s12859-016-0887-y
Popis:	Background Here we introduce the Protein Sequence Annotation Tool (PSAT), a web-based, sequence annotation meta-server for performing integrated, high-throughput, genome-wide sequence analyses. Our goals in building PSAT were to (1) create an extensible platform for integration of multiple sequence-based bioinformatics tools, (2) enable functional annotations and enzyme predictions over large input protein fasta data sets, and (3) provide a web interface for convenient execution of the tools. Results In this paper, we demonstrate the utility of PSAT by annotating the predicted peptide gene products of Herbaspirillum sp. strain RV1423, importing the results of PSAT into EC2KEGG, and using the resulting functional comparisons to identify a putative catabolic pathway, thereby distinguishing RV1423 from a well annotated Herbaspirillum species. This analysis demonstrates that high-throughput enzyme predictions, provided by PSAT processing, can be used to identify metabolic potential in an otherwise poorly annotated genome. Conclusions PSAT is a meta server that combines the results from several sequence-based annotation and function prediction codes, and is available at http://psat.llnl.gov/psat/. PSAT stands apart from other sequence-based genome annotation systems in providing a high-throughput platform for rapid de novo enzyme predictions and sequence annotations over large input protein sequence data sets in FASTA. PSAT is most appropriately applied in annotation of large protein FASTA sets that may or may not be associated with a single genome. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-0887-y) contains supplementary material, which is available to authorized users.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::0993b988e0e207b9b1617120f987c71d Zobrazit plný text záznamu Plný text ve formátu PDF