The CaspBase: a curated database for evolutionary biochemical studies of caspase functional divergence and ancestral sequence inference
Authors: | Robert D. Grinshpon, A. Clay Clark, Anna Williford, James Titus-McQuillan |
---|---|
Year of publication: | 2018 |
Subject: |
0301 basic medicine; Multiple sequence alignment; Database; biology; Phylogenetic tree; Sequence analysis; Computer science; Inference; computer.software_genre; Biochemistry; 03 medical and health sciences; 030104 developmental biology; biology.protein; Gene family; Molecular Biology; computer; Caspase; Functional divergence; Sequence (medicine) |
Source: | Protein Science. 27:1857-1870 |
ISSN: | 0961-8368 |
DOI: | 10.1002/pro.3494 |
Description: | Sequence databases are powerful tools for the contemporary scientists' toolkit. However, most functional annotations in public databases are determined computationally and are not verified by a human expert. While hypotheses generated from computational studies are now amenable to experimentation, the quality of the results relies on the quality of input data. We developed the CaspBase to expedite high-quality dataset compilation of annotated caspase sequences, to maximize phylogenetic signal, and to reduce the noise contributed from public databanks. We describe our methods of curation for the CaspBase and how researchers can acquire sequences from CaspBase.org. Our immediate goal for developing the CaspBase was to optimize the ancestral protein reconstruction (APR) of caspases, and we demonstrate the utility of the CaspBase in APR studies. We also developed the Common Position (CP) system for comparing human caspase family paralogs and suggest the CP system as an update to current reporting methods of caspase amino acid positions. We present a standardized multiple sequence alignment (MSA) for the CP system and show the advantage of using large databases such as the CaspBase in defining structural positions in proteins. Although the results described here pertain to caspase evolution and structure-function studies, the methods can be adapted to any gene family. |
Database: | OpenAIRE |