A novel method for estimating ancestral amino acid composition and its application to proteins of the Last Universal Ancestor
Autor: | D. J. Brooks, Mona Singh, Jacques R. Fresco |
---|---|
Rok vydání: | 2004 |
Předmět: |
Statistics and Probability
Molecular Sequence Data Biology Biochemistry Protein evolution Evolution Molecular Extant taxon Sequence Analysis Protein Animals Humans Amino Acid Sequence Molecular Biology Conserved Sequence Phylogeny chemistry.chemical_classification Likelihood Functions Models Statistical Models Genetic Thermophile Last universal ancestor Proteins Biological Evolution Computer Science Applications Amino acid Computational Mathematics Genetics Population Amino Acid Substitution Computational Theory and Mathematics Amino acid composition chemistry Genetic Code Evolutionary biology Simulated data Sequence Alignment Algorithms |
Zdroj: | Bioinformatics. 20:2251-2257 |
ISSN: | 1367-4811 1367-4803 |
DOI: | 10.1093/bioinformatics/bth235 |
Popis: | Motivation: Knowledge of how proteomic amino acid composition has changed over time is important for constructing realistic models of protein evolution and increasing our understanding of molecular evolutionary history. The proteomic amino acid composition of the Last Universal Ancestor (LUA) of life is of particular interest, since that might provide insight into the early evolution of proteins and the nature of the LUA itself. Results: We introduce a method to estimate ancestral amino acid composition that is based on expectation–maximization. On simulated data, the approach was found to be very effective in estimating ancestral amino acid composition, with accuracy improving as the number of residues in the dataset was increased. The method was then used to infer the amino acid composition of a set of proteins in the LUA. In general, as compared with the modern protein set, LUA proteins were found to be richer in amino acids that are believed to have been most abundant in the prebiotic environment and poorer in those believed to have been unavailable or scarce. Additionally, we found the inferred amino acid composition of this protein set in the LUA to be more similar to the observed composition of the same set in extant thermophilic species than in extant mesophilic species, supporting the idea that the LUA lived in a thermophilic environment. Availability: The program is available at http://compbio.cs.princeton.edu/ancestralaa |
Databáze: | OpenAIRE |
Externí odkaz: |