Structural genomics analysis of uncharacterized protein families overrepresented in human gut bacteria identifies a novel glycoside hydrolase
Autor: | Yuanyuan Chang, Christian M. Zmasek, Zhanwen Li, Adam Godzik, Anna Sheydina, Daniel J. Rigden, Herbert L. Axelrod, Ruth Y. Eberhardt |
---|---|
Rok vydání: | 2014 |
Předmět: |
Protein Structure
Protein family Glycoside Hydrolases Bioinformatics 1.1 Normal biological development and functioning Domain of unknown function DUF Computational biology Carbohydrate metabolism Biology 3D structure Biochemistry Mathematical Sciences Structural genomics 03 medical and health sciences 0302 clinical medicine Bacterial Proteins Structural Biology Underpinning research Information and Computing Sciences Hydrolase Genetics Humans Bacteroides Glycoside hydrolase Protein function prediction Amino Acid Sequence Molecular Biology Peptide sequence 030304 developmental biology 0303 health sciences Applied Mathematics Human Genome Computational Biology Genomics Biological Sciences Computer Science Applications Protein Structure Tertiary Gastrointestinal Tract Bacteroides thetaiotaomicron 030217 neurology & neurosurgery Tertiary Research Article Biotechnology |
Zdroj: | BMC bioinformatics, vol 15, iss 1 BMC Bioinformatics |
Popis: | Background Bacteroides spp. form a significant part of our gut microbiome and are well known for optimized metabolism of diverse polysaccharides. Initial analysis of the archetypal Bacteroides thetaiotaomicron genome identified 172 glycosyl hydrolases and a large number of uncharacterized proteins associated with polysaccharide metabolism. Results BT_1012 from Bacteroides thetaiotaomicron VPI-5482 is a protein of unknown function and a member of a large protein family consisting entirely of uncharacterized proteins. Initial sequence analysis predicted that this protein has two domains, one on the N- and one on the C-terminal. A PSI-BLAST search found over 150 full length and over 90 half size homologs consisting only of the N-terminal domain. The experimentally determined three-dimensional structure of the BT_1012 protein confirms its two-domain architecture and structural analysis of both domains suggests their specific functions. The N-terminal domain is a putative catalytic domain with significant similarity to known glycoside hydrolases, the C-terminal domain has a beta-sandwich fold typically found in C-terminal domains of other glycosyl hydrolases, however these domains are typically involved in substrate binding. We describe the structure of the BT_1012 protein and discuss its sequence-structure relationship and their possible functional implications. Conclusions Structural and sequence analyses of the BT_1012 protein identifies it as a glycosyl hydrolase, expanding an already impressive catalog of enzymes involved in polysaccharide metabolism in Bacteroides spp. Based on this we have renamed the Pfam families representing the two domains found in the BT_1012 protein, PF13204 and PF12904, as putative glycoside hydrolase and glycoside hydrolase-associated C-terminal domain respectively. |
Databáze: | OpenAIRE |
Externí odkaz: |