Structural genomics analysis of uncharacterized protein families overrepresented in human gut bacteria identifies a novel glycoside hydrolase

Autor: Yuanyuan Chang, Christian M. Zmasek, Zhanwen Li, Adam Godzik, Anna Sheydina, Daniel J. Rigden, Herbert L. Axelrod, Ruth Y. Eberhardt
Rok vydání: 2014
Předmět:
Protein Structure
Protein family
Glycoside Hydrolases
Bioinformatics
1.1 Normal biological development and functioning
Domain of unknown function
DUF
Computational biology
Carbohydrate metabolism
Biology
3D structure
Biochemistry
Mathematical Sciences
Structural genomics
03 medical and health sciences
0302 clinical medicine
Bacterial Proteins
Structural Biology
Underpinning research
Information and Computing Sciences
Hydrolase
Genetics
Humans
Bacteroides
Glycoside hydrolase
Protein function prediction
Amino Acid Sequence
Molecular Biology
Peptide sequence
030304 developmental biology
0303 health sciences
Applied Mathematics
Human Genome
Computational Biology
Genomics
Biological Sciences
Computer Science Applications
Protein Structure
Tertiary

Gastrointestinal Tract
Bacteroides thetaiotaomicron
030217 neurology & neurosurgery
Tertiary
Research Article
Biotechnology
Zdroj: BMC bioinformatics, vol 15, iss 1
BMC Bioinformatics
Popis: Background Bacteroides spp. form a significant part of our gut microbiome and are well known for optimized metabolism of diverse polysaccharides. Initial analysis of the archetypal Bacteroides thetaiotaomicron genome identified 172 glycosyl hydrolases and a large number of uncharacterized proteins associated with polysaccharide metabolism. Results BT_1012 from Bacteroides thetaiotaomicron VPI-5482 is a protein of unknown function and a member of a large protein family consisting entirely of uncharacterized proteins. Initial sequence analysis predicted that this protein has two domains, one on the N- and one on the C-terminal. A PSI-BLAST search found over 150 full length and over 90 half size homologs consisting only of the N-terminal domain. The experimentally determined three-dimensional structure of the BT_1012 protein confirms its two-domain architecture and structural analysis of both domains suggests their specific functions. The N-terminal domain is a putative catalytic domain with significant similarity to known glycoside hydrolases, the C-terminal domain has a beta-sandwich fold typically found in C-terminal domains of other glycosyl hydrolases, however these domains are typically involved in substrate binding. We describe the structure of the BT_1012 protein and discuss its sequence-structure relationship and their possible functional implications. Conclusions Structural and sequence analyses of the BT_1012 protein identifies it as a glycosyl hydrolase, expanding an already impressive catalog of enzymes involved in polysaccharide metabolism in Bacteroides spp. Based on this we have renamed the Pfam families representing the two domains found in the BT_1012 protein, PF13204 and PF12904, as putative glycoside hydrolase and glycoside hydrolase-associated C-terminal domain respectively.
Databáze: OpenAIRE