Popis: |
This study addresses the following fundamental question: Do sequences of protein domains with sandwich architecture have common sequence characteristics even though they belong to different superfamilies and folds? The analysis was carried out in two stages: determination of substructures in the domains that are common to all sandwich proteins; and detection of common sequence characteristics within the substructures. Analysis of supersecondary structures in domains of proteins revealed two types of four-strand substructures that are common to sandwich proteins. At least one of these common substructures was found in proteins of 42 sandwich-like folds (as per structural classification in the CATH database). Comparison of the sequence fragments corresponding to strands that make up the common substructures revealed specific rules of distribution of hydrophobic residues within these strands. These rules can be conceptualized as grammatical rules of beta protein linguistics. Understanding of the structural and sequence commonalities of sandwich proteins may also be useful for rational protein design. |