Bioinformatic analysis of subfamily-specific regions in 3D-structures of homologs to study functional diversity and conformational plasticity in protein superfamilies
Autor: | Daria Timonina, Dmitry A. Suplatov, Vytas K. Švedas, Yana A. Sharapova |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2021 |
Předmět: |
Subfamily
Protein family Protein superfamilies SSR Subfamily-Specific Regions RMSD Root Mean Square Deviation Protein design Biophysics Protein Data Bank (RCSB PDB) Computational biology Biology Biochemistry SSP Subfamily-Specific Position 03 medical and health sciences (H)DBSCAN (Hierarchical) Density-Based Spatial Clustering of Applications with Noise 0302 clinical medicine Structural Biology Specificity-determining positions Machine learning Genetics Protein secondary structure ComputingMethodologies_COMPUTERGRAPHICS 030304 developmental biology 0303 health sciences Drug discovery Structure-function relationship Protein superfamily Directed evolution 3D-structure analysis Computer Science Applications SDR/SDP Specificity-Determining Residue/Position 030220 oncology & carcinogenesis OPTICS Ordering Points to Identify the Clustering Structure TP248.13-248.65 Research Article Biotechnology |
Zdroj: | Computational and Structural Biotechnology Journal, Vol 19, Iss, Pp 1302-1311 (2021) Computational and Structural Biotechnology Journal |
ISSN: | 2001-0370 |
Popis: | Graphical abstract Local 3D-structural differences in homologous proteins contribute to functional diversity observed in a superfamily, but so far received little attention as bioinformatic analysis was usually carried out at the level of amino acid sequences. We have developed Zebra3D – the first-of-its-kind bioinformatic software for systematic analysis of 3D-alignments of protein families using machine learning. The new tool identifies subfamily-specific regions (SSRs) – patterns of local 3D-structure (i.e. single residues, loops, or secondary structure fragments) that are spatially equivalent within families/subfamilies, but are different among them, and thus can be associated with functional diversity and function-related conformational plasticity. Bioinformatic analysis of protein superfamilies by Zebra3D can be used to study 3D-determinants of catalytic activity and specific accommodation of ligands, help to prepare focused libraries for directed evolution or assist development of chimeric enzymes with novel properties by exchange of equivalent regions between homologs, and to characterize plasticity in binding sites. A companion Mustguseal web-server is available to automatically construct a 3D-alignment of functionally diverse proteins, thus reducing the minimal input required to operate Zebra3D to a single PDB code. The Zebra3D + Mustguseal combined approach provides the opportunity to systematically explore the value of SSRs in superfamilies and to use this information for protein design and drug discovery. The software is available open-access at https://biokinet.belozersky.msu.ru/Zebra3D. |
Databáze: | OpenAIRE |
Externí odkaz: |