Bioinformatic analysis of subfamily-specific regions in 3D-structures of homologs to study functional diversity and conformational plasticity in protein superfamilies

Autor: Daria Timonina, Dmitry A. Suplatov, Vytas K. Švedas, Yana A. Sharapova
Jazyk: angličtina
Rok vydání: 2021
Předmět:
Subfamily
Protein family
Protein superfamilies
SSR
Subfamily-Specific Regions

RMSD
Root Mean Square Deviation

Protein design
Biophysics
Protein Data Bank (RCSB PDB)
Computational biology
Biology
Biochemistry
SSP
Subfamily-Specific Position

03 medical and health sciences
(H)DBSCAN
(Hierarchical) Density-Based Spatial Clustering of Applications with Noise

0302 clinical medicine
Structural Biology
Specificity-determining positions
Machine learning
Genetics
Protein secondary structure
ComputingMethodologies_COMPUTERGRAPHICS
030304 developmental biology
0303 health sciences
Drug discovery
Structure-function relationship
Protein superfamily
Directed evolution
3D-structure analysis
Computer Science Applications
SDR/SDP
Specificity-Determining Residue/Position

030220 oncology & carcinogenesis
OPTICS
Ordering Points to Identify the Clustering Structure

TP248.13-248.65
Research Article
Biotechnology
Zdroj: Computational and Structural Biotechnology Journal, Vol 19, Iss, Pp 1302-1311 (2021)
Computational and Structural Biotechnology Journal
ISSN: 2001-0370
Popis: Graphical abstract
Local 3D-structural differences in homologous proteins contribute to functional diversity observed in a superfamily, but so far received little attention as bioinformatic analysis was usually carried out at the level of amino acid sequences. We have developed Zebra3D – the first-of-its-kind bioinformatic software for systematic analysis of 3D-alignments of protein families using machine learning. The new tool identifies subfamily-specific regions (SSRs) – patterns of local 3D-structure (i.e. single residues, loops, or secondary structure fragments) that are spatially equivalent within families/subfamilies, but are different among them, and thus can be associated with functional diversity and function-related conformational plasticity. Bioinformatic analysis of protein superfamilies by Zebra3D can be used to study 3D-determinants of catalytic activity and specific accommodation of ligands, help to prepare focused libraries for directed evolution or assist development of chimeric enzymes with novel properties by exchange of equivalent regions between homologs, and to characterize plasticity in binding sites. A companion Mustguseal web-server is available to automatically construct a 3D-alignment of functionally diverse proteins, thus reducing the minimal input required to operate Zebra3D to a single PDB code. The Zebra3D + Mustguseal combined approach provides the opportunity to systematically explore the value of SSRs in superfamilies and to use this information for protein design and drug discovery. The software is available open-access at https://biokinet.belozersky.msu.ru/Zebra3D.
Databáze: OpenAIRE