Autor: |
Gregory A. Babbitt, Madhusudan Rajendran, Miranda L. Lynch, Richmond Asare-Bediako, Leora T. Mouli, Cameron J. Ryan, Harsh Srivastava, Kavya Phadke, Makayla L. Reed, Nadia Moore, Maureen C. Ferran, Ernest P. Fokoue |
Rok vydání: |
2023 |
Popis: |
Comparative methods in molecular biology and evolution rely almost entirely upon the analysis of DNA sequence and protein structure, both static forms of information. However, it is widely accepted that protein function results from dynamic machine-like motions induced during molecular interactions, a type of data for which comparative methods of analysis are challenged by the large fraction of protein motion created by random thermal noise induced by the surrounding solvent. Here, we introduce ATOMDANCE, a suite of statistical and kernel-based machine learning tools designed for identifying and comparing functional states of proteins captured in noisy molecular dynamics simulations. ATOMDANCE employs interpretable Gaussian kernel functions to compute site-wise maximum mean discrepancy (MMD) between learned features representing functional protein states (e.g. bound vs. unbound, wild-type vs. mutant). ATOMDANCE derives empirical p-values identifying functional similarities/differences in dynamics at specific sites on the protein. Depending upon mode of protein function, the machine learner can be trained upon either local atom fluctuations or reduced atom correlation matrices. Heat maps identifying amino acid sites with coordinated states of dynamics can also be produced as well. ATOMDANCE also employs MMD to contextually analyze all possible random amino-acid replacements thus allowing for a site-wise test of neutral vs. non-neutral evolution in the divergence of dynamic function in protein homologs. ATOMDANCE offers a user-friendly interface and requires as input only the structure, topolopy and trajectory files for each of the two proteins being compared. A separate interface for generating molecular dynamics simulations via open-source tools is offered as well. |
Databáze: |
OpenAIRE |
Externí odkaz: |
|