Dynamic correlations: exact and approximate methods for mutual information.

Autor: Demirtaş K; Department of Chemical Engineering, Bogazici University, 34342 Istanbul, Turkey.; Polymer Research Center, Bogazici University, 34342 Istanbul, Turkey., Erman B; Department of Chemical and Biological Engineering, Koc University, 34450 Istanbul, Turkey., Haliloğlu T; Department of Chemical Engineering, Bogazici University, 34342 Istanbul, Turkey.; Polymer Research Center, Bogazici University, 34342 Istanbul, Turkey.
Jazyk: angličtina
Zdroj: Bioinformatics (Oxford, England) [Bioinformatics] 2024 Feb 01; Vol. 40 (2).
DOI: 10.1093/bioinformatics/btae076
Abstrakt: Motivation: Proteins are dynamic entities that undergo conformational changes critical for their functions. Understanding the communication pathways and information transfer within proteins is crucial for elucidating allosteric interactions in their mechanisms. This study utilizes mutual information (MI) analysis to probe dynamic allostery. Using two cases, Ubiquitin and PLpro, we have evaluated the accuracy and limitations of different approximations including the exact anisotropic and isotropic models, multivariate Gaussian model, isotropic Gaussian model, and the Gaussian Network Model (GNM) in revealing allosteric interactions.
Results: Our findings emphasize the required trajectory length for capturing accurate mutual information profiles. Long molecular dynamics trajectories, 1 ms for Ubiquitin and 100 µs for PLpro are used as benchmarks, assuming they represent the ground truth. Trajectory lengths of approximately 5 µs for Ubiquitin and 1 µs for PLpro marked the onset of convergence, while the multivariate Gaussian model accurately captured mutual information with trajectories of 5 ns for Ubiquitin and 350 ns for PLpro. However, the isotropic Gaussian model is less successful in representing the anisotropic nature of protein dynamics, particularly in the case of PLpro, highlighting its limitations. The GNM, however, provides reasonable approximations of long-range information exchange as a minimalist network model based on a single crystal structure. Overall, the optimum trajectory lengths for effective Gaussian approximations of long-time dynamic behavior depend on the inherent dynamics within the protein's topology. The GNM, by showcasing dynamics across relatively diverse time scales, can be used either as a standalone method or to gauge the adequacy of MD simulation lengths.
Availability and Implementation: Mutual information codes are available at https://github.com/kemaldemirtas/prc-MI.git.
(© The Author(s) 2024. Published by Oxford University Press.)
Databáze: MEDLINE