Robust automated backbone triple resonance NMR assignments of proteins using Bayesian-based simulated annealing.

Autor: Bishop AC; Department of Biochemistry & Biophysics, Texas A&M University, College Station, TX, 77843, USA., Torres-Montalvo G; Department of Biochemistry & Biophysics, Texas A&M University, College Station, TX, 77843, USA., Kotaru S; Graduate Group in Biochemistry & Molecular Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19014, USA., Mimun K; Department of Biochemistry & Biophysics, Texas A&M University, College Station, TX, 77843, USA., Wand AJ; Department of Biochemistry & Biophysics, Texas A&M University, College Station, TX, 77843, USA. josh.wand@ag.tamu.edu.; Graduate Group in Biochemistry & Molecular Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19014, USA. josh.wand@ag.tamu.edu.; Department of Chemistry, Texas A&M University, College Station, TX, 77843, USA. josh.wand@ag.tamu.edu.; Department of Molecular & Cellular Medicine, Texas A&M University, College Station, TX, 77843, USA. josh.wand@ag.tamu.edu.
Jazyk: angličtina
Zdroj: Nature communications [Nat Commun] 2023 Mar 21; Vol. 14 (1), pp. 1556. Date of Electronic Publication: 2023 Mar 21.
DOI: 10.1038/s41467-023-37219-z
Abstrakt: Assignment of resonances of nuclear magnetic resonance (NMR) spectra to specific atoms within a protein remains a labor-intensive and challenging task. Automation of the assignment process often remains a bottleneck in the exploitation of solution NMR spectroscopy for the study of protein structure-dynamics-function relationships. We present an approach to the assignment of backbone triple resonance spectra of proteins. A Bayesian statistical analysis of predicted and observed chemical shifts is used in conjunction with inter-spin connectivities provided by triple resonance spectroscopy to calculate a pseudo-energy potential that drives a simulated annealing search for the most optimal set of resonance assignments. Termed Bayesian Assisted Assignments by Simulated Annealing (BARASA), a C++ program implementation is tested against systems ranging in size to over 450 amino acids including examples of intrinsically disordered proteins. BARASA is fast, robust, accommodates incomplete and incorrect information, and outperforms current algorithms - especially in cases of sparse data and is sufficiently fast to allow for real-time evaluation during data acquisition.
(© 2023. The Author(s).)
Databáze: MEDLINE