An Atomistic Statistically Effective Energy Function for Computational Protein Design
Autor: | Isabelle André, Christopher M. Topham, Sophie Barbe |
---|---|
Přispěvatelé: | Laboratoire d'Ingénierie des Systèmes Biologiques et des Procédés (LISBP), Centre National de la Recherche Scientifique (CNRS)-Institut National des Sciences Appliquées - Toulouse (INSA Toulouse), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Institut National de la Recherche Agronomique (INRA), We gratefully acknowledge access granted to the HPC resources of the Midi-Pyrenees Regional Computing Centre (CALMIP, Toulouse, France)., ANR-12-MONU-0015,ProtiCAD,Modèles multi-physiques et algorithmes robotiques pour la conception assistée par ordinateur de protéines(2012), French National Research Agency (ANR Project PROTICAD) [ANR-12-MONU-0015-03] Funding Text : This work was supported by the French National Research Agency (ANR Project PROTICAD, ANR-12-MONU-0015-03). We gratefully acknowledge access granted to the HPC resources of the Midi-Pyrenees Regional Computing Centre (CALMIP, Toulouse, France)., Institut National de la Recherche Agronomique (INRA)-Institut National des Sciences Appliquées - Toulouse (INSA Toulouse), Institut National des Sciences Appliquées (INSA)-Université de Toulouse (UT)-Institut National des Sciences Appliquées (INSA)-Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS) |
Rok vydání: | 2016 |
Předmět: |
0301 basic medicine
Protein Folding Protein Conformation Globular protein [SDV]Life Sciences [q-bio] Protein domain Protein design Ligands globular-proteins 010402 general chemistry Polymorphism Single Nucleotide 01 natural sciences Accessible surface area Viral Proteins 03 medical and health sciences Superoxide Dismutase-1 Protein structure Atom scoring function data-bank Bacteriophage T4 Humans Statistical physics Physical and Theoretical Chemistry Databases Protein knowledge-based potentials Protein Unfolding ligand-binding affinities chemistry.chemical_classification Quantitative Biology::Biomolecules reference state improves Function (mathematics) structure prediction 0104 chemical sciences Computer Science Applications 030104 developmental biology chemistry Mutagenesis mean force Thermodynamics Muramidase accessible surface-area Protein folding Atomic physics quasi-chemical approximation |
Zdroj: | Journal of Chemical Theory and Computation Journal of Chemical Theory and Computation, American Chemical Society, 2016, 12 (8), pp.4146-4168. ⟨10.1021/acs.jctc.6b00090⟩ Journal of Chemical Theory and Computation, 2016, 12 (8), pp.4146-4168. ⟨10.1021/acs.jctc.6b00090⟩ |
ISSN: | 1549-9626 1549-9618 |
Popis: | Shortcomings in the definition of effective free-energy surfaces of proteins are recognized to be a major contributory factor responsible for the low success rates of existing automated methods for computational protein design (CPD). The formulation of an atomistic statistically effective energy function (SEEF) suitable for a wide range of CPD applications and its derivation from structural data extracted from protein domains and protein-ligand complexes are described here. The proposed energy function comprises nonlocal atom-based and local residue based SEEFs, which are coupled using a novel atom connectivity number factor to scale short-range, pairwise, nonbonded atomic interaction energies and a surface-area-dependent cavity energy term. This energy function was used to derive additional SEEFs describing the unfolded-state ensemble of any given residue sequence based on computed average energies for partially or fully solvent-exposed fragments in regions of irregular structure in native proteins. Relative thermal stabilities of 97 T4 bacteriophage lysozyme mutants were predicted from calculated energy differences for folded and unfolded states with an average unsigned error (AUE) of 0.84 kcal mol(-1) when compared to experiment. To demonstrate the utility of the energy function for CPD, further validation was carried out in tests of its capacity to recover cognate protein sequences and to discriminate native and near-native protein folds, loop conformers, and small-molecule ligand binding poses from non-native benchmark decoys. Experimental ligand binding free energies for a diverse set of 80 protein complexes could be predicted with an AUE of 2.4 kcal mol(-1) using an additional energy term to account for the loss in ligand configurational entropy upon binding. The atomistic SEEF is expected to improve the accuracy of residue-based coarse-grained SEEFs currently used in CPD and to extend the range of applications of extant atom-based protein statistical potentials. |
Databáze: | OpenAIRE |
Externí odkaz: |