Validation of nuclear magnetic resonance structures of proteins and nucleic acids: hydrogen geometry and nomenclature.

Authors: Doreleijers JF (Bijvoet Center for Biomolecular Research, Utrecht University, The Netherlands), Vriend G, Raves ML, Kaptein R
Language: English
Source: Proteins [Proteins] 1999 Nov 15; Vol. 37 (3), pp. 404-16.
DOI: 10.1002/(sici)1097-0134(19991115)37:3<404::aid-prot8>3.0.co;2-2
Abstract: A statistical analysis is reported of 1,200 of the 1,404 nuclear magnetic resonance (NMR)-derived protein and nucleic acid structures deposited in the Protein Data Bank (PDB) before 1999. Excluded from this analysis were the entries not yet fully validated by the PDB and the more than 100 entries that contained <95% of the expected hydrogens. The aim was to assess the geometry of the hydrogens in the remaining structures and to provide a check on their nomenclature. Deviations in bond lengths, bond angles, improper dihedral angles, and planarity with respect to estimated values were checked. More than 100 entries showed anomalous protonation states for some of their amino acids. Approximately 250,000 (1.7%) atom names differed from the consensus PDB nomenclature. Most of the inconsistencies are due to swapped prochiral labeling. Large deviations from the expected geometry exist for a considerable number of entries, many of which are average structures. The most common causes for these deviations seem to be poor minimization of average structures and an improper balance between force-field constraints for experimental and holonomic data. Some specific geometric outliers are related to the refinement programs used. A number of recommendations for biomolecular databases, modeling programs, and authors submitting biomolecular structures are given.
Database: MEDLINE