Molecular replacement using structure predictions from databases
Autor: | Daniel J. Rigden, Felix Simkovic, Ronan M. Keegan, Adam J. Simpkin, Jens M. H. Thomas |
---|---|
Rok vydání: | 2019 |
Předmět: |
Models
Molecular databases Protein Conformation Computer science Pipeline (computing) Ab initio ab initio structure predictions Scale (descriptive set theory) Phase problem Crystallography X-Ray computer.software_genre 03 medical and health sciences Structural Biology Molecular replacement Databases Protein 030304 developmental biology Structure (mathematical logic) 0303 health sciences Database 030302 biochemistry & molecular biology Proteins molecular replacement 3. Good health Range (mathematics) Gremlin (programming language) ab initio modelling Ccp4 computer Algorithms Software |
Zdroj: | ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY Acta Crystallographica. Section D, Structural Biology |
ISSN: | 2059-7983 |
Popis: | Predicted ab initio protein models from online databases are a useful supplement to the PDB for molecular replacement, but usually require nontrivial processing to succeed. Molecular replacement (MR) is the predominant route to solution of the phase problem in macromolecular crystallography. Where the lack of a suitable homologue precludes conventional MR, one option is to predict the target structure using bioinformatics. Such modelling, in the absence of homologous templates, is called ab initio or de novo modelling. Recently, the accuracy of such models has improved significantly as a result of the availability, in many cases, of residue-contact predictions derived from evolutionary covariance analysis. Covariance-assisted ab initio models representing structurally uncharacterized Pfam families are now available on a large scale in databases, potentially representing a valuable and easily accessible supplement to the PDB as a source of search models. Here, the unconventional MR pipeline AMPLE is employed to explore the value of structure predictions in the GREMLIN and PconsFam databases. It was tested whether these deposited predictions, processed in various ways, could solve the structures of PDB entries that were subsequently deposited. The results were encouraging: nine of 27 GREMLIN cases were solved, covering target lengths of 109–355 residues and a resolution range of 1.4–2.9 Å, and with target–model shared sequence identity as low as 20%. The cluster-and-truncate approach in AMPLE proved to be essential for most successes. For the overall lower quality structure predictions in the PconsFam database, remodelling with Rosetta within the AMPLE pipeline proved to be the best approach, generating ensemble search models from single-structure deposits. Finally, it is shown that the AMPLE-obtained search models deriving from GREMLIN deposits are of sufficiently high quality to be selected by the sequence-independent MR pipeline SIMBAD. Overall, the results help to point the way towards the optimal use of the expanding databases of ab initio structure predictions. |
Databáze: | OpenAIRE |
Externí odkaz: | |
Nepřihlášeným uživatelům se plný text nezobrazuje | K zobrazení výsledku je třeba se přihlásit. |