Protein NMR assignment by isotope pattern recognition.

Autor: Rasulov U; School of Chemistry, University of Southampton, University Road, Southampton SO17 1BJ, UK., Wang HK; Department of Biochemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA.; Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA., Viennet T; Department of Chemistry and iNANO, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark., Droemer MA; Faculty for Chemistry and Pharmacy, Ludwig-Maximilians-Universität München, Munich, Germany., Matosin S; Department of Biochemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA.; Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA., Schindler S; Department of Biochemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA.; Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA., Sun ZJ; Department of Biochemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA.; Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA., Mureddu L; Department of Molecular and Cell Biology, Institute for Structural and Chemical Biology, University of Leicester, Lancaster Road, Leicester LE1 7HB, UK., Vuister GW; Department of Molecular and Cell Biology, Institute for Structural and Chemical Biology, University of Leicester, Lancaster Road, Leicester LE1 7HB, UK., Robson SA; Department of Biochemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA.; Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA., Arthanari H; Department of Biochemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA.; Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA., Kuprov I; School of Chemistry, University of Southampton, University Road, Southampton SO17 1BJ, UK.
Jazyk: angličtina
Zdroj: Science advances [Sci Adv] 2024 Sep 06; Vol. 10 (36), pp. eado0403. Date of Electronic Publication: 2024 Sep 04.
DOI: 10.1126/sciadv.ado0403
Abstrakt: The current standard method for amino acid signal identification in protein NMR spectra is sequential assignment using triple-resonance experiments. Good software and elaborate heuristics exist, but the process remains laboriously manual. Machine learning does help, but its training databases need millions of samples that cover all relevant physics and every kind of instrumental artifact. In this communication, we offer a solution to this problem. We propose polyadic decompositions to store millions of simulated three-dimensional NMR spectra, on-the-fly generation of artifacts during training, a probabilistic way to incorporate prior and posterior information, and integration with the industry standard CcpNmr software framework. The resulting neural nets take [ 1 H, 13 C] slices of mixed pyruvate-labeled HNCA spectra (different CA signal shapes for different residue types) and return an amino acid probability table. In combination with primary sequence information, backbones of common proteins (GB1, MBP, and INMT) are rapidly assigned from just the HNCA spectrum.
Databáze: MEDLINE