High-quality MS/MS spectrum prediction for data-dependent and data-independent acquisition data analysis.

Autor: Tiwary S; Computational Systems Biochemistry Research Group, Max Planck Institute of Biochemistry, Martinsried, Germany., Levy R; Verily Life Sciences, South San Francisco, CA, USA., Gutenbrunner P; Computational Systems Biochemistry Research Group, Max Planck Institute of Biochemistry, Martinsried, Germany., Salinas Soto F; Computational Systems Biochemistry Research Group, Max Planck Institute of Biochemistry, Martinsried, Germany., Palaniappan KK; Verily Life Sciences, South San Francisco, CA, USA., Deming L; Google LLC, Mountain View, CA, USA., Berndl M; Google LLC, Mountain View, CA, USA., Brant A; Verily Life Sciences, South San Francisco, CA, USA., Cimermancic P; Verily Life Sciences, South San Francisco, CA, USA. cpeter@verily.com., Cox J; Computational Systems Biochemistry Research Group, Max Planck Institute of Biochemistry, Martinsried, Germany. cox@biochem.mpg.de.; Department of Biological and Medical Psychology, University of Bergen, Bergen, Norway. cox@biochem.mpg.de.
Jazyk: angličtina
Zdroj: Nature methods [Nat Methods] 2019 Jun; Vol. 16 (6), pp. 519-525. Date of Electronic Publication: 2019 May 27.
DOI: 10.1038/s41592-019-0427-6
Abstrakt: Peptide fragmentation spectra are routinely predicted in the interpretation of mass-spectrometry-based proteomics data. However, the generation of fragment ions has not been understood well enough for scientists to estimate fragment ion intensities accurately. Here, we demonstrate that machine learning can predict peptide fragmentation patterns in mass spectrometers with accuracy within the uncertainty of measurement. Moreover, analysis of our models reveals that peptide fragmentation depends on long-range interactions within a peptide sequence. We illustrate the utility of our models by applying them to the analysis of both data-dependent and data-independent acquisition datasets. In the former case, we observe a q-value-dependent increase in the total number of peptide identifications. In the latter case, we confirm that the use of predicted tandem mass spectrometry spectra is nearly equivalent to the use of spectra from experimental libraries.
Databáze: MEDLINE