Sequence similarity-driven proteomics in organisms with unknown genomes by LC-MS/MS and automatedde novo sequencing

Autor: Shamil R. Sunyaev, Ari Frank, Vineeth Surendranath, Andandrej Shevchenko, Henrik Thomas, Patrice Waridel, Pavel A. Pevzner
Rok vydání: 2007
Předmět:
Zdroj: PROTEOMICS. 7:2318-2329
ISSN: 1615-9861
1615-9853
DOI: 10.1002/pmic.200700003
Popis: LC-MS/MS analysis on a linear ion trap LTQ mass spectrometer, combined with data processing, stringent, and sequence-similarity database searching tools, was employed in a layered manner to identify proteins in organisms with unsequenced genomes. Highly specific stringent searches (MASCOT) were applied as a first layer screen to identify either known (i.e. present in a database) proteins, or unknown proteins sharing identical peptides with related database sequences. Once the confidently matched spectra were removed, the remainder was filtered against a nonannotated library of background spectra that cleaned up the dataset from spectra of common protein and chemical contaminants. The rectified spectral dataset was further subjected to rapid batch de novo interpretation by PepNovo software, followed by the MS BLAST sequence-similarity search that used multiple redundant and partially accurate candidate peptide sequences. Importantly, a single dataset was acquired at the uncompromised sensitivity with no need of manual selection of MS/MS spectra for subsequent de novo interpretation. This approach enabled a completely automated identification of novel proteins that were, otherwise, missed by conventional database searches.
Databáze: OpenAIRE