EBprot: Statistical analysis of labeling-based quantitative proteomics data
Autor: | Hyungwon Choi, Hannah L. F. Swa, Jayantha Gunaratne, Damian Fermin, Siok Ghee Ler, Hiromi W. L. Koh |
---|---|
Rok vydání: | 2015 |
Předmět: |
Proteomics
Proteome Computer science Quantitative proteomics computer.software_genre Biochemistry Mass Spectrometry Mice Cell Line Tumor Animals Humans Computer Simulation Differential expression Shotgun proteomics Molecular Biology Software suite Epidermal Growth Factor Design of experiments Computational Biology Reproducibility of Results Models Theoretical HCT116 Cells Phosphoproteins Identifier Isotope Labeling Data mining Peptides computer Algorithms HeLa Cells |
Zdroj: | PROTEOMICS. 15:2580-2591 |
ISSN: | 1615-9853 |
Popis: | Labeling-based proteomics is a powerful method for detection of differentially expressed proteins (DEPs). The current data analysis platform typically relies on protein-level ratios, which is obtained by summarizing peptide-level ratios for each protein. In shotgun proteomics, however, some proteins are quantified with more peptides than others, and this reproducibility information is not incorporated into the differential expression (DE) analysis. Here, we propose a novel probabilistic framework EBprot that directly models the peptide-protein hierarchy and rewards the proteins with reproducible evidence of DE over multiple peptides. To evaluate its performance with known DE states, we conducted a simulation study to show that the peptide-level analysis of EBprot provides better receiver-operating characteristic and more accurate estimation of the false discovery rates than the methods based on protein-level ratios. We also demonstrate superior classification performance of peptide-level EBprot analysis in a spike-in dataset. To illustrate the wide applicability of EBprot in different experimental designs, we applied EBprot to a dataset for lung cancer subtype analysis with biological replicates and another dataset for time course phosphoproteome analysis of EGF-stimulated HeLa cells with multiplexed labeling. Through these examples, we show that the peptide-level analysis of EBprot is a robust alternative to the existing statistical methods for the DE analysis of labeling-based quantitative datasets. The software suite is freely available on the Sourceforge website http://ebprot.sourceforge.net/. All MS data have been deposited in the ProteomeXchange with identifier PXD001426 (http://proteomecentral.proteomexchange.org/dataset/PXD001426/). |
Databáze: | OpenAIRE |
Externí odkaz: |