Running PeptideProphet Separately on Replicates Improves Peptide Identification Results

Chao Yang; Weichuan Yu; Zengyou He

arxiv: 1211.6198 · v3 · pith:3SGA6TGEnew · submitted 2012-11-27 · 🧬 q-bio.QM · q-bio.GN· stat.AP

Running PeptideProphet Separately on Replicates Improves Peptide Identification Results

Chao Yang , Zengyou He , Weichuan Yu This is my paper

classification 🧬 q-bio.QM q-bio.GNstat.AP

keywords peptidepeptideprophetreplicatesdatasetimproveresultsbaggingcoverage

0 comments

read the original abstract

Limited spectrum coverage is a problem in shotgun proteomics. Replicates are generated to improve the spectrum coverage. When integrating peptide identification results obtained from replicates, the state-of-the-art algorithm PeptideProphet combines Peptide-Spectrum Matches (PSMs) before building the statistical model to calculate peptide probabilities. In this paper, we find the connection between merging results of replicates and Bagging, which is a standard routine to improve the power of statistical methods. Following Bagging's philosophy, we propose to run PeptideProphet separately on each replicate and combine the outputs to obtain the final peptide probabilities. In our experiments, we show that the proposed routine can improve PeptideProphet consistently on a standard protein dataset, a Human dataset and a Yeast dataset.

This paper has not been read by Pith yet.

Running PeptideProphet Separately on Replicates Improves Peptide Identification Results

discussion (0)