pith. sign in

arxiv: 1211.6198 · v3 · pith:3SGA6TGEnew · submitted 2012-11-27 · 🧬 q-bio.QM · q-bio.GN· stat.AP

Running PeptideProphet Separately on Replicates Improves Peptide Identification Results

classification 🧬 q-bio.QM q-bio.GNstat.AP
keywords peptidepeptideprophetreplicatesdatasetimproveresultsbaggingcoverage
0
0 comments X
read the original abstract

Limited spectrum coverage is a problem in shotgun proteomics. Replicates are generated to improve the spectrum coverage. When integrating peptide identification results obtained from replicates, the state-of-the-art algorithm PeptideProphet combines Peptide-Spectrum Matches (PSMs) before building the statistical model to calculate peptide probabilities. In this paper, we find the connection between merging results of replicates and Bagging, which is a standard routine to improve the power of statistical methods. Following Bagging's philosophy, we propose to run PeptideProphet separately on each replicate and combine the outputs to obtain the final peptide probabilities. In our experiments, we show that the proposed routine can improve PeptideProphet consistently on a standard protein dataset, a Human dataset and a Yeast dataset.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.