Risk Bounds for the Majority Vote: From a PAC-Bayesian Analysis to a Learning Algorithm

Alexandre Lacasse; François Laviolette; Jean-Francis Roy; Mario Marchand; Pascal Germain

arXiv: 1503.08329 · v2 · pith:JOXG7JTUnew · submitted 2015-03-28 · stat.ML · cs.LG

keywords: analysis, PAC-Bayesian, bounds, C-bound, extensive, learning, majority, MinCq

Abstract

We propose an extensive analysis of the behavior of majority votes in binary classification. In particular, we introduce a risk bound for majority votes, called the C-bound, that takes into account the average quality of the voters and their average disagreement. We also propose an extensive PAC-Bayesian analysis that shows how the C-bound can be estimated from various observations contained in the training data. The analysis intends to be self-contained and can be used as introductory material to PAC-Bayesian statistical learning theory. It starts from a general PAC-Bayesian perspective and ends with uncommon PAC-Bayesian bounds. Some of these bounds contain no Kullback-Leibler divergence and others allow kernel functions to be used as voters (via the sample compression setting). Finally, out of the analysis, we propose the MinCq learning algorithm that basically minimizes the C-bound. MinCq reduces to a simple quadratic program. Aside from being theoretically grounded, MinCq achieves state-of-the-art performance, as shown in our extensive empirical comparison with both AdaBoost and the Support Vector Machine.
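To make the roles of "average quality" and "average disagreement" concrete, here is a minimal Python sketch that evaluates the C-bound on a finite sample. It assumes the commonly stated form of the bound, 1 - (1 - 2R)^2 / (1 - 2d), where R is the Gibbs risk (the weighted average error of the voters) and d is their expected disagreement; the abstract itself does not give the formula. The function name `empirical_c_bound` and the toy data are hypothetical. The value computed is an empirical quantity only, not one of the PAC-Bayesian guarantees, and the sketch is not the MinCq algorithm.

```python
import numpy as np


def empirical_c_bound(votes, labels, weights):
    """Empirical C-bound of a weighted majority vote (illustrative sketch).

    votes:   (n_voters, n_examples) array with entries in {-1, +1}
    labels:  (n_examples,) array with entries in {-1, +1}
    weights: (n_voters,) non-negative array summing to 1 (the distribution Q)
    """
    votes = np.asarray(votes, dtype=float)
    labels = np.asarray(labels, dtype=float)
    weights = np.asarray(weights, dtype=float)

    # Margin of each example: y * sum_i q_i * h_i(x)
    margins = labels * (weights @ votes)

    mu1 = margins.mean()         # equals 1 - 2 * (Gibbs risk)
    mu2 = (margins ** 2).mean()  # equals 1 - 2 * (expected disagreement)

    # The bound only applies when the voters are better than chance on average.
    if mu1 <= 0:
        raise ValueError("C-bound requires a Gibbs risk below 1/2")

    return 1.0 - mu1 ** 2 / mu2


# Toy example: three voters, each wrong on a different example.
votes = [[1, 1, 1, -1],
         [1, 1, -1, 1],
         [1, -1, 1, 1]]
labels = [1, 1, 1, 1]
weights = [1 / 3, 1 / 3, 1 / 3]

bound = empirical_c_bound(votes, labels, weights)
print(f"empirical C-bound: {bound:.3f}")
```

In this toy case each voter errs on one of four examples, so the Gibbs risk is 0.25, yet the majority vote makes no error because the voters disagree on where they fail. The sketch returns a value of about 0.25 here, and the value shrinks as disagreement grows for a fixed Gibbs risk.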

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Concentration and Calibration in Predictive Bayesian Inference
stat.ME 2026-05 unverdicted novelty 6.0

Predictive Bayesian inference posteriors concentrate onto a forward-model-dependent quantity and produce miscalibrated credible sets unless the predictive model contains the true data-generating process.

Margin-Adaptive Confidence Ranking for Reliable LLM Judgement
cs.LG 2026-05 unverdicted novelty 5.0

Introduces a margin-adaptive confidence ranking method that learns an estimator from simulated diversity and derives margin-dependent generalization bounds for use in fixed-sequence testing of LLM-human agreement.