Empirical margin distributions and bounding the generalization error of combined classifiers

Dmitry Panchenko; Vladimir Koltchinskii

arxiv: math/0405343 · v1 · submitted 2004-05-18 · 🧮 math.PR

Empirical margin distributions and bounding the generalization error of combined classifiers

Vladimir Koltchinskii , Dmitry Panchenko This is my paper

classification 🧮 math.PR

keywords classifiersempiricalerrorgeneralizationmarginboundingdistributionbartlett

0 comments

read the original abstract

We prove new probabilistic upper bounds on generalization error of complex classifiers that are combinations of simple classifiers. Such combinations could be implemented by neural networks or by voting methods of combining the classifiers, such as boosting and bagging. The bounds are in terms of the empirical distribution of the margin of the combined classifier. They are based on the methods of the theory of Gaussian and empirical processes (comparison inequalities, symmetrization method, concentration inequalities) and they improve previous results of Bartlett (1998) on bounding the generalization error of neural networks in terms of l_1-norms of the weights of neurons and of Schapire, Freund, Bartlett and Lee (1998) on bounding the generalization error of boosting. We also obtain rates of convergence in Levy distance of empirical margin distribution to the true margin distribution uniformly over the classes of classifiers and prove the optimality of these rates.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Functional-Class Meta-Analytic Framework for Quantifying Surrogate Resilience
stat.ME 2026-04 unverdicted novelty 6.0

A meta-analytic framework estimates the resilience probability of a surrogate marker to the surrogate paradox in a new study by modeling deviations from functional relationships observed in completed trials.