
arxiv: 1302.7175 · v2 · pith:VRJ2HTZRnew · submitted 2013-02-28 · stat.ML · cs.AI · cs.LG · stat.ME

Estimating the Maximum Expected Value: An Analysis of (Nested) Cross Validation and the Maximum Sample Average

keywords: cross, maximum, validation, bias, variance, average, estimator, estimators

We investigate the accuracy of the two most common estimators for the maximum expected value of a general set of random variables: a generalization of the maximum sample average, and cross validation. No unbiased estimator exists and we show that it is non-trivial to select a good estimator without knowledge about the distributions of the random variables. We investigate and bound the bias and variance of the aforementioned estimators and prove consistency. The variance of cross validation can be significantly reduced, but not without risking a large bias. The bias and variance of different variants of cross validation are shown to be very problem-dependent, and a wrong choice can lead to very inaccurate estimates.
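
The contrast between the two estimators can be illustrated with a small simulation. The sketch below is not taken from the paper: the function names (`max_sample_average`, `cross_validation_estimate`, `simulate`), the plain K-fold splitting scheme, and the Gaussian test setup are all assumptions chosen for illustration. Every random variable has mean 0, so the true maximum expected value is 0 and any systematic deviation of an estimator's average is bias.

```python
import random
from statistics import fmean


def max_sample_average(samples):
    """Maximum sample average: the largest per-variable sample mean.

    samples is a list of lists, one list of draws per random variable.
    """
    return max(fmean(s) for s in samples)


def cross_validation_estimate(samples, k=2):
    """Illustrative K-fold cross validation estimate (assumed variant).

    For each fold, the variable with the highest mean on the other
    folds is selected, and its mean on the held-out fold is recorded.
    The recorded values are averaged over the k folds.
    """
    fold_estimates = []
    for fold in range(k):
        train_means = []
        test_means = []
        for s in samples:
            test = s[fold::k]
            train = [x for j, x in enumerate(s) if j % k != fold]
            train_means.append(fmean(train))
            test_means.append(fmean(test))
        best = max(range(len(samples)), key=train_means.__getitem__)
        fold_estimates.append(test_means[best])
    return fmean(fold_estimates)


def simulate(num_vars=10, n=20, trials=2000, seed=0):
    """Average both estimators over many trials of N(0, 1) variables.

    The true maximum expected value is 0 in this hypothetical setup,
    so the returned averages approximate each estimator's bias.
    """
    rng = random.Random(seed)
    msa_values, cv_values = [], []
    for _ in range(trials):
        samples = [[rng.gauss(0.0, 1.0) for _ in range(n)]
                   for _ in range(num_vars)]
        msa_values.append(max_sample_average(samples))
        cv_values.append(cross_validation_estimate(samples))
    return fmean(msa_values), fmean(cv_values)


if __name__ == "__main__":
    msa_bias, cv_bias = simulate()
    print(f"maximum sample average, mean estimate: {msa_bias:.3f}")
    print(f"cross validation, mean estimate:       {cv_bias:.3f}")
```

In this setup the maximum sample average should come out clearly above 0, because it selects and evaluates on the same draws, while the cross validation average should stay near 0, because selection and evaluation use disjoint draws. With unequal means the cross validation estimate can instead fall below the true value, which is the kind of problem dependence the abstract refers to.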
