Estimating the Maximum Expected Value: An Analysis of (Nested) Cross Validation and the Maximum Sample Average

Hado van Hasselt

arxiv: 1302.7175 · v2 · pith:VRJ2HTZRnew · submitted 2013-02-28 · 📊 stat.ML · cs.AI· cs.LG· stat.ME

Estimating the Maximum Expected Value: An Analysis of (Nested) Cross Validation and the Maximum Sample Average

Hado van Hasselt This is my paper

classification 📊 stat.ML cs.AIcs.LGstat.ME

keywords crossmaximumvalidationbiasvarianceaverageestimatorestimators

0 comments

read the original abstract

We investigate the accuracy of the two most common estimators for the maximum expected value of a general set of random variables: a generalization of the maximum sample average, and cross validation. No unbiased estimator exists and we show that it is non-trivial to select a good estimator without knowledge about the distributions of the random variables. We investigate and bound the bias and variance of the aforementioned estimators and prove consistency. The variance of cross validation can be significantly reduced, but not without risking a large bias. The bias and variance of different variants of cross validation are shown to be very problem-dependent, and a wrong choice can lead to very inaccurate estimates.

This paper has not been read by Pith yet.

Estimating the Maximum Expected Value: An Analysis of (Nested) Cross Validation and the Maximum Sample Average

discussion (0)