pith. sign in

arxiv: 1703.03167 · v1 · pith:NLI55FSOnew · submitted 2017-03-09 · 🧮 math.ST · stat.ML· stat.TH

Cross-validation

classification 🧮 math.ST stat.MLstat.TH
keywords cross-validationestimatorgivenaccountbestproblemriskvariance
0
0 comments X
read the original abstract

This text is a survey on cross-validation. We define all classical cross-validation procedures, and we study their properties for two different goals: estimating the risk of a given estimator, and selecting the best estimator among a given family. For the risk estimation problem, we compute the bias (which can also be corrected) and the variance of cross-validation methods. For estimator selection, we first provide a first-order analysis (based on expectations). Then, we explain how to take into account second-order terms (from variance computations, and by taking into account the usefulness of overpenalization). This allows, in the end, to provide some guidelines for choosing the best cross-validation method for a given learning problem.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.