pith. sign in

arxiv: 1510.02676 · v1 · pith:Y7YE5TELnew · submitted 2015-10-09 · 📊 stat.ML · cs.LG

Some Theory For Practical Classifier Validation

classification 📊 stat.ML cs.LG
keywords classifierdatavalidationholdoutin-sampletheorytrainedtraining
0
0 comments X
read the original abstract

We compare and contrast two approaches to validating a trained classifier while using all in-sample data for training. One is simultaneous validation over an organized set of hypotheses (SVOOSH), the well-known method that began with VC theory. The other is withhold and gap (WAG). WAG withholds a validation set, trains a holdout classifier on the remaining data, uses the validation data to validate that classifier, then adds the rate of disagreement between the holdout classifier and one trained using all in-sample data, which is an upper bound on the difference in error rates. We show that complex hypothesis classes and limited training data can make WAG a favorable alternative.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.