Some Theory For Practical Classifier Validation

Eric Bax; Ya Le

arXiv:1510.02676v1 · pith:Y7YE5TEL · submitted 2015-10-09 · stat.ML, cs.LG

keywords: classifier, data, validation, holdout, in-sample, theory, trained, training

Abstract

We compare and contrast two approaches to validating a trained classifier while using all in-sample data for training. One is simultaneous validation over an organized set of hypotheses (SVOOSH), the well-known method that began with VC theory. The other is withhold and gap (WAG). WAG withholds a validation set, trains a holdout classifier on the remaining data, uses the validation data to validate that classifier, then adds the rate of disagreement between the holdout classifier and one trained using all in-sample data, which is an upper bound on the difference in error rates. We show that complex hypothesis classes and limited training data can make WAG a favorable alternative.
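The WAG procedure described in the abstract can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation. The toy nearest-centroid learner, the one-sided Hoeffding margin used to validate the holdout classifier, the synthetic data, and all function names are hypothetical choices made for this example. The sketch also assumes a pool of unlabeled inputs on which to measure disagreement. It reports the empirical disagreement rate on that pool and adds no confidence term for that estimate. The abstract does not say how the disagreement rate is obtained.

```python
import math
import random


def train_centroid(examples):
    """Toy learner (hypothetical stand-in): 1-D nearest-centroid classifier."""
    sums = {0: 0.0, 1: 0.0}
    counts = {0: 0, 1: 0}
    for x, y in examples:
        sums[y] += x
        counts[y] += 1
    c0 = sums[0] / max(counts[0], 1)
    c1 = sums[1] / max(counts[1], 1)
    return lambda x: 0 if abs(x - c0) <= abs(x - c1) else 1


def hoeffding_margin(n, delta):
    """One-sided Hoeffding margin: holds with probability at least 1 - delta."""
    return math.sqrt(math.log(1.0 / delta) / (2.0 * n))


def wag_bound(in_sample, unlabeled, n_val, delta, train=train_centroid):
    """Withhold and gap, as sketched from the abstract (assumptions noted above)."""
    # Withhold a validation set; train the holdout classifier on the rest.
    validation, remaining = in_sample[:n_val], in_sample[n_val:]
    holdout = train(remaining)
    # Train the classifier of interest on all in-sample data.
    full = train(in_sample)
    # Validate the holdout classifier on the withheld examples.
    val_error = sum(holdout(x) != y for x, y in validation) / len(validation)
    holdout_bound = val_error + hoeffding_margin(len(validation), delta)
    # Gap term: the full classifier can only err where the holdout classifier
    # errs or where the two disagree. Estimated here on unlabeled inputs
    # (an assumption of this sketch).
    disagreement = sum(holdout(x) != full(x) for x in unlabeled) / len(unlabeled)
    return {
        "val_error": val_error,
        "holdout_bound": holdout_bound,
        "disagreement": disagreement,
        "wag_bound": holdout_bound + disagreement,
    }


def sample(n, rng):
    """Synthetic data (hypothetical): class y is drawn from N(2y - 1, 1)."""
    data = []
    for _ in range(n):
        y = rng.randint(0, 1)
        data.append((rng.gauss(2.0 * y - 1.0, 1.0), y))
    return data


rng = random.Random(0)
in_sample = sample(1000, rng)
unlabeled = [x for x, _ in sample(2000, rng)]
result = wag_bound(in_sample, unlabeled, n_val=200, delta=0.05)
print(result)
```

With a simple learner and plenty of training data, the holdout and full classifiers rarely disagree, so the gap term stays small. The abstract's claim concerns the opposite regime, complex hypothesis classes and limited training data, where simultaneous bounds over the whole class become loose.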
