Early Stopping without a Validation Set
read the original abstract
Early stopping is a widely used technique to prevent poor generalization performance when training an over-expressive model by means of gradient-based optimization. To find a good point to halt the optimizer, a common practice is to split the dataset into a training and a smaller validation set to obtain an ongoing estimate of the generalization performance. We propose a novel early stopping criterion based on fast-to-compute local statistics of the computed gradients and entirely removes the need for a held-out validation set. Our experiments show that this is a viable approach in the setting of least-squares and logistic regression, as well as neural networks.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
ScoreStop: Gradient-based early stopping using functional score tests
ScoreStop introduces a functional score test for early stopping in gradient boosting, testing the null that the current predictor minimizes population risk with a scale-invariant statistic of known asymptotic distribution.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.