arxiv: 1412.6599 · v3 · pith:CUHCDBKVnew · submitted 2014-12-20 · 💻 cs.LG

Hot Swapping for Online Adaptation of Optimization Hyperparameters

classification 💻 cs.LG
keywords swapping, adaptation, approach, hyperparameters, learning, online, optimization, adadelta

We describe a general framework for online adaptation of optimization hyperparameters by "hot swapping" their values during learning. We investigate this approach in the context of adaptive learning rate selection using an explore-exploit strategy from the multi-armed bandit literature. Experiments on a benchmark neural network show that the hot swapping approach leads to consistently better solutions compared to well-known alternatives such as AdaDelta and stochastic gradient with exhaustive hyperparameter search.
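To make the idea concrete, here is a minimal sketch of learning rate selection treated as a multi-armed bandit. It is not the paper's algorithm: the abstract does not say which explore-exploit strategy or reward signal is used, so the epsilon-greedy rule, the per-step loss decrease used as the reward, and the names `hot_swap_descent`, `rates`, and `epsilon` are all assumptions made for illustration.

```python
import random


def hot_swap_descent(loss, grad, x0, rates, steps=200, epsilon=0.1, seed=0):
    """Gradient descent that swaps its learning rate at every step.

    Each candidate rate is a bandit arm. The reward for pulling an arm is
    the loss decrease produced by that step (an assumed reward signal).
    """
    rng = random.Random(seed)
    counts = [0] * len(rates)    # number of times each rate was used
    values = [0.0] * len(rates)  # running mean reward for each rate
    x = x0
    for _ in range(steps):
        untried = [i for i, c in enumerate(counts) if c == 0]
        if untried:
            arm = untried[0]                    # try every rate once first
        elif rng.random() < epsilon:
            arm = rng.randrange(len(rates))     # explore: random rate
        else:
            arm = max(range(len(rates)), key=lambda i: values[i])  # exploit

        before = loss(x)
        x = x - rates[arm] * grad(x)            # step with the swapped-in rate
        reward = before - loss(x)

        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]
    return x, counts


if __name__ == "__main__":
    # Toy problem (hypothetical): minimize (x - 3)^2 starting from x = 0.
    x_final, counts = hot_swap_descent(
        loss=lambda x: (x - 3.0) ** 2,
        grad=lambda x: 2.0 * (x - 3.0),
        x0=0.0,
        rates=[0.001, 0.1, 0.4],
    )
    print("final x:", x_final)
    print("pulls per learning rate:", counts)
```

The running mean used here weights old and new rewards equally, which is a simplification: rewards shrink as the optimizer converges, so a practical variant would likely discount older observations.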
