Hot Swapping for Online Adaptation of Optimization Hyperparameters

Dennis DeCoste; Kevin Bache; Padhraic Smyth

arxiv: 1412.6599 · v3 · pith:CUHCDBKVnew · submitted 2014-12-20 · 💻 cs.LG

Hot Swapping for Online Adaptation of Optimization Hyperparameters

Kevin Bache , Dennis DeCoste , Padhraic Smyth This is my paper

classification 💻 cs.LG

keywords swappingadaptationapproachhyperparameterslearningonlineoptimizationadadelta

0 comments

read the original abstract

We describe a general framework for online adaptation of optimization hyperparameters by `hot swapping' their values during learning. We investigate this approach in the context of adaptive learning rate selection using an explore-exploit strategy from the multi-armed bandit literature. Experiments on a benchmark neural network show that the hot swapping approach leads to consistently better solutions compared to well-known alternatives such as AdaDelta and stochastic gradient with exhaustive hyperparameter search.

This paper has not been read by Pith yet.

Hot Swapping for Online Adaptation of Optimization Hyperparameters

discussion (0)