pith. machine review for the scientific record. sign in

arxiv: 1807.01774 · v1 · submitted 2018-07-04 · 💻 cs.LG · stat.ML

Recognition: unknown

BOHB: Robust and Efficient Hyperparameter Optimization at Scale

Authors on Pith no claims yet
classification 💻 cs.LG stat.ML
keywords optimizationbayesianhyperparameternetworksneuralbandit-basedbestconfigurations
0
0 comments X
read the original abstract

Modern deep learning methods are very sensitive to many hyperparameters, and, due to the long training times of state-of-the-art models, vanilla Bayesian hyperparameter optimization is typically computationally infeasible. On the other hand, bandit-based configuration evaluation approaches based on random search lack guidance and do not converge to the best configurations as quickly. Here, we propose to combine the benefits of both Bayesian optimization and bandit-based methods, in order to achieve the best of both worlds: strong anytime performance and fast convergence to optimal configurations. We propose a new practical state-of-the-art hyperparameter optimization method, which consistently outperforms both Bayesian optimization and Hyperband on a wide range of problem types, including high-dimensional toy functions, support vector machines, feed-forward neural networks, Bayesian neural networks, deep reinforcement learning, and convolutional neural networks. Our method is robust and versatile, while at the same time being conceptually simple and easy to implement.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. QuickScope: Certifying Hard Questions in Dynamic LLM Benchmarks

    cs.CL 2026-04 unverdicted novelty 6.0

    QuickScope uses modified COUP Bayesian optimization to find truly difficult questions in dynamic LLM benchmarks more sample-efficiently than baselines while cutting false positives.

  2. Inferring identified hadron production in $pp$ collisions with physics-informed machine learning at the LHC

    hep-ph 2026-05 unverdicted novelty 5.0

    A physics-informed neural network infers pT spectra of pi, K, p, Lambda, and Ks in unmeasured rapidity regions from PYTHIA8 pp collisions at 13.6 TeV, achieving 1.5-5.83% yield uncertainties while reproducing yield ra...

  3. Quantifying the Carbon Emissions of Machine Learning

    cs.CY 2019-10 unverdicted novelty 5.0

    Presents a calculator tool for estimating carbon emissions from ML model training along with mitigation actions.