Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization

Lisha Li , Kevin Jamieson , Giulia DeSalvo , Afshin Rostamizadeh , Ameet Talwalkar

Authors on Pith no claims yet

classification 💻 cs.LG stat.ML

keywords optimizationhyperbandhyperparameterbayesianconfigurationslearningnovelproblems

read the original abstract

Performance of machine learning algorithms depends critically on identifying a good set of hyperparameters. While recent approaches use Bayesian optimization to adaptively select configurations, we focus on speeding up random search through adaptive resource allocation and early-stopping. We formulate hyperparameter optimization as a pure-exploration non-stochastic infinite-armed bandit problem where a predefined resource like iterations, data samples, or features is allocated to randomly sampled configurations. We introduce a novel algorithm, Hyperband, for this framework and analyze its theoretical properties, providing several desirable guarantees. Furthermore, we compare Hyperband with popular Bayesian optimization methods on a suite of hyperparameter optimization problems. We observe that Hyperband can provide over an order-of-magnitude speedup over our competitor set on a variety of deep-learning and kernel-based learning problems.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 5 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Budgeted Online Influence Maximization
cs.LG 2026-04 unverdicted novelty 7.0

A new algorithm for online influence maximization under a total budget constraint using the independent cascade model and edge-level semi-bandit feedback, with improved regret bounds for both budgeted and cardinality ...
Best of both worlds: Stochastic & adversarial best-arm identification
stat.ML 2026-04 unverdicted novelty 7.0

No algorithm can be optimal in both stochastic and adversarial best-arm identification; a new parameter-free algorithm matches the derived lower bound up to log factors in stochastic cases while handling adversarial rewards.
QuickScope: Certifying Hard Questions in Dynamic LLM Benchmarks
cs.CL 2026-04 unverdicted novelty 6.0

QuickScope uses modified COUP Bayesian optimization to find truly difficult questions in dynamic LLM benchmarks more sample-efficiently than baselines while cutting false positives.
Beyond Structure: Revolutionising Materials Discovery via AI-Driven Synthesis Protocol-Property Relationships
cond-mat.mtrl-sci 2026-05 unverdicted novelty 5.0

A perspective proposes a synthesis-first paradigm for AI-driven materials discovery, treating protocols rather than structures as the key variables to close the synthesizability gap via machine-readable recipes, gener...
Quantifying the Carbon Emissions of Machine Learning
cs.CY 2019-10 unverdicted novelty 5.0

Presents a calculator tool for estimating carbon emissions from ML model training along with mitigation actions.