Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization

Afshin Rostamizadeh; Ameet Talwalkar; Giulia DeSalvo; Kevin Jamieson; Lisha Li

Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization

Not yet reviewed by Pith; the record is open.

Re-run · record.json Download PDF Read on arXiv ↗

This paper has not been read by Pith yet. Machine review is queued; the pith claim, tier, and objections will appear here once it completes.

SPECIMEN: schema-true, not a live event

T0 review · schema-true

One-sentence machine reading of the paper's core claim.

pith:XXXXXXXX · record.json · timestamp

arxiv 1603.06560 v4 pith:B7H6RI4E submitted 2016-03-21 cs.LG stat.ML

Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization

Lisha Li , Kevin Jamieson , Giulia DeSalvo , Afshin Rostamizadeh , Ameet Talwalkar This is my paper

classification cs.LG stat.ML

keywords optimizationhyperbandhyperparameterbayesianconfigurationslearningnovelproblems

verification ladder T0 review T1 audit T2 compute T3 formal T4 reserved

0 comments

read the original abstract

Performance of machine learning algorithms depends critically on identifying a good set of hyperparameters. While recent approaches use Bayesian optimization to adaptively select configurations, we focus on speeding up random search through adaptive resource allocation and early-stopping. We formulate hyperparameter optimization as a pure-exploration non-stochastic infinite-armed bandit problem where a predefined resource like iterations, data samples, or features is allocated to randomly sampled configurations. We introduce a novel algorithm, Hyperband, for this framework and analyze its theoretical properties, providing several desirable guarantees. Furthermore, we compare Hyperband with popular Bayesian optimization methods on a suite of hyperparameter optimization problems. We observe that Hyperband can provide over an order-of-magnitude speedup over our competitor set on a variety of deep-learning and kernel-based learning problems.

discussion (0)

Forward citations

Cited by 7 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Budgeted Online Influence Maximization
cs.LG 2026-04 unverdicted novelty 7.0

A new algorithm for online influence maximization under a total budget constraint using the independent cascade model and edge-level semi-bandit feedback, with improved regret bounds for both budgeted and cardinality ...
Best of both worlds: Stochastic & adversarial best-arm identification
stat.ML 2026-04 unverdicted novelty 7.0

No algorithm can be optimal in both stochastic and adversarial best-arm identification; a new parameter-free algorithm matches the derived lower bound up to log factors in stochastic cases while handling adversarial rewards.
QuickScope: Certifying Hard Questions in Dynamic LLM Benchmarks
cs.CL 2026-04 unverdicted novelty 6.0

QuickScope uses modified COUP Bayesian optimization to find truly difficult questions in dynamic LLM benchmarks more sample-efficiently than baselines while cutting false positives.
Joint Detection of Malicious Domains and Infected Clients
cs.LG 2019-06 unverdicted novelty 6.0

Sluice network transfer learning jointly detects infected clients and malicious domains from HTTPS traffic, outperforming separate models and identifying previously unknown threats.
When Does Sparse MoE Help in Vision? The Role of Backbone Compute Leverage in Sparse Routing
cs.CV 2026-05 unverdicted novelty 5.0

Sparse MoE vision models show positive accuracy gaps only when routing a substantial compute fraction ρ and using k≥2 experts at large scale; batch-axis dispatch is identified as a key failure mode.
Beyond Structure: Revolutionising Materials Discovery via AI-Driven Synthesis Protocol-Property Relationships
cond-mat.mtrl-sci 2026-05 unverdicted novelty 5.0

A perspective proposes a synthesis-first paradigm for AI-driven materials discovery, treating protocols rather than structures as the key variables to close the synthesizability gap via machine-readable recipes, gener...
Quantifying the Carbon Emissions of Machine Learning
cs.CY 2019-10 unverdicted novelty 5.0

Presents a calculator tool for estimating carbon emissions from ML model training along with mitigation actions.