pith. sign in

arxiv: 1603.06560 · v4 · pith:B7H6RI4Enew · submitted 2016-03-21 · 💻 cs.LG · stat.ML

Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization

classification 💻 cs.LG stat.ML
keywords optimizationhyperbandhyperparameterbayesianconfigurationslearningnovelproblems
0
0 comments X
read the original abstract

Performance of machine learning algorithms depends critically on identifying a good set of hyperparameters. While recent approaches use Bayesian optimization to adaptively select configurations, we focus on speeding up random search through adaptive resource allocation and early-stopping. We formulate hyperparameter optimization as a pure-exploration non-stochastic infinite-armed bandit problem where a predefined resource like iterations, data samples, or features is allocated to randomly sampled configurations. We introduce a novel algorithm, Hyperband, for this framework and analyze its theoretical properties, providing several desirable guarantees. Furthermore, we compare Hyperband with popular Bayesian optimization methods on a suite of hyperparameter optimization problems. We observe that Hyperband can provide over an order-of-magnitude speedup over our competitor set on a variety of deep-learning and kernel-based learning problems.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 7 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Budgeted Online Influence Maximization

    cs.LG 2026-04 unverdicted novelty 7.0

    A new algorithm for online influence maximization under a total budget constraint using the independent cascade model and edge-level semi-bandit feedback, with improved regret bounds for both budgeted and cardinality ...

  2. Best of both worlds: Stochastic & adversarial best-arm identification

    stat.ML 2026-04 unverdicted novelty 7.0

    No algorithm can be optimal in both stochastic and adversarial best-arm identification; a new parameter-free algorithm matches the derived lower bound up to log factors in stochastic cases while handling adversarial rewards.

  3. QuickScope: Certifying Hard Questions in Dynamic LLM Benchmarks

    cs.CL 2026-04 unverdicted novelty 6.0

    QuickScope uses modified COUP Bayesian optimization to find truly difficult questions in dynamic LLM benchmarks more sample-efficiently than baselines while cutting false positives.

  4. Joint Detection of Malicious Domains and Infected Clients

    cs.LG 2019-06 unverdicted novelty 6.0

    Sluice network transfer learning jointly detects infected clients and malicious domains from HTTPS traffic, outperforming separate models and identifying previously unknown threats.

  5. When Does Sparse MoE Help in Vision? The Role of Backbone Compute Leverage in Sparse Routing

    cs.CV 2026-05 unverdicted novelty 5.0

    Sparse MoE vision models show positive accuracy gaps only when routing a substantial compute fraction ρ and using k≥2 experts at large scale; batch-axis dispatch is identified as a key failure mode.

  6. Beyond Structure: Revolutionising Materials Discovery via AI-Driven Synthesis Protocol-Property Relationships

    cond-mat.mtrl-sci 2026-05 unverdicted novelty 5.0

    A perspective proposes a synthesis-first paradigm for AI-driven materials discovery, treating protocols rather than structures as the key variables to close the synthesizability gap via machine-readable recipes, gener...

  7. Quantifying the Carbon Emissions of Machine Learning

    cs.CY 2019-10 unverdicted novelty 5.0

    Presents a calculator tool for estimating carbon emissions from ML model training along with mitigation actions.