pith. sign in

Practical Multi-fidelity Bayesian Optimization for Hyperparameter Tuning

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it
abstract

Bayesian optimization is popular for optimizing time-consuming black-box objectives. Nonetheless, for hyperparameter tuning in deep neural networks, the time required to evaluate the validation error for even a few hyperparameter settings remains a bottleneck. Multi-fidelity optimization promises relief using cheaper proxies to such objectives --- for example, validation error for a network trained using a subset of the training points or fewer iterations than required for convergence. We propose a highly flexible and practical approach to multi-fidelity Bayesian optimization, focused on efficiently optimizing hyperparameters for iteratively trained supervised learning models. We introduce a new acquisition function, the trace-aware knowledge-gradient, which efficiently leverages both multiple continuous fidelity controls and trace observations --- values of the objective at a sequence of fidelities, available when varying fidelity using training iterations. We provide a provably convergent method for optimizing our acquisition function and show it outperforms state-of-the-art alternatives for hyperparameter tuning of deep neural networks and large-scale kernel learning.

fields

cs.LG 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

Constrained Bayesian Optimisation with Multiple Information Sources

cs.LG · 2026-07-01 · unverdicted · novelty 6.0

A multi-source extension of constrained Max-value Entropy Search for Bayesian optimization incorporates auxiliary data sources to improve early exploration and performance under constraints even with weak correlations.

citing papers explorer

Showing 1 of 1 citing paper.

  • Constrained Bayesian Optimisation with Multiple Information Sources cs.LG · 2026-07-01 · unverdicted · none · ref 42 · internal anchor

    A multi-source extension of constrained Max-value Entropy Search for Bayesian optimization incorporates auxiliary data sources to improve early exploration and performance under constraints even with weak correlations.