Adaptive multi-fidelity optimization with fast learning rates

Come Fiegel; Michal Valko; Victor Gabillon

arxiv: 2604.16239 · v1 · submitted 2026-04-17 · 📊 stat.ML · cs.LG

Adaptive multi-fidelity optimization with fast learning rates

Come Fiegel , Victor Gabillon , Michal Valko This is my paper

Pith reviewed 2026-05-10 07:22 UTC · model grok-4.3

classification 📊 stat.ML cs.LG

keywords multi-fidelity optimizationsimple regret boundsadaptive learninglocal smoothnesscost-bias tradeoffKometo algorithmbandit optimization

0 comments

The pith

Kometo algorithm achieves near-optimal regret rates in multi-fidelity optimization without knowing smoothness or fidelity parameters.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper proves lower bounds on the simple regret for optimizing a locally smooth function when biased approximations of different costs are available. It introduces the Kometo algorithm that matches these bounds up to logarithmic factors without requiring knowledge of the function's smoothness or the cost-bias relationships of the fidelities. The approach improves on prior guarantees and demonstrates better empirical performance compared to previous methods that assume known parameters.

Core claim

The Kometo algorithm achieves the minimax optimal rates for multi-fidelity optimization with only additional logarithmic factors, while being adaptive to the unknown smoothness and fidelity assumptions.

What carries the argument

The Kometo algorithm, an adaptive procedure that balances the cost and bias of fidelity approximations without prior knowledge of problem parameters.

Load-bearing premise

The objective function is locally smooth and the available fidelities have a well-defined cost-to-bias relationship that determines the achievable lower bounds.

What would settle it

Observing that Kometo fails to match the proven lower bounds in a constructed example where the cost-to-bias function is known but the algorithm does not adapt properly, or that it underperforms a tuned non-adaptive method in synthetic experiments.

Figures

Figures reproduced from arXiv: 2604.16239 by Come Fiegel, Michal Valko, Victor Gabillon.

**Figure 1.** Figure 1: (a) top-left: Curin 2-dimensions, (b) top-center: Branin 2-dimensions (c) top-right: Hartman3d 3-dimensions (d) bottom-left: Hartman6d 6-dimensions (e) bottom-center: Borehole 8-dimensions (f) bottom-right: SVM 2-dimensions. Experiments are composed of five synthetic experiments, from (a) to (e), and one real-world, (f). The multi-fidelity algorithms can use all fidelities, while the non multi-fidelity alg… view at source ↗

read the original abstract

In multi-fidelity optimization, biased approximations of varying costs of the target function are available. This paper studies the problem of optimizing a locally smooth function with a limited budget, where the learner has to make a tradeoff between the cost and the bias of these approximations. We first prove lower bounds for the simple regret under different assumptions on the fidelities, based on a cost-to-bias function. We then present the Kometo algorithm which achieves, with additional logarithmic factors, the same rates without any knowledge of the function smoothness and fidelity assumptions, and improves previously proven guarantees. We finally empirically show that our algorithm outperforms previous multi-fidelity optimization methods without the knowledge of problem-dependent parameters.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Kometo gives a parameter-free algorithm for multi-fidelity optimization that matches the lower bounds up to explicit log factors.

read the letter

Kometo stands out because it adapts to unknown smoothness and fidelity costs in multi-fidelity optimization while still hitting near-optimal rates. The paper first derives lower bounds from a cost-to-bias function on the fidelities, then presents an algorithm that achieves matching upper bounds without being told the smoothness or the fidelity parameters in advance. The doubling schedule handles the unknowns and the extra logarithmic factors are tracked explicitly in the analysis. This removes a practical requirement that earlier methods needed as input and improves the stated guarantees. Experiments show it beats prior approaches when those parameters are hidden, which is the realistic case. One soft spot is the reliance on local smoothness and a well-defined cost-to-bias relation; those are standard assumptions but the rates collapse if they fail. The empirical section would be easier to assess with more detail on the test problems and baseline tuning, though the abstract indicates a fair comparison. The log overhead from adaptation is minor and openly stated rather than hidden. This paper is for researchers working on bandits and multi-fidelity optimization who care about rates that do not require oracle knowledge of the problem. A reader focused on adaptive algorithms will find the matching bounds and the explicit algorithm useful. It deserves a serious referee because the central claim of parameter-free near-optimality is grounded and the problem has clear practical value. I would send it to review.

Referee Report

2 major / 2 minor

Summary. The manuscript derives information-theoretic lower bounds on simple regret for multi-fidelity optimization of locally smooth functions, expressed in terms of a cost-to-bias function that relates fidelity costs to approximation bias. It then introduces the Kometo algorithm, which adaptively selects both evaluation points and fidelities via a doubling schedule and achieves matching upper bounds up to explicit logarithmic factors without requiring prior knowledge of the smoothness parameter or the cost-to-bias function. Empirical results on synthetic and real-world tasks show outperformance relative to prior multi-fidelity methods that assume known parameters.

Significance. If the matching upper and lower bounds hold, the work advances adaptive multi-fidelity optimization by removing the need for problem-dependent tuning parameters while retaining near-optimal rates; the explicit tracking of logarithmic factors in the doubling schedule and the fully adaptive nature of the algorithm constitute a clear technical improvement over earlier analyses that required known smoothness or fidelity models.

major comments (2)

[Problem formulation / Assumptions] The lower-bound construction and the Kometo regret analysis both rest on the existence of a well-defined cost-to-bias function; this assumption is load-bearing for the claimed optimality and should be stated explicitly as an axiom in the problem formulation (currently only alluded to in the abstract).
[Algorithm analysis] The abstract states that the extra logarithmic factors in Kometo’s regret do not depend on the unknown smoothness or fidelity parameters; the proof of this independence (via the doubling schedule) is central to the adaptivity claim and must be verified in the analysis section.

minor comments (2)

[Notation] Notation for the cost-to-bias function and the local smoothness parameter should be introduced once and used consistently throughout the theoretical sections.
[Experiments] The empirical section would benefit from an explicit statement of the budgets used and the number of independent runs for each method to allow direct comparison with the theoretical rates.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive report and positive recommendation. We address each major comment below and will revise the manuscript accordingly.

read point-by-point responses

Referee: [Problem formulation / Assumptions] The lower-bound construction and the Kometo regret analysis both rest on the existence of a well-defined cost-to-bias function; this assumption is load-bearing for the claimed optimality and should be stated explicitly as an axiom in the problem formulation (currently only alluded to in the abstract).

Authors: We agree that the cost-to-bias function is a foundational assumption underlying both the lower bounds and the upper bounds achieved by Kometo. While the abstract and the statements of the lower bounds refer to it, we will add an explicit axiomatic statement of its existence and properties in the problem formulation section of the revised manuscript to make its central role unambiguous. revision: yes
Referee: [Algorithm analysis] The abstract states that the extra logarithmic factors in Kometo’s regret do not depend on the unknown smoothness or fidelity parameters; the proof of this independence (via the doubling schedule) is central to the adaptivity claim and must be verified in the analysis section.

Authors: The independence of the logarithmic factors from the unknown smoothness parameter and the cost-to-bias function is established in the analysis of the doubling schedule (Section 4). The schedule doubles the number of evaluations at successive stages using only the observed costs and biases, without any dependence on these unknown quantities; the resulting log factors depend solely on the total budget and the number of stages. We will insert a short clarifying remark in the analysis section that explicitly verifies this parameter-independence and points to the relevant lemmas. revision: yes

Circularity Check

0 steps flagged

No significant circularity; bounds and rates derived independently from stated assumptions

full rationale

The derivation begins with explicit lower bounds constructed from the cost-to-bias function and local smoothness assumptions (external to any fitted quantities or algorithm outputs). The Kometo algorithm is then defined and analyzed to match those rates up to explicit logarithmic factors while remaining parameter-free; the analysis tracks the doubling schedule and regret terms without reducing to a re-statement of the lower-bound construction or to self-cited uniqueness results. No self-definitional steps, fitted-input predictions, or load-bearing self-citation chains appear in the provided abstract and skeptic summary. The central claim therefore retains independent content from the algorithm design and regret analysis.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

Review performed from abstract only; no explicit free parameters, invented entities, or detailed axioms are visible beyond the stated local smoothness and cost-to-bias assumptions.

axioms (2)

domain assumption The objective function is locally smooth
Invoked for the regret analysis and lower bounds.
domain assumption Fidelities admit a cost-to-bias function
Used to express the lower bounds on simple regret.

pith-pipeline@v0.9.0 · 5406 in / 1174 out tokens · 37367 ms · 2026-05-10T07:22:02.142661+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

26 extracted references · 26 canonical work pages

[1]

Auer, P., Ortner, R., and Szepesv \' a ri, C. (2007). Improved rates for the stochastic continuum-armed bandit problem . In Conference on Learning Theory (COLT)

work page 2007
[2]

L., Gabillon, V., and Valko, M

Bartlett, P. L., Gabillon, V., and Valko, M. (2019). A simple parameter-free and adaptive approach to optimization under a minimal local smoothness assumption . In Algorithmic Learning Theory (ALT)

work page 2019
[3]

Bubeck, S., Munos, R., Stoltz, G., and Szepesv \' a ri, C. (2011). X-armed bandits . Journal of Machine Learning Research , 12:1587--1627

work page 2011
[4]

J., and How, J

Cutler, M., Walsh, T. J., and How, J. P. (2014). Reinforcement learning with multi-fidelity simulators. In 2014 IEEE International Conference on Robotics and Automation (ICRA) , pages 3888--3895. IEEE

work page 2014
[5]

Ghosh, S., Kristensen, J., Zhang, Y., Subber, W., and Wang, L. (2019). A Strategy for Adaptive Sampling of Multi-fidelity Gaussian Process to Reduce Predictive Uncertainty . preprint

work page 2019
[6]

Grill, J.-B., Valko, M., and Munos, R. (2015). Black-box optimization of noisy functions with unknown smoothness . In Neural Information Processing Systems (NeurIPS)

work page 2015
[7]

and Hassani, M

Hoorfar, A. and Hassani, M. (2008). Inequalities on the Lambert W function and hyperpower function . Journal of Inequalities in Pure and Applied Mathematics , 9(2):5--9

work page 2008
[8]

T., Notz, W

Huang, D., Allen, T. T., Notz, W. I., and Miller, R. A. (2006). Sequential kriging optimization using multiple-fidelity evaluations. Structural and Multidisciplinary Optimization , 32(5):369--382

work page 2006
[9]

B., Schneider, J., and Poczos, B

Kandasamy, K., Dasarathy, G., Oliva, J. B., Schneider, J., and Poczos, B. (2016a). Gaussian process bandit optimisation with multi-fidelity evaluations. Neural Information Processing Systems (NeurIPS)

work page
[10]

B., Schneider, J., and Poczos, B

Kandasamy, K., Dasarathy, G., Oliva, J. B., Schneider, J., and Poczos, B. (2016b). Multi-fidelity Gaussian Process Bandit Optimisation . Journal of Artificial Intelligence Research

work page
[11]

Kandasamy, K., Dasarathy, G., Poczos, B., and Schneider, J. (2016c). The multi-fidelity multi-armed bandit. In Neural Information Processing Systems (NeurIPS) , pages 1777--1785

work page
[12]

Kandasamy, K., Dasarathy, G., Schneider, J., and Poczos, B. (2017). Multi-fidelity Bayesian optimisation with continuous approximations. In Proceedings of the 34th International Conference on Machine Learning (ICML) , pages 1799--1808. JMLR.org

work page 2017
[13]

Kawaguchi, K., Maruyama, Y., and Zheng, X. (2016). Global continuous optimization with error bound and fast convergence . Journal of Artificial Intelligence Research , 56:153--195

work page 2016
[14]

Kleinberg, R., Slivkins, A., and Upfal, E. (2008). Multi-armed bandit problems in metric spaces . In Symposium on Theory of Computing (STOC)

work page 2008
[15]

Li, L., Jamieson, K., DeSalvo, G., and Talwalkar, A. R. A. (2017). Hyperband: Bandit-based configuration evaluation for hyperparameter optimization . In International Conference on Learning Representations (ICLR)

work page 2017
[16]

and Carpentier, A

Locatelli, A. and Carpentier, A. (2018). Adaptivity to Smoothness in X-armed bandits . In Conference on Learning Theory (COLT)

work page 2018
[17]

Matyas, J. (1965). Random optimization. Automation and Remote Control , 26(2):246--253

work page 1965
[18]

Munos, R. (2011). Optimistic optimization of deterministic functions without the knowledge of its smoothness . In Neural Information Processing Systems (NeurIPS)

work page 2011
[19]

Munos, R. (2014). From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning . Foundations and Trends in Machine Learning

work page 2014
[20]

and Spokoiny, V

Nesterov, Y. and Spokoiny, V. (2017). Random gradient-free minimization of convex functions. Foundations of Computational Mathematics , 17(2):527--566

work page 2017
[21]

Sen, R., Kandasamy, K., and Shakkottai, S. (2018). Multi-Fidelity Black-Box Optimization with Hierarchical Partitions . International Conference on Machine Learning (ICML)

work page 2018
[22]

Sen, R., Kandasamy, K., and Shakkottai, S. (2019). Noisy Blackbox Optimization with Multi-Fidelity Queries: A Tree Search Approach . Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS)

work page 2019
[23]

Shang, X., Kaufmann, E., and Valko, M. (2019). General parallel optimization without metric . In Algorithmic Learning Theory (ALT)

work page 2019
[24]

M., and Seeger, M

Srinivas, N., Krause, A., Kakade, S. M., and Seeger, M. (2010). Gaussian process optimization in the bandit setting: No regret and experimental design . International Conference on Machine Learning (ICML)

work page 2010
[25]

Valko, M., Carpentier, A., and Munos, R. (2013). Stochastic simultaneous optimistic optimization . In International Conference on Machine Learning (ICML)

work page 2013
[26]

N., Kian, B., Low, H., and Kankanhalli, M

Zhang, Y., Hoang, T. N., Kian, B., Low, H., and Kankanhalli, M. (2019). Information-Based Multi-Fidelity Bayesian Optimization . Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI)

work page 2019

[1] [1]

Auer, P., Ortner, R., and Szepesv \' a ri, C. (2007). Improved rates for the stochastic continuum-armed bandit problem . In Conference on Learning Theory (COLT)

work page 2007

[2] [2]

L., Gabillon, V., and Valko, M

Bartlett, P. L., Gabillon, V., and Valko, M. (2019). A simple parameter-free and adaptive approach to optimization under a minimal local smoothness assumption . In Algorithmic Learning Theory (ALT)

work page 2019

[3] [3]

Bubeck, S., Munos, R., Stoltz, G., and Szepesv \' a ri, C. (2011). X-armed bandits . Journal of Machine Learning Research , 12:1587--1627

work page 2011

[4] [4]

J., and How, J

Cutler, M., Walsh, T. J., and How, J. P. (2014). Reinforcement learning with multi-fidelity simulators. In 2014 IEEE International Conference on Robotics and Automation (ICRA) , pages 3888--3895. IEEE

work page 2014

[5] [5]

Ghosh, S., Kristensen, J., Zhang, Y., Subber, W., and Wang, L. (2019). A Strategy for Adaptive Sampling of Multi-fidelity Gaussian Process to Reduce Predictive Uncertainty . preprint

work page 2019

[6] [6]

Grill, J.-B., Valko, M., and Munos, R. (2015). Black-box optimization of noisy functions with unknown smoothness . In Neural Information Processing Systems (NeurIPS)

work page 2015

[7] [7]

and Hassani, M

Hoorfar, A. and Hassani, M. (2008). Inequalities on the Lambert W function and hyperpower function . Journal of Inequalities in Pure and Applied Mathematics , 9(2):5--9

work page 2008

[8] [8]

T., Notz, W

Huang, D., Allen, T. T., Notz, W. I., and Miller, R. A. (2006). Sequential kriging optimization using multiple-fidelity evaluations. Structural and Multidisciplinary Optimization , 32(5):369--382

work page 2006

[9] [9]

B., Schneider, J., and Poczos, B

Kandasamy, K., Dasarathy, G., Oliva, J. B., Schneider, J., and Poczos, B. (2016a). Gaussian process bandit optimisation with multi-fidelity evaluations. Neural Information Processing Systems (NeurIPS)

work page

[10] [10]

B., Schneider, J., and Poczos, B

Kandasamy, K., Dasarathy, G., Oliva, J. B., Schneider, J., and Poczos, B. (2016b). Multi-fidelity Gaussian Process Bandit Optimisation . Journal of Artificial Intelligence Research

work page

[11] [11]

Kandasamy, K., Dasarathy, G., Poczos, B., and Schneider, J. (2016c). The multi-fidelity multi-armed bandit. In Neural Information Processing Systems (NeurIPS) , pages 1777--1785

work page

[12] [12]

Kandasamy, K., Dasarathy, G., Schneider, J., and Poczos, B. (2017). Multi-fidelity Bayesian optimisation with continuous approximations. In Proceedings of the 34th International Conference on Machine Learning (ICML) , pages 1799--1808. JMLR.org

work page 2017

[13] [13]

Kawaguchi, K., Maruyama, Y., and Zheng, X. (2016). Global continuous optimization with error bound and fast convergence . Journal of Artificial Intelligence Research , 56:153--195

work page 2016

[14] [14]

Kleinberg, R., Slivkins, A., and Upfal, E. (2008). Multi-armed bandit problems in metric spaces . In Symposium on Theory of Computing (STOC)

work page 2008

[15] [15]

Li, L., Jamieson, K., DeSalvo, G., and Talwalkar, A. R. A. (2017). Hyperband: Bandit-based configuration evaluation for hyperparameter optimization . In International Conference on Learning Representations (ICLR)

work page 2017

[16] [16]

and Carpentier, A

Locatelli, A. and Carpentier, A. (2018). Adaptivity to Smoothness in X-armed bandits . In Conference on Learning Theory (COLT)

work page 2018

[17] [17]

Matyas, J. (1965). Random optimization. Automation and Remote Control , 26(2):246--253

work page 1965

[18] [18]

Munos, R. (2011). Optimistic optimization of deterministic functions without the knowledge of its smoothness . In Neural Information Processing Systems (NeurIPS)

work page 2011

[19] [19]

Munos, R. (2014). From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning . Foundations and Trends in Machine Learning

work page 2014

[20] [20]

and Spokoiny, V

Nesterov, Y. and Spokoiny, V. (2017). Random gradient-free minimization of convex functions. Foundations of Computational Mathematics , 17(2):527--566

work page 2017

[21] [21]

Sen, R., Kandasamy, K., and Shakkottai, S. (2018). Multi-Fidelity Black-Box Optimization with Hierarchical Partitions . International Conference on Machine Learning (ICML)

work page 2018

[22] [22]

Sen, R., Kandasamy, K., and Shakkottai, S. (2019). Noisy Blackbox Optimization with Multi-Fidelity Queries: A Tree Search Approach . Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS)

work page 2019

[23] [23]

Shang, X., Kaufmann, E., and Valko, M. (2019). General parallel optimization without metric . In Algorithmic Learning Theory (ALT)

work page 2019

[24] [24]

M., and Seeger, M

Srinivas, N., Krause, A., Kakade, S. M., and Seeger, M. (2010). Gaussian process optimization in the bandit setting: No regret and experimental design . International Conference on Machine Learning (ICML)

work page 2010

[25] [25]

Valko, M., Carpentier, A., and Munos, R. (2013). Stochastic simultaneous optimistic optimization . In International Conference on Machine Learning (ICML)

work page 2013

[26] [26]

N., Kian, B., Low, H., and Kankanhalli, M

Zhang, Y., Hoang, T. N., Kian, B., Low, H., and Kankanhalli, M. (2019). Information-Based Multi-Fidelity Bayesian Optimization . Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI)

work page 2019