Adaptive multi-fidelity optimization with fast learning rates
Pith reviewed 2026-05-10 07:22 UTC · model grok-4.3
The pith
Kometo algorithm achieves near-optimal regret rates in multi-fidelity optimization without knowing smoothness or fidelity parameters.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The Kometo algorithm achieves the minimax optimal rates for multi-fidelity optimization with only additional logarithmic factors, while being adaptive to the unknown smoothness and fidelity assumptions.
What carries the argument
The Kometo algorithm, an adaptive procedure that balances the cost and bias of fidelity approximations without prior knowledge of problem parameters.
Load-bearing premise
The objective function is locally smooth and the available fidelities have a well-defined cost-to-bias relationship that determines the achievable lower bounds.
What would settle it
Observing that Kometo fails to match the proven lower bounds in a constructed example where the cost-to-bias function is known but the algorithm does not adapt properly, or that it underperforms a tuned non-adaptive method in synthetic experiments.
Figures
read the original abstract
In multi-fidelity optimization, biased approximations of varying costs of the target function are available. This paper studies the problem of optimizing a locally smooth function with a limited budget, where the learner has to make a tradeoff between the cost and the bias of these approximations. We first prove lower bounds for the simple regret under different assumptions on the fidelities, based on a cost-to-bias function. We then present the Kometo algorithm which achieves, with additional logarithmic factors, the same rates without any knowledge of the function smoothness and fidelity assumptions, and improves previously proven guarantees. We finally empirically show that our algorithm outperforms previous multi-fidelity optimization methods without the knowledge of problem-dependent parameters.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript derives information-theoretic lower bounds on simple regret for multi-fidelity optimization of locally smooth functions, expressed in terms of a cost-to-bias function that relates fidelity costs to approximation bias. It then introduces the Kometo algorithm, which adaptively selects both evaluation points and fidelities via a doubling schedule and achieves matching upper bounds up to explicit logarithmic factors without requiring prior knowledge of the smoothness parameter or the cost-to-bias function. Empirical results on synthetic and real-world tasks show outperformance relative to prior multi-fidelity methods that assume known parameters.
Significance. If the matching upper and lower bounds hold, the work advances adaptive multi-fidelity optimization by removing the need for problem-dependent tuning parameters while retaining near-optimal rates; the explicit tracking of logarithmic factors in the doubling schedule and the fully adaptive nature of the algorithm constitute a clear technical improvement over earlier analyses that required known smoothness or fidelity models.
major comments (2)
- [Problem formulation / Assumptions] The lower-bound construction and the Kometo regret analysis both rest on the existence of a well-defined cost-to-bias function; this assumption is load-bearing for the claimed optimality and should be stated explicitly as an axiom in the problem formulation (currently only alluded to in the abstract).
- [Algorithm analysis] The abstract states that the extra logarithmic factors in Kometo’s regret do not depend on the unknown smoothness or fidelity parameters; the proof of this independence (via the doubling schedule) is central to the adaptivity claim and must be verified in the analysis section.
minor comments (2)
- [Notation] Notation for the cost-to-bias function and the local smoothness parameter should be introduced once and used consistently throughout the theoretical sections.
- [Experiments] The empirical section would benefit from an explicit statement of the budgets used and the number of independent runs for each method to allow direct comparison with the theoretical rates.
Simulated Author's Rebuttal
We thank the referee for the constructive report and positive recommendation. We address each major comment below and will revise the manuscript accordingly.
read point-by-point responses
-
Referee: [Problem formulation / Assumptions] The lower-bound construction and the Kometo regret analysis both rest on the existence of a well-defined cost-to-bias function; this assumption is load-bearing for the claimed optimality and should be stated explicitly as an axiom in the problem formulation (currently only alluded to in the abstract).
Authors: We agree that the cost-to-bias function is a foundational assumption underlying both the lower bounds and the upper bounds achieved by Kometo. While the abstract and the statements of the lower bounds refer to it, we will add an explicit axiomatic statement of its existence and properties in the problem formulation section of the revised manuscript to make its central role unambiguous. revision: yes
-
Referee: [Algorithm analysis] The abstract states that the extra logarithmic factors in Kometo’s regret do not depend on the unknown smoothness or fidelity parameters; the proof of this independence (via the doubling schedule) is central to the adaptivity claim and must be verified in the analysis section.
Authors: The independence of the logarithmic factors from the unknown smoothness parameter and the cost-to-bias function is established in the analysis of the doubling schedule (Section 4). The schedule doubles the number of evaluations at successive stages using only the observed costs and biases, without any dependence on these unknown quantities; the resulting log factors depend solely on the total budget and the number of stages. We will insert a short clarifying remark in the analysis section that explicitly verifies this parameter-independence and points to the relevant lemmas. revision: yes
Circularity Check
No significant circularity; bounds and rates derived independently from stated assumptions
full rationale
The derivation begins with explicit lower bounds constructed from the cost-to-bias function and local smoothness assumptions (external to any fitted quantities or algorithm outputs). The Kometo algorithm is then defined and analyzed to match those rates up to explicit logarithmic factors while remaining parameter-free; the analysis tracks the doubling schedule and regret terms without reducing to a re-statement of the lower-bound construction or to self-cited uniqueness results. No self-definitional steps, fitted-input predictions, or load-bearing self-citation chains appear in the provided abstract and skeptic summary. The central claim therefore retains independent content from the algorithm design and regret analysis.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption The objective function is locally smooth
- domain assumption Fidelities admit a cost-to-bias function
Reference graph
Works this paper leans on
-
[1]
Auer, P., Ortner, R., and Szepesv \' a ri, C. (2007). Improved rates for the stochastic continuum-armed bandit problem . In Conference on Learning Theory (COLT)
work page 2007
-
[2]
L., Gabillon, V., and Valko, M
Bartlett, P. L., Gabillon, V., and Valko, M. (2019). A simple parameter-free and adaptive approach to optimization under a minimal local smoothness assumption . In Algorithmic Learning Theory (ALT)
work page 2019
-
[3]
Bubeck, S., Munos, R., Stoltz, G., and Szepesv \' a ri, C. (2011). X-armed bandits . Journal of Machine Learning Research , 12:1587--1627
work page 2011
-
[4]
Cutler, M., Walsh, T. J., and How, J. P. (2014). Reinforcement learning with multi-fidelity simulators. In 2014 IEEE International Conference on Robotics and Automation (ICRA) , pages 3888--3895. IEEE
work page 2014
-
[5]
Ghosh, S., Kristensen, J., Zhang, Y., Subber, W., and Wang, L. (2019). A Strategy for Adaptive Sampling of Multi-fidelity Gaussian Process to Reduce Predictive Uncertainty . preprint
work page 2019
-
[6]
Grill, J.-B., Valko, M., and Munos, R. (2015). Black-box optimization of noisy functions with unknown smoothness . In Neural Information Processing Systems (NeurIPS)
work page 2015
-
[7]
Hoorfar, A. and Hassani, M. (2008). Inequalities on the Lambert W function and hyperpower function . Journal of Inequalities in Pure and Applied Mathematics , 9(2):5--9
work page 2008
-
[8]
Huang, D., Allen, T. T., Notz, W. I., and Miller, R. A. (2006). Sequential kriging optimization using multiple-fidelity evaluations. Structural and Multidisciplinary Optimization , 32(5):369--382
work page 2006
-
[9]
B., Schneider, J., and Poczos, B
Kandasamy, K., Dasarathy, G., Oliva, J. B., Schneider, J., and Poczos, B. (2016a). Gaussian process bandit optimisation with multi-fidelity evaluations. Neural Information Processing Systems (NeurIPS)
-
[10]
B., Schneider, J., and Poczos, B
Kandasamy, K., Dasarathy, G., Oliva, J. B., Schneider, J., and Poczos, B. (2016b). Multi-fidelity Gaussian Process Bandit Optimisation . Journal of Artificial Intelligence Research
-
[11]
Kandasamy, K., Dasarathy, G., Poczos, B., and Schneider, J. (2016c). The multi-fidelity multi-armed bandit. In Neural Information Processing Systems (NeurIPS) , pages 1777--1785
-
[12]
Kandasamy, K., Dasarathy, G., Schneider, J., and Poczos, B. (2017). Multi-fidelity Bayesian optimisation with continuous approximations. In Proceedings of the 34th International Conference on Machine Learning (ICML) , pages 1799--1808. JMLR.org
work page 2017
-
[13]
Kawaguchi, K., Maruyama, Y., and Zheng, X. (2016). Global continuous optimization with error bound and fast convergence . Journal of Artificial Intelligence Research , 56:153--195
work page 2016
-
[14]
Kleinberg, R., Slivkins, A., and Upfal, E. (2008). Multi-armed bandit problems in metric spaces . In Symposium on Theory of Computing (STOC)
work page 2008
-
[15]
Li, L., Jamieson, K., DeSalvo, G., and Talwalkar, A. R. A. (2017). Hyperband: Bandit-based configuration evaluation for hyperparameter optimization . In International Conference on Learning Representations (ICLR)
work page 2017
-
[16]
Locatelli, A. and Carpentier, A. (2018). Adaptivity to Smoothness in X-armed bandits . In Conference on Learning Theory (COLT)
work page 2018
-
[17]
Matyas, J. (1965). Random optimization. Automation and Remote Control , 26(2):246--253
work page 1965
-
[18]
Munos, R. (2011). Optimistic optimization of deterministic functions without the knowledge of its smoothness . In Neural Information Processing Systems (NeurIPS)
work page 2011
-
[19]
Munos, R. (2014). From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning . Foundations and Trends in Machine Learning
work page 2014
-
[20]
Nesterov, Y. and Spokoiny, V. (2017). Random gradient-free minimization of convex functions. Foundations of Computational Mathematics , 17(2):527--566
work page 2017
-
[21]
Sen, R., Kandasamy, K., and Shakkottai, S. (2018). Multi-Fidelity Black-Box Optimization with Hierarchical Partitions . International Conference on Machine Learning (ICML)
work page 2018
-
[22]
Sen, R., Kandasamy, K., and Shakkottai, S. (2019). Noisy Blackbox Optimization with Multi-Fidelity Queries: A Tree Search Approach . Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS)
work page 2019
-
[23]
Shang, X., Kaufmann, E., and Valko, M. (2019). General parallel optimization without metric . In Algorithmic Learning Theory (ALT)
work page 2019
-
[24]
Srinivas, N., Krause, A., Kakade, S. M., and Seeger, M. (2010). Gaussian process optimization in the bandit setting: No regret and experimental design . International Conference on Machine Learning (ICML)
work page 2010
-
[25]
Valko, M., Carpentier, A., and Munos, R. (2013). Stochastic simultaneous optimistic optimization . In International Conference on Machine Learning (ICML)
work page 2013
-
[26]
N., Kian, B., Low, H., and Kankanhalli, M
Zhang, Y., Hoang, T. N., Kian, B., Low, H., and Kankanhalli, M. (2019). Information-Based Multi-Fidelity Bayesian Optimization . Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI)
work page 2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.