Memory-Guided Trust-Region Bayesian Optimization (MG-TuRBO) for High Dimensions

Abhilasha Saroj; Guanhao Xu; Jinghui Yuan; Ross Wang; Roy Luo; Shaked Regev

arxiv: 2604.08569 · v1 · submitted 2026-03-25 · 💻 cs.LG

Memory-Guided Trust-Region Bayesian Optimization (MG-TuRBO) for High Dimensions

Abhilasha Saroj , Shaked Regev , Guanhao Xu , Jinghui Yuan , Roy Luo , Ross Wang This is my paper

Pith reviewed 2026-05-15 00:06 UTC · model grok-4.3

classification 💻 cs.LG

keywords Bayesian optimizationtrust-region methodshigh-dimensional optimizationtraffic simulationcalibrationmemory-guided searchacquisition functions

0 comments

The pith

Memory-guided trust-region Bayesian optimization reaches calibration targets faster than genetic algorithms in 84-dimensional traffic simulations.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests whether adding memory guidance to trust-region Bayesian optimization improves results on expensive, noisy calibration tasks with many parameters. It compares a genetic algorithm against classical BO, TuRBO, Multi-TuRBO, and the new MG-TuRBO on two real traffic models, one with 14 inputs and one with 84. In the lower-dimensional case all Bayesian methods converge quickly, but in the 84D case MG-TuRBO paired with an adaptive acquisition rule pulls ahead by locating good parameter sets with fewer expensive simulation runs. The work therefore positions MG-TuRBO as a practical tool when the number of calibration variables grows large and the evaluation budget stays small.

Core claim

MG-TuRBO augments standard trust-region Bayesian optimization with memory of prior evaluations and a novel adaptive acquisition strategy; on the 84-dimensional traffic calibration task it reaches high-quality solutions noticeably faster and more consistently than GA or the other BO variants, while all methods perform comparably on the 14-dimensional instance.

What carries the argument

Memory-Guided Trust-Region Bayesian Optimization (MG-TuRBO), which stores and reuses information from previous trust-region searches to direct new local Bayesian optimizations in high dimensions.

Load-bearing premise

The performance edge observed on two specific traffic problems will generalize to other high-dimensional expensive optimization tasks without additional validation.

What would settle it

A controlled experiment on a fresh 80-plus-dimensional black-box problem with similar noise and budget constraints in which MG-TuRBO fails to match or exceed Multi-TuRBO would disprove the claim of broad high-D usefulness.

Figures

Figures reproduced from arXiv: 2604.08569 by Abhilasha Saroj, Guanhao Xu, Jinghui Yuan, Ross Wang, Roy Luo, Shaked Regev.

**Figure 2.** Figure 2: Optimization-phase convergence for the 14D network. [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 5.** Figure 5: Optimization-phase convergence for the 84D calibration problem. [PITH_FULL_IMAGE:figures/full_fig_p005_5.png] view at source ↗

**Figure 4.** Figure 4: Final best GEH distributions for Thompson Sampling and Adaptive [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗

**Figure 6.** Figure 6: Trust-region search mechanisms in 84D (Adaptive acquisition). [PITH_FULL_IMAGE:figures/full_fig_p006_6.png] view at source ↗

read the original abstract

Traffic simulation and digital-twin calibration is a challenging optimization problem with a limited simulation budget. Each trial requires an expensive simulation run, and the relationship between calibration inputs and model error is often nonconvex, and noisy. The problem becomes more difficult as the number of calibration parameters increases. We compare a commonly used automatic calibration method, a genetic algorithm (GA), with Bayesian optimization methods (BOMs): classical Bayesian optimization (BO), Trust-Region BO (TuRBO), Multi-TuRBO, and a proposed Memory-Guided TuRBO (MG-TuRBO) method. We compare performance on 2 real-world traffic simulation calibration problems with 14 and 84 decision variables, representing lower- and higher-dimensional (14D and 84D) settings. For BOMs, we study two acquisition strategies, Thompson sampling and a novel adaptive strategy. We evaluate performance using final calibration quality, convergence behavior, and consistency across runs. The results show that BOMs reach good calibration targets much faster than GA in the lower-D problem. MG-TuRBO performs comparably in our 14D setting, it demonstrates noticeable advantages in the 84D problem, particularly when paired with our adaptive strategy. Our results suggest that MG-TuRBO is especially useful for high-D traffic simulation calibration and potentially for high-D problems in general.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

MG-TuRBO adds memory guidance and an adaptive acquisition to TuRBO and shows gains on the 84D traffic case, but the evidence is narrow and lacks isolating controls.

read the letter

The paper's core contribution is a memory-guided extension of TuRBO paired with a new adaptive acquisition function. On the 84D traffic calibration task it pulls ahead of standard TuRBO, Multi-TuRBO, and genetic algorithms in final quality and convergence speed, while performing about the same as the baselines in the 14D case. The comparison to GA on real expensive simulations is the part that lands: those problems are noisy and nonconvex, and showing faster progress under a limited budget is a practical result for people who actually run traffic models or similar digital twins. The adaptive strategy appears to be the main lever that helps in the higher-dimensional setting, which is a straightforward engineering adjustment worth documenting. The memory component is presented as the distinguishing feature, but the write-up does not separate its contribution from the acquisition change. There are no ablations that keep the adaptive rule and turn off only the memory guidance, and the experiments stay within the two traffic instances rather than including standard high-dimensional synthetic functions. That leaves open the possibility that the observed edge is driven by problem structure or by the acquisition alone. The claims about usefulness for high-D problems in general therefore rest on limited ground. This work is aimed at applied researchers who calibrate simulation models in transportation or engineering and need methods that scale past a few dozen variables under tight budgets. A reader in that niche would get concrete comparison data and a usable variant to try. It is coherent enough on its own terms to deserve peer review; the main requests would be for ablations and at least one non-traffic benchmark to tighten the generalization argument.

Referee Report

2 major / 2 minor

Summary. The paper introduces Memory-Guided Trust-Region Bayesian Optimization (MG-TuRBO) for high-dimensional expensive black-box optimization, focusing on traffic simulation calibration. It compares MG-TuRBO (with memory guidance and a novel adaptive acquisition strategy) against genetic algorithms, standard BO, TuRBO, and Multi-TuRBO on two real-world problems with 14 and 84 decision variables, reporting that MG-TuRBO shows comparable performance in 14D and noticeable advantages in 84D, particularly with the adaptive strategy, and suggesting broader utility for high-D problems.

Significance. If the performance advantages are confirmed to stem from the memory guidance mechanism and generalize, the method could provide a practical extension of trust-region BO for high-dimensional simulation calibration tasks with limited budgets and noisy nonconvex objectives.

major comments (2)

Abstract and results sections: the claim that MG-TuRBO demonstrates 'noticeable advantages' in the 84D problem and is 'especially useful for high-D problems in general' rests on comparisons without any ablation that removes only the memory guidance component while retaining the adaptive acquisition strategy, leaving attribution of benefits unclear.
Abstract and experimental evaluation: no results are reported on standard high-dimensional synthetic benchmarks (e.g., 50D–100D Levy, Rosenbrock, or Ackley functions), so the generalization beyond the two specific traffic calibration instances cannot be assessed.

minor comments (2)

The manuscript should report the number of independent runs, error bars or standard deviations, and any statistical tests supporting the performance comparisons between methods.
Provide a precise algorithmic description (pseudocode or equations) of the memory guidance mechanism and the novel adaptive acquisition strategy in the methods section.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive feedback on our manuscript. We have carefully considered the major comments and provide point-by-point responses below. We believe the suggested revisions will strengthen the paper and clarify the contributions of MG-TuRBO.

read point-by-point responses

Referee: Abstract and results sections: the claim that MG-TuRBO demonstrates 'noticeable advantages' in the 84D problem and is 'especially useful for high-D problems in general' rests on comparisons without any ablation that removes only the memory guidance component while retaining the adaptive acquisition strategy, leaving attribution of benefits unclear.

Authors: We agree that an explicit ablation isolating the contribution of the memory guidance mechanism, while keeping the adaptive acquisition strategy fixed, would clarify the source of the performance gains. In the revised manuscript, we will include such an ablation study on the 84D problem, comparing the full MG-TuRBO (memory guidance + adaptive acquisition) against a variant that uses only the adaptive acquisition without memory guidance. This will allow us to better attribute the benefits observed in the high-dimensional setting. revision: yes
Referee: Abstract and experimental evaluation: no results are reported on standard high-dimensional synthetic benchmarks (e.g., 50D–100D Levy, Rosenbrock, or Ackley functions), so the generalization beyond the two specific traffic calibration instances cannot be assessed.

Authors: While our work emphasizes real-world traffic simulation calibration, which presents practical challenges not fully captured by synthetic benchmarks, we acknowledge the value of including standard high-dimensional test functions to support broader claims. We will add results on 50D–100D Levy, Rosenbrock, and Ackley functions in the revised version, using comparable evaluation budgets, to better assess generalization. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical comparisons rest on external benchmarks

full rationale

The paper presents an empirical study comparing GA against several BO variants (including the proposed MG-TuRBO) on two fixed traffic-simulation calibration tasks (14D and 84D). Performance is measured by final calibration quality, convergence speed, and run-to-run consistency. No derivation chain, equations, or predictions appear; the central claim is simply that MG-TuRBO with the adaptive strategy performed better on the 84D instance than the baselines. Because the evaluation uses independent external benchmarks and reports raw observed metrics, no step reduces to a fitted quantity or self-citation by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on standard Bayesian optimization assumptions and the empirical performance of the proposed memory-guided extension; no explicit free parameters or invented entities are described in the abstract.

axioms (1)

domain assumption Gaussian process surrogate models and standard acquisition functions accurately capture the noisy, nonconvex relationship between calibration inputs and simulation error.
Implicit in all compared Bayesian optimization methods.

pith-pipeline@v0.9.0 · 5563 in / 1219 out tokens · 47027 ms · 2026-05-15T00:06:42.161186+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

15 extracted references · 15 canonical work pages

[1]

Development of a connected corridor real-time data-driven traffic digital twin simulation model,

A. J. Saroj, S. Roy, A. Guin, and M. Hunter, “Development of a connected corridor real-time data-driven traffic digital twin simulation model,”Journal of Transportation Engineering, Part A: Systems, vol. 147, no. 12, p. 04021096, 2021. [Online]. Available: https://ascelibrary.org/doi/abs/10.1061/JTEPBS.0000599

work page doi:10.1061/jtepbs.0000599 2021
[2]

Traffic analysis toolbox volume iv: Guidelines for applying traffic microsimulation modeling software,

FHW A, “Traffic analysis toolbox volume iv: Guidelines for applying traffic microsimulation modeling software,” Federal Highway Admin- istration (FHW A), Tech. Rep., 2022, calibration guidance and GEH statistic. Available at: https://ops.fhwa.dot.gov/trafficanalysistools/tat vol4/

work page 2022
[3]

Microscopic traffic simulation using sumo,

P. A. Lopezet al., “Microscopic traffic simulation using sumo,” in The 21st IEEE International Conference on Intelligent Transportation Systems. IEEE, 2018. [Online]. Available: https://elib.dlr.de/124092/

work page 2018
[4]

Multi-objective stochastic optimization algorithms to calibrate microsimulation models,

M. Karimi, M. Miriestahbanati, H. Esmaeeli, and C. Alecsandru, “Multi-objective stochastic optimization algorithms to calibrate microsimulation models,”Transportation Research Record, vol. 2673, no. 4, pp. 743–752, 2019. [Online]. Available: https: //doi.org/10.1177/0361198119838260

work page doi:10.1177/0361198119838260 2019
[5]

A systematic comparison for consistent scenario development using microscopic simulation software,

A. Saroj, G. Xu, Y . Shao, and C. R. Wang, “A systematic comparison for consistent scenario development using microscopic simulation software,” in2024 Winter Simulation Conference (WSC), 2024, pp. 194–205

work page 2024
[6]

Gaussian process optimization in the bandit setting: No regret and experimental design,

N. Srinivas, A. Krause, S. Kakade, and M. Seeger, “Gaussian process optimization in the bandit setting: No regret and experimental design,” inInternational Conference on Machine Learning (ICML), 2010

work page 2010
[7]

Scalable calibration of stochastic transporta- tion simulators using bayesian optimization,

C. Osorio and L. Chong, “Scalable calibration of stochastic transporta- tion simulators using bayesian optimization,”Transportation Science, vol. 51, no. 3, pp. 865–880, 2017

work page 2017
[8]

Bayesian optimization techniques for high-dimensional simulation-based transportation problems,

T. Tay and C. Osorio, “Bayesian optimization techniques for high-dimensional simulation-based transportation problems,” Transportation Research Part B: Methodological, vol. 164, pp. 210–243, 2022. [Online]. Available: https://www.sciencedirect.com/ science/article/pii/S0191261522001448

work page 2022
[9]

Scalable global optimization via local bayesian optimization,

D. Eriksson, M. Pearce, J. R. Gardner, R. D. Turner, and M. Poloczek, “Scalable global optimization via local bayesian optimization,” in Advances in Neural Information Processing Systems (NeurIPS), 2019

work page 2019
[10]

Parallelised bayesian optimisation via thompson sampling,

K. Kandasamy, A. Krishnamurthy, J. Schneider, and B. Poczos, “Parallelised bayesian optimisation via thompson sampling,” in Proceedings of the Twenty-First International Conference on Artificial Intelligence and Statistics, vol. 84. PMLR, 09–11 Apr 2018, pp. 133–142. [Online]. Available: https://proceedings.mlr.press/ v84/kandasamy18a.html

work page 2018
[11]

Developing analysis, modeling, and simulation tools for connected automated vehicle applications: A case study on sr 99 in california,

Federal Highway Administration, “Developing analysis, modeling, and simulation tools for connected automated vehicle applications: A case study on sr 99 in california,” U.S. Department of Transportation, Federal Highway Administration, Tech. Rep. FHW A-HRT-21-039, 2021, accessed: 2026-03-13. [Online]. Available: https://www.fhwa. dot.gov/publications/rese...

work page 2021
[12]

Developing an automated microscopic traffic simulation scenario generation tool,

G. Xu, A. Saroj, C. R. Wang, and Y . Shao, “Developing an automated microscopic traffic simulation scenario generation tool,”Transporta- tion Research Record, vol. 2679, no. 11, pp. 650–672, 2025

work page 2025
[13]

Recent de- velopment and applications of SUMO – simulation of urban mobility,

D. Krajzewicz, J. Erdmann, M. Behrisch, and L. Bieker, “Recent de- velopment and applications of SUMO – simulation of urban mobility,” inInternational Journal on Advances in Systems and Measurements, vol. 5, no. 3&4, 2012, pp. 128–138

work page 2012
[14]

SciPy 1.0: Fundamental algorithms for scientific computing in python,

P. Virtanen, R. Gommerset al., “SciPy 1.0: Fundamental algorithms for scientific computing in python,” pp. 261–272, 2020

work page 2020
[15]

Optimization through multi-fidelity modeling,

S. Regev, R. Glasby, P. Laiu, and D. Reasor, “Optimization through multi-fidelity modeling,” Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States), Tech. Rep., 04 2025. [Online]. Available: https://www.osti.gov/biblio/2573203

work page arXiv 2025

[1] [1]

Development of a connected corridor real-time data-driven traffic digital twin simulation model,

A. J. Saroj, S. Roy, A. Guin, and M. Hunter, “Development of a connected corridor real-time data-driven traffic digital twin simulation model,”Journal of Transportation Engineering, Part A: Systems, vol. 147, no. 12, p. 04021096, 2021. [Online]. Available: https://ascelibrary.org/doi/abs/10.1061/JTEPBS.0000599

work page doi:10.1061/jtepbs.0000599 2021

[2] [2]

Traffic analysis toolbox volume iv: Guidelines for applying traffic microsimulation modeling software,

FHW A, “Traffic analysis toolbox volume iv: Guidelines for applying traffic microsimulation modeling software,” Federal Highway Admin- istration (FHW A), Tech. Rep., 2022, calibration guidance and GEH statistic. Available at: https://ops.fhwa.dot.gov/trafficanalysistools/tat vol4/

work page 2022

[3] [3]

Microscopic traffic simulation using sumo,

P. A. Lopezet al., “Microscopic traffic simulation using sumo,” in The 21st IEEE International Conference on Intelligent Transportation Systems. IEEE, 2018. [Online]. Available: https://elib.dlr.de/124092/

work page 2018

[4] [4]

Multi-objective stochastic optimization algorithms to calibrate microsimulation models,

M. Karimi, M. Miriestahbanati, H. Esmaeeli, and C. Alecsandru, “Multi-objective stochastic optimization algorithms to calibrate microsimulation models,”Transportation Research Record, vol. 2673, no. 4, pp. 743–752, 2019. [Online]. Available: https: //doi.org/10.1177/0361198119838260

work page doi:10.1177/0361198119838260 2019

[5] [5]

A systematic comparison for consistent scenario development using microscopic simulation software,

A. Saroj, G. Xu, Y . Shao, and C. R. Wang, “A systematic comparison for consistent scenario development using microscopic simulation software,” in2024 Winter Simulation Conference (WSC), 2024, pp. 194–205

work page 2024

[6] [6]

Gaussian process optimization in the bandit setting: No regret and experimental design,

N. Srinivas, A. Krause, S. Kakade, and M. Seeger, “Gaussian process optimization in the bandit setting: No regret and experimental design,” inInternational Conference on Machine Learning (ICML), 2010

work page 2010

[7] [7]

Scalable calibration of stochastic transporta- tion simulators using bayesian optimization,

C. Osorio and L. Chong, “Scalable calibration of stochastic transporta- tion simulators using bayesian optimization,”Transportation Science, vol. 51, no. 3, pp. 865–880, 2017

work page 2017

[8] [8]

Bayesian optimization techniques for high-dimensional simulation-based transportation problems,

T. Tay and C. Osorio, “Bayesian optimization techniques for high-dimensional simulation-based transportation problems,” Transportation Research Part B: Methodological, vol. 164, pp. 210–243, 2022. [Online]. Available: https://www.sciencedirect.com/ science/article/pii/S0191261522001448

work page 2022

[9] [9]

Scalable global optimization via local bayesian optimization,

D. Eriksson, M. Pearce, J. R. Gardner, R. D. Turner, and M. Poloczek, “Scalable global optimization via local bayesian optimization,” in Advances in Neural Information Processing Systems (NeurIPS), 2019

work page 2019

[10] [10]

Parallelised bayesian optimisation via thompson sampling,

K. Kandasamy, A. Krishnamurthy, J. Schneider, and B. Poczos, “Parallelised bayesian optimisation via thompson sampling,” in Proceedings of the Twenty-First International Conference on Artificial Intelligence and Statistics, vol. 84. PMLR, 09–11 Apr 2018, pp. 133–142. [Online]. Available: https://proceedings.mlr.press/ v84/kandasamy18a.html

work page 2018

[11] [11]

Developing analysis, modeling, and simulation tools for connected automated vehicle applications: A case study on sr 99 in california,

Federal Highway Administration, “Developing analysis, modeling, and simulation tools for connected automated vehicle applications: A case study on sr 99 in california,” U.S. Department of Transportation, Federal Highway Administration, Tech. Rep. FHW A-HRT-21-039, 2021, accessed: 2026-03-13. [Online]. Available: https://www.fhwa. dot.gov/publications/rese...

work page 2021

[12] [12]

Developing an automated microscopic traffic simulation scenario generation tool,

G. Xu, A. Saroj, C. R. Wang, and Y . Shao, “Developing an automated microscopic traffic simulation scenario generation tool,”Transporta- tion Research Record, vol. 2679, no. 11, pp. 650–672, 2025

work page 2025

[13] [13]

Recent de- velopment and applications of SUMO – simulation of urban mobility,

D. Krajzewicz, J. Erdmann, M. Behrisch, and L. Bieker, “Recent de- velopment and applications of SUMO – simulation of urban mobility,” inInternational Journal on Advances in Systems and Measurements, vol. 5, no. 3&4, 2012, pp. 128–138

work page 2012

[14] [14]

SciPy 1.0: Fundamental algorithms for scientific computing in python,

P. Virtanen, R. Gommerset al., “SciPy 1.0: Fundamental algorithms for scientific computing in python,” pp. 261–272, 2020

work page 2020

[15] [15]

Optimization through multi-fidelity modeling,

S. Regev, R. Glasby, P. Laiu, and D. Reasor, “Optimization through multi-fidelity modeling,” Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States), Tech. Rep., 04 2025. [Online]. Available: https://www.osti.gov/biblio/2573203

work page arXiv 2025