TFM-S3 uses a tabular foundation model to predict returns and guide intermittent global exploration within an SVD-derived policy subspace, yielding faster early convergence and better final performance than TD3 and population-based methods under fixed rollout budgets.
Taking the human out of the loop: A review of Bayesian optimiza- tion,
2 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 2representative citing papers
A feedback optimization pipeline for tri-level mobility games outperforms Bayesian optimization and genetic algorithms on Zurich multimodal data while identifying incentives that boost multimodal use.
citing papers explorer
-
Can Tabular Foundation Models Guide Exploration in Robot Policy Learning?
TFM-S3 uses a tabular foundation model to predict returns and guide intermittent global exploration within an SVD-derived policy subspace, yielding faster early convergence and better final performance than TD3 and population-based methods under fixed rollout budgets.
-
Hierarchical Strategic Decision-Making in Layered Mobility Systems
A feedback optimization pipeline for tri-level mobility games outperforms Bayesian optimization and genetic algorithms on Zurich multimodal data while identifying incentives that boost multimodal use.