High-order generator regression from multi-step trajectories yields a second-order accurate estimator for finite-horizon continuous-time policy evaluation that outperforms the Bellman baseline in calibration studies and benchmarks.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
fields
stat.ML 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
Geometric tempering yields exponential convergence bounds for both Wasserstein and Fisher-Rao flows but produces no speedup in the Fisher-Rao metric, with new adaptive schedules derived from the tempered dynamics.
citing papers explorer
-
Beyond Bellman: High-Order Generator Regression for Continuous-Time Policy Evaluation
High-order generator regression from multi-step trajectories yields a second-order accurate estimator for finite-horizon continuous-time policy evaluation that outperforms the Bellman baseline in calibration studies and benchmarks.
-
Properties and limitations of geometric tempering for gradient flow dynamics
Geometric tempering yields exponential convergence bounds for both Wasserstein and Fisher-Rao flows but produces no speedup in the Fisher-Rao metric, with new adaptive schedules derived from the tempered dynamics.