Continuous-time reinforcement learning control: A review of theoretical results, insights on performance, and needs for new designs

Brent A Wallace, Jennie Si · 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

math.OC · 2022-09-15 · unverdicted · novelty 7.0

Policy iteration converges for entropy-regularized stochastic control via novel Hölder-Sobolev estimates yielding uniform bounds on value functions.

Showing 1 of 1 citing paper.

Convergence of Policy Iteration for Entropy-Regularized Stochastic Control Problems math.OC · 2022-09-15 · unverdicted · none · ref 25
Policy iteration converges for entropy-regularized stochastic control via novel Hölder-Sobolev estimates yielding uniform bounds on value functions.