Continuous-time mean-variance portfolio selection: a reinforcement learning framework

Haoran Wang, Xun Yu Zhou · 2020

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

math.OC · 2022-09-15 · unverdicted · novelty 7.0

Policy iteration converges for entropy-regularized stochastic control via novel Hölder-Sobolev estimates yielding uniform bounds on value functions.

Convergence of Policy Iteration for Entropy-Regularized Stochastic Control Problems · math.OC · 2022-09-15 · unverdicted · none · ref 27