Establishes existence and uniqueness for optimal policies in continuous-time entropy-regularized mean-field control with common noise via an integrated q-function, plus explicit Gaussian characterization in the LQ setting.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
fields
math.OC 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
The authors propose actor-critic q-learning algorithms for mean-field control with common noise based on martingale orthogonality conditions and relaxed controls, establish convergence of inner iterations in the linear-quadratic case, and demonstrate performance on examples.
citing papers explorer
-
Continuous-time q-learning for mean-field control with common noise, part-I: Theoretical foundations
Establishes existence and uniqueness for optimal policies in continuous-time entropy-regularized mean-field control with common noise via an integrated q-function, plus explicit Gaussian characterization in the LQ setting.
-
Continuous-time q-learning for mean-field control with common noise, part-II: q-learning algorithms
The authors propose actor-critic q-learning algorithms for mean-field control with common noise based on martingale orthogonality conditions and relaxed controls, establish convergence of inner iterations in the linear-quadratic case, and demonstrate performance on examples.