Title resolution pending

2024 · arXiv 2410.07778

2 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Continuous-time q-learning for mean-field control with common noise, part-I: Theoretical foundations

math.OC · 2026-04-30 · unverdicted · novelty 7.0

Establishes existence and uniqueness for optimal policies in continuous-time entropy-regularized mean-field control with common noise via an integrated q-function, plus explicit Gaussian characterization in the LQ setting.

Discretization error from regularized Reinforcement Learning to continuous-time stochastic control

math.OC · 2026-04-23 · unverdicted · novelty 5.0

Derives quantitative convergence rates for the gap between optimal policies from regularized discrete-time Bellman equations and true optimal controls in underlying continuous-time stochastic problems.

citing papers explorer

Showing 2 of 2 citing papers.

Continuous-time q-learning for mean-field control with common noise, part-I: Theoretical foundations

math.OC · 2026-04-30 · unverdicted · none · ref 1

Discretization error from regularized Reinforcement Learning to continuous-time stochastic control

math.OC · 2026-04-23 · unverdicted · none · ref 4
