Solutions to the regularized exploratory equilibrium HJB equation converge in suitable norms to a strong solution of the original EHJB as the entropy parameter vanishes, yielding existence of equilibria without conventional stringent regularity assumptions.
q-Learning in continuous time.Journal of Machine Learning Research, 24(161):1–61, 2023
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
math.OC 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Equilibrium under Time-Inconsistency: A New Existence Theory by Vanishing Entropy Regularization
Solutions to the regularized exploratory equilibrium HJB equation converge in suitable norms to a strong solution of the original EHJB as the entropy parameter vanishes, yielding existence of equilibria without conventional stringent regularity assumptions.