On value iteration convergence in connected MDPs

2024 · arXiv 2406.09592

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

Beyond the Bellman Fixed Point: Geometry and Fast Policy Identification in Value Iteration

math.OC · 2026-04-19 · unverdicted · novelty 7.0

Q-value iteration enters an invariant tube around Q* plus the all-ones vector in finite time, with distance decaying at rate given by the joint spectral radius of the transverse projected switching family, which can be strictly faster than the discount factor.

citing papers explorer

Beyond the Bellman Fixed Point: Geometry and Fast Policy Identification in Value Iteration

math.OC · 2026-04-19 · unverdicted · none · ref 30
