Thus, after finite-time identification of X∗, the transverse component admits exponential upper bounds at any rate larger than the JSR ρ̄∗ of the restricted optimal family.

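For any $\varepsilon_* > 0$ such that $\beta_* := \bar\rho_* + \varepsilon_* < 1$, there exists a constant $\tilde C_{\varepsilon_*} > 0$ such that $\|z_{K_{\mathrm{id}}+\ell}\|_2 \le \tilde C_{\varepsilon_*}\, \beta_*^{\ell}\, \|z_{K_{\mathrm{id}}}\|_2$ for all $\ell \ge 0$.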
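The bound above can be illustrated numerically. The following is a minimal sketch, not the paper's method: it assumes a small hypothetical family of two 2×2 matrices standing in for the restricted optimal family. It brackets the joint spectral radius (JSR) using all products of length k, and it takes the upper bracket as the rate β. With M the largest single-matrix 2-norm, the constant C = (M/β)^(k-1) then makes the bound ‖z_ℓ‖₂ ≤ C β^ℓ ‖z_0‖₂ hold for every switching sequence. The names `jsr_bounds`, `check_decay`, and `family` are illustrative only.

```python
import itertools
from functools import reduce

import numpy as np


def jsr_bounds(mats, k):
    """Bracket the JSR of a finite matrix family using products of length k.

    Lower bound: max spectral radius of a length-k product, to the power 1/k.
    Upper bound: max spectral norm of a length-k product, to the power 1/k.
    """
    lo, hi = 0.0, 0.0
    for combo in itertools.product(mats, repeat=k):
        P = reduce(np.matmul, combo)
        lo = max(lo, np.max(np.abs(np.linalg.eigvals(P))) ** (1.0 / k))
        hi = max(hi, np.linalg.norm(P, 2) ** (1.0 / k))
    return lo, hi


def check_decay(mats, k, steps=40, seed=0):
    """Check ||z_l|| <= C * beta**l * ||z_0|| along one random switching path."""
    _, beta = jsr_bounds(mats, k)               # rate: an upper bound on the JSR
    M = max(np.linalg.norm(A, 2) for A in mats)
    C = max(1.0, (M / beta) ** (k - 1))         # absorbs the incomplete last block
    rng = np.random.default_rng(seed)
    z = rng.standard_normal(mats[0].shape[0])
    z0_norm = np.linalg.norm(z)
    for ell in range(1, steps + 1):
        z = mats[rng.integers(len(mats))] @ z   # arbitrary switching signal
        if np.linalg.norm(z) > C * beta**ell * z0_norm * (1 + 1e-9):
            return False
    return True


# Hypothetical contractive family (an assumption for illustration only).
family = [
    np.array([[0.5, 0.4], [0.0, 0.3]]),
    np.array([[0.3, 0.0], [0.4, 0.5]]),
]
```

Calling `jsr_bounds(family, 3)` returns a lower and an upper estimate of the JSR, and `check_decay(family, 3)` tests the exponential bound along one sampled switching sequence. Larger k tightens the bracket, but the number of products grows exponentially in k.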

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

Beyond the Bellman Fixed Point: Geometry and Fast Policy Identification in Value Iteration

math.OC · 2026-04-19 · unverdicted · novelty 7.0

Q-value iteration enters an invariant tube around Q* plus the all-ones vector in finite time, with distance decaying at rate given by the joint spectral radius of the transverse projected switching family, which can be strictly faster than the discount factor.

citing papers explorer

Showing 1 of 1 citing paper.

Beyond the Bellman Fixed Point: Geometry and Fast Policy Identification in Value Iteration · math.OC · 2026-04-19 · unverdicted · none · ref 66
