Taking total expectation and setting $a_k := \mathbb{E}[V_\varepsilon^\infty(e_k)]$ gives the scalar recursion $a_{k+1} \le \beta_\varepsilon^2 a_k + \alpha^2 C_\varepsilon^2 W_{\max} + (4(1+\gamma)B_Q)^2$
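
A scalar recursion of this shape can be unrolled into a closed-form bound. The sketch below is illustrative only: `rho` stands in for the contraction factor multiplying a_k, `c` for the additive term that does not depend on a_k, and the numeric values are hypothetical and not taken from the paper. It assumes 0 <= rho < 1 and checks the geometric-series closed form against direct iteration of the worst case, where the inequality holds with equality.

```python
def iterate(a0: float, rho: float, c: float, k: int) -> float:
    """Apply a_{j+1} = rho * a_j + c for k steps (worst case of the inequality)."""
    a = a0
    for _ in range(k):
        a = rho * a + c
    return a


def unrolled_bound(a0: float, rho: float, c: float, k: int) -> float:
    """Closed form rho^k * a0 + c * (1 - rho^k) / (1 - rho), assuming 0 <= rho < 1."""
    return rho**k * a0 + c * (1.0 - rho**k) / (1.0 - rho)


# Hypothetical values chosen only for illustration.
a0, rho, c = 10.0, 0.9, 0.05

# The closed form matches direct iteration at every tested horizon.
for k in (0, 1, 10, 100):
    assert abs(iterate(a0, rho, c, k) - unrolled_bound(a0, rho, c, k)) < 1e-9

# As k grows, the bound decays geometrically toward the residual level c / (1 - rho).
residual = c / (1.0 - rho)
```

Under these assumptions the initial error is forgotten at rate `rho**k`, while the additive term leaves a floor of `c / (1 - rho)` that does not vanish as k grows.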

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.LG 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

Lyapunov-Certified Direct Switching Theory for Q-Learning

cs.LG · 2026-04-21 · unverdicted · novelty 7.0

Q-learning error is recast as a switched linear recursion whose exponential rate is exactly the joint spectral radius of a direct switching family, yielding finite-time bounds via a product-defined Lyapunov function.

citing papers explorer

Showing 1 of 1 citing paper.

  • Lyapunov-Certified Direct Switching Theory for Q-Learning cs.LG · 2026-04-21 · unverdicted · none · ref 40
