pith. sign in

Constant stepsize q-learning: Distributional convergence, bias and extrapolation

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

years

2026 5 2025 3

verdicts

UNVERDICTED 8

representative citing papers

Central Limit Theorems for Asynchronous Averaged Q-Learning

cs.LG · 2025-09-23 · unverdicted · novelty 6.0

Establishes non-asymptotic and functional central limit theorems for asynchronous averaged Q-learning with explicit rates depending on iterations, state-action space, discount factor, and exploration quality.

citing papers explorer

Showing 8 of 8 citing papers.