Is q-learning minimax optimal? a tight sample complexity analysis.Operations Research, 72(1):222–236, 2024

Gen Li, Weichen Wu, Yuejie Chi, Cong Ma, Alessandro Rinaldo, Yuting Wei · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Gaussian Approximation for Asynchronous Q-learning

stat.ML · 2026-04-08 · unverdicted · novelty 7.0

Derived rates of order up to n^{-1/6} log^4(n S A) for the high-dimensional CLT of averaged asynchronous Q-learning iterates, plus a general martingale-difference CLT.

citing papers explorer

Showing 1 of 1 citing paper.

Gaussian Approximation for Asynchronous Q-learning stat.ML · 2026-04-08 · unverdicted · none · ref 31
Derived rates of order up to n^{-1/6} log^4(n S A) for the high-dimensional CLT of averaged asynchronous Q-learning iterates, plus a general martingale-difference CLT.

Is q-learning minimax optimal? a tight sample complexity analysis.Operations Research, 72(1):222–236, 2024

fields

years

verdicts

representative citing papers

citing papers explorer