Uncertainty quantification for Markov chains with application to temporal difference learning

Weichen Wu, Yuting Wei, Alessandro Rinaldo · 2025 · arXiv 2502.13822

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Wasserstein-p Central Limit Theorem Rates: From Local Dependence to Markov Chains

math.PR · 2026-01-13 · unverdicted · novelty 8.0

The paper proves the first optimal O(n^{-1/2}) Wasserstein-1 CLT rates for locally dependent sequences and geometrically ergodic Markov chains, plus new W_p rates for p ≥ 2 under mild moment conditions, with an application to U-statistics.

Gaussian Approximation for Asynchronous Q-learning

stat.ML · 2026-04-08 · unverdicted · novelty 7.0

Derives rates of order up to n^{-1/6} log^4(n S A) for the high-dimensional CLT of averaged asynchronous Q-learning iterates, plus a general martingale-difference CLT.

On Gaussian approximation for entropy-regularized Q-learning with function approximation

stat.ML · 2026-05-17 · unverdicted · novelty 5.0

Establishes n^{-1/4} Gaussian approximation in convex distance for averaged entropy-regularized Q-learning with linear function approximation and polynomial stepsizes.

citing papers explorer

Wasserstein-p Central Limit Theorem Rates: From Local Dependence to Markov Chains
math.PR · 2026-01-13 · unverdicted · none · ref 72 · internal anchor

Gaussian Approximation for Asynchronous Q-learning
stat.ML · 2026-04-08 · unverdicted · none · ref 55 · internal anchor

On Gaussian approximation for entropy-regularized Q-learning with function approximation
stat.ML · 2026-05-17 · unverdicted · none · ref 37 · internal anchor
