Step 1: findκ.Due to the Markovian property, the matrixV k is a function ofs k−1 for everyk∈[n]

Combine the results above to achieve the desired Berry-Esseen bound

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Uncertainty quantification for Markov chain induced martingales with application to temporal difference learning

stat.ML · 2025-02-19 · unverdicted · novelty 7.0

Derives novel high-dimensional concentration inequalities for vector-valued Markov chain martingales and applies them to TD learning for consistency guarantees matching asymptotic variance up to logs and O(T^{-1/4} log T) Gaussian approximation rate.

citing papers explorer

Showing 1 of 1 citing paper.

Uncertainty quantification for Markov chain induced martingales with application to temporal difference learning stat.ML · 2025-02-19 · unverdicted · none · ref 5
Derives novel high-dimensional concentration inequalities for vector-valued Markov chain martingales and applies them to TD learning for consistency guarantees matching asymptotic variance up to logs and O(T^{-1/4} log T) Gaussian approximation rate.

Step 1: findκ.Due to the Markovian property, the matrixV k is a function ofs k−1 for everyk∈[n]

fields

years

verdicts

representative citing papers

citing papers explorer