B.5 Proof of Corollary 3.2 Part of the proof is inspired by the proof of Lemma B.8 in Cattaneo et al

Hence, the triangle inequality directly yields Wn ≤Tr(Σ n) + r q 1−λ F√n log 1 2 1 δ dν dµ µ,p ! (76) The theorem follows by combining (75), (76) using a union bound argument, taking σ2 =Tr(Σ n) + r q 1−λ F√n log 1 2 1 δ dν dµ µ,p ! · 2022

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Uncertainty quantification for Markov chain induced martingales with application to temporal difference learning

stat.ML · 2025-02-19 · unverdicted · novelty 7.0

Derives novel high-dimensional concentration inequalities for vector-valued Markov chain martingales and applies them to TD learning for consistency guarantees matching asymptotic variance up to logs and O(T^{-1/4} log T) Gaussian approximation rate.

citing papers explorer

Showing 1 of 1 citing paper.

Uncertainty quantification for Markov chain induced martingales with application to temporal difference learning stat.ML · 2025-02-19 · unverdicted · none · ref 1
Derives novel high-dimensional concentration inequalities for vector-valued Markov chain martingales and applies them to TD learning for consistency guarantees matching asymptotic variance up to logs and O(T^{-1/4} log T) Gaussian approximation rate.

B.5 Proof of Corollary 3.2 Part of the proof is inspired by the proof of Lemma B.8 in Cattaneo et al

fields

years

verdicts

representative citing papers

citing papers explorer