A new Gradient TD Algorithm with only One Step-size: Convergence Rate Analysis using L-λ Smoothness

Yao, Hengshuai · 2023

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Offline Two-Player Zero-Sum Markov Games with KL Regularization

cs.LG · 2026-05-13 · unverdicted · novelty 8.0

KL regularization enables Õ(1/n) convergence for offline Nash equilibria in zero-sum Markov games under unilateral concentrability via the ROSE framework and SOS-MD algorithm.

Almost Sure Convergence Rates of Stochastic Approximation and Reinforcement Learning via a Poisson-Moreau Drift

cs.LG · 2026-05-08 · unverdicted · novelty 7.0

Establishes almost sure convergence rates arbitrarily close to o(n^{1-2η}) for power-law rates η in (1/2,1) and o(n^{-1}) for harmonic rates in contractive stochastic approximation with Markovian noise.

Fast Rates in $\alpha$-Potential Games via Regularized Mirror Descent

cs.GT · 2026-04-30 · unverdicted · novelty 7.0 · 2 refs

Proposes the OPMD algorithm, which achieves accelerated O(1/n) rates for offline Nash equilibrium learning in $\alpha$-potential games via reference-anchored data coverage.

Pessimism-Free Offline Learning in General-Sum Games via KL Regularization

cs.LG · 2026-04-30 · unverdicted · novelty 7.0 · 2 refs

KL regularization enables pessimism-free offline learning in general-sum games: GANE recovers regularized Nash equilibria at the accelerated rate O(1/n), and GAMD converges to coarse correlated equilibria at the standard rate O(1/√n + 1/T).

citing papers explorer

Showing 4 of 4 citing papers.

Offline Two-Player Zero-Sum Markov Games with KL Regularization

cs.LG · 2026-05-13 · unverdicted · none · ref 103

Almost Sure Convergence Rates of Stochastic Approximation and Reinforcement Learning via a Poisson-Moreau Drift

cs.LG · 2026-05-08 · unverdicted · none · ref 66

Fast Rates in $\alpha$-Potential Games via Regularized Mirror Descent

cs.GT · 2026-04-30 · unverdicted · none · ref 139 · 2 links

Pessimism-Free Offline Learning in General-Sum Games via KL Regularization

cs.LG · 2026-04-30 · unverdicted · none · ref 130 · 2 links
