Finite- sample analysis of contractive stochastic approximation using smooth convex envelopes.Ad- vances in Neural Information Processing Systems, 33:8223–8234

Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam · 2020

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

citation-role summary

background 1 method 1

citation-polarity summary

background 1 use method 1

representative citing papers

Quotient-Categorical Representations for Bellman-Compatible Average-Reward Distributional Reinforcement Learning

cs.LG · 2026-05-11 · unverdicted · novelty 7.0

Develops quotient-categorical representations that render the average-reward distributional Bellman operator well-defined, non-expansive, and convergent under i.i.d. and Markovian sampling.

A Finite-Iteration Theory for Asynchronous Categorical Distributional Temporal-Difference Learning

cs.LG · 2026-05-07 · unverdicted · novelty 7.0

Finite-iteration guarantees are established for asynchronous scalar categorical TD in Cramér geometry and multivariate signed-categorical TD in MMD geometry under i.i.d., Markovian, and episodic sampling.

citing papers explorer

Showing 2 of 2 citing papers.

Quotient-Categorical Representations for Bellman-Compatible Average-Reward Distributional Reinforcement Learning cs.LG · 2026-05-11 · unverdicted · none · ref 13
Develops quotient-categorical representations that render the average-reward distributional Bellman operator well-defined, non-expansive, and convergent under i.i.d. and Markovian sampling.
A Finite-Iteration Theory for Asynchronous Categorical Distributional Temporal-Difference Learning cs.LG · 2026-05-07 · unverdicted · none · ref 11
Finite-iteration guarantees are established for asynchronous scalar categorical TD in Cramér geometry and multivariate signed-categorical TD in MMD geometry under i.i.d., Markovian, and episodic sampling.

Finite- sample analysis of contractive stochastic approximation using smooth convex envelopes.Ad- vances in Neural Information Processing Systems, 33:8223–8234

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer