Finite- time analysis of decentralized temporal-difference learning with linear function approximation

· 2020

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Distributed TD Tracking with Linear Function Approximation over Directed Communication Networks

math.OC · 2026-05-06 · unverdicted · novelty 6.0

PP-DTD achieves linear convergence to a neighborhood of the optimum under constant step-sizes and O(T^{-1}) under decaying step-sizes for distributed TD policy evaluation in MARL over directed graphs, claimed as the first with rates comparable to single-agent TD.

citing papers explorer

Showing 1 of 1 citing paper.

Distributed TD Tracking with Linear Function Approximation over Directed Communication Networks math.OC · 2026-05-06 · unverdicted · none · ref 12
PP-DTD achieves linear convergence to a neighborhood of the optimum under constant step-sizes and O(T^{-1}) under decaying step-sizes for distributed TD policy evaluation in MARL over directed graphs, claimed as the first with rates comparable to single-agent TD.

Finite- time analysis of decentralized temporal-difference learning with linear function approximation

fields

years

verdicts

representative citing papers

citing papers explorer