Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents

Han Liu; Kaiqing Zhang; Tamer Ba\c{s}ar; Tong Zhang; Zhuoran Yang

arxiv: 1802.08757 · v2 · pith:QPRW2DV4new · submitted 2018-02-23 · 💻 cs.LG · cs.AI· cs.MA· math.OC· stat.ML

Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents

Kaiqing Zhang , Zhuoran Yang , Han Liu , Tong Zhang , Tamer Ba\c{s}ar This is my paper

classification 💻 cs.LG cs.AIcs.MAmath.OCstat.ML

keywords agentsalgorithmsdecentralizedfullynetworkagentfunctionfunctions

0 comments

read the original abstract

We consider the problem of \emph{fully decentralized} multi-agent reinforcement learning (MARL), where the agents are located at the nodes of a time-varying communication network. Specifically, we assume that the reward functions of the agents might correspond to different tasks, and are only known to the corresponding agent. Moreover, each agent makes individual decisions based on both the information observed locally and the messages received from its neighbors over the network. Within this setting, the collective goal of the agents is to maximize the globally averaged return over the network through exchanging information with their neighbors. To this end, we propose two decentralized actor-critic algorithms with function approximation, which are applicable to large-scale MARL problems where both the number of states and the number of agents are massively large. Under the decentralized structure, the actor step is performed individually by each agent with no need to infer the policies of others. For the critic step, we propose a consensus update via communication over the network. Our algorithms are fully incremental and can be implemented in an online fashion. Convergence analyses of the algorithms are provided when the value functions are approximated within the class of linear functions. Extensive simulation results with both linear and nonlinear function approximations are presented to validate the proposed algorithms. Our work appears to be the first study of fully decentralized MARL algorithms for networked agents with function approximation, with provable convergence guarantees.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

MOSAIC at ELT: Design and First Performance Results of Novel Robotic Optical-Relay Positioners
astro-ph.IM 2026-06 unverdicted novelty 4.0

Design and first performance results of novel robotic optical-relay positioners for the MOSAIC instrument on the ELT.
Thermal Characterization of a 6-Positioner, 6.2-mm-Pitch Module for Stage-5 Telescopes
astro-ph.IM 2026-06 unverdicted novelty 2.0

Thermal qualification tests on 6.2-mm-pitch fiber positioners confirm stable repeatability, backlash, and linearity across -20°C to 30°C with no degradation.