An initial introduction to cooperative multi-agent reinforcement learning

Christopher Amato, “An initial introduction to cooperative multi-agent reinforcement learning”, arXiv preprint arXiv:2405.06161, 2024

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Distilling Collaborative Dynamics into Latent Space for Implicit Coordination in Decentralized Multi-Agent Manipulation

cs.RO · 2026-06-22 · unverdicted · novelty 6.0 · 2 refs

CLS-DP distills privileged multi-agent dynamics into a collaborative latent space that each agent infers from local RGB observations to condition diffusion-based actions, achieving 38% mean success on six RoboFactory tasks versus 20% for the best centralized baseline.

Robust Instruction Compliance in Cooperative Multi-Agent Reinforcement Learning

cs.AI · 2026-05-12 · unverdicted · novelty 6.0

MAVIC corrects Bellman backups at instruction boundaries by adjusting the incoming objective and restoring continuation value, enabling consistent estimation under stochastic instruction switching in cooperative MARL.

citing papers explorer


Distilling Collaborative Dynamics into Latent Space for Implicit Coordination in Decentralized Multi-Agent Manipulation cs.RO · 2026-06-22 · unverdicted · none · ref 19 · 2 links
Robust Instruction Compliance in Cooperative Multi-Agent Reinforcement Learning cs.AI · 2026-05-12 · unverdicted · none · ref 17
