Multi-agent reinforcement learning: A selective overview of theories and algorithms

Kaiqing Zhang, Zhuoran Yang, Tamer Başar · 2021 · arXiv 1911.10635

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Quantum Advantage in Multi Agent Reinforcement Learning

cs.LG · 2026-05-14 · conditional · novelty 6.0

Entangled QMARL agents approach the Tsirelson bound of 0.854 in CHSH while unentangled versions match classical baselines, and hybrid quantum-classical setups outperform both in CoopNav.

Dynamic Hypergame for Task Assignment in Multi-platform Mobile Crowdsensing Under Incomplete Information

cs.NI · 2026-05-05 · unverdicted · novelty 6.0

PACMAB is a perception-aware two-sided learning framework for multi-platform mobile crowdsensing that models the setting as a dynamic hypergame and achieves at least 41% more completed tasks than benchmarks in simulations without assuming complete information.

Safe and Policy-Compliant Multi-Agent Orchestration for Enterprise AI

cs.AI · 2026-04-19 · unverdicted · novelty 5.0

CAMCO enforces policy constraints on multi-agent AI at deployment time via convex projection, risk-weighted Lagrangian shaping, and bounded-convergence negotiation, yielding zero violations and 92-97% utility in tested enterprise scenarios.

Stability and Sensitivity Analysis for Objective Misspecifications Among Model Predictive Game Controllers

eess.SY · 2026-04-09 · unverdicted · novelty 5.0

The paper provides stability criteria for multi-agent systems with heterogeneous model predictive game controllers and quantifies sensitivity of equilibria to objective misspecifications.

citing papers explorer

Showing 4 of 4 citing papers.

Quantum Advantage in Multi Agent Reinforcement Learning
cs.LG · 2026-05-14 · conditional · none · ref 1

Dynamic Hypergame for Task Assignment in Multi-platform Mobile Crowdsensing Under Incomplete Information
cs.NI · 2026-05-05 · unverdicted · none · ref 41

Safe and Policy-Compliant Multi-Agent Orchestration for Enterprise AI
cs.AI · 2026-04-19 · unverdicted · none · ref 1

Stability and Sensitivity Analysis for Objective Misspecifications Among Model Predictive Game Controllers
eess.SY · 2026-04-09 · unverdicted · none · ref 10
