Entangled QMARL agents approach the Tsirelson bound of 0.854 in CHSH while unentangled versions match classical baselines, and hybrid quantum-classical setups outperform both in CoopNav.
Multi-agent reinforcement learning: A selective overview of theories and algorithms
4 Pith papers cite this work. Polarity classification is still indexing.
years
2026 4representative citing papers
PACMAB is a perception-aware two-sided learning framework for multi-platform mobile crowdsensing that models the setting as a dynamic hypergame and achieves at least 41% more completed tasks than benchmarks in simulations without assuming complete information.
CAMCO enforces policy constraints on multi-agent AI at deployment time via convex projection, risk-weighted Lagrangian shaping, and bounded-convergence negotiation, yielding zero violations and 92-97% utility in tested enterprise scenarios.
The paper provides stability criteria for multi-agent systems with heterogeneous model predictive game controllers and quantifies sensitivity of equilibria to objective misspecifications.
citing papers explorer
-
Quantum Advantage in Multi Agent Reinforcement Learning
Entangled QMARL agents approach the Tsirelson bound of 0.854 in CHSH while unentangled versions match classical baselines, and hybrid quantum-classical setups outperform both in CoopNav.
-
Dynamic Hypergame for Task Assignment in Multi-platform Mobile Crowdsensing Under Incomplete Information
PACMAB is a perception-aware two-sided learning framework for multi-platform mobile crowdsensing that models the setting as a dynamic hypergame and achieves at least 41% more completed tasks than benchmarks in simulations without assuming complete information.
-
Safe and Policy-Compliant Multi-Agent Orchestration for Enterprise AI
CAMCO enforces policy constraints on multi-agent AI at deployment time via convex projection, risk-weighted Lagrangian shaping, and bounded-convergence negotiation, yielding zero violations and 92-97% utility in tested enterprise scenarios.
-
Stability and Sensitivity Analysis for Objective Misspecifications Among Model Predictive Game Controllers
The paper provides stability criteria for multi-agent systems with heterogeneous model predictive game controllers and quantifies sensitivity of equilibria to objective misspecifications.