PC3D trains decentralized policies to recover and use personalized coordination context from local histories, enabling higher returns than baselines on variable-roster cooperative MARL tasks with both seen and unseen team sizes.
Multi-agent deep reinforcement learning: a survey.Artificial Intelligence Review, 55(2):895–943
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields
cs.LG 2years
2026 2verdicts
UNVERDICTED 2roles
background 1polarities
background 1representative citing papers
Smart Commander applies hierarchical reinforcement learning to optimize sequential maintenance, sortie generation, and resource allocation decisions across a military aircraft fleet, outperforming flat DRL and rule-based methods in a custom simulation.
citing papers explorer
-
PC3D: Zero-Shot Cooperation Across Variable Rosters via Personalized Context Distillation
PC3D trains decentralized policies to recover and use personalized coordination context from local histories, enabling higher returns than baselines on variable-roster cooperative MARL tasks with both seen and unseen team sizes.
-
Smart Commander: A Hierarchical Reinforcement Learning Framework for Fleet-Level PHM Decision Optimization
Smart Commander applies hierarchical reinforcement learning to optimize sequential maintenance, sortie generation, and resource allocation decisions across a military aircraft fleet, outperforming flat DRL and rule-based methods in a custom simulation.