The Complexity of Decentralized Control of Markov Decision Processes

2013 · cs.AI · arXiv 1301.3836

1 Pith paper cites this work. Polarity classification is still indexing.


abstract

Planning for distributed agents with partial state information is considered from a decision-theoretic perspective. We describe generalizations of both the MDP and POMDP models that allow for decentralized control. For even a small number of agents, the finite-horizon problems corresponding to both of our models are complete for nondeterministic exponential time. These complexity results illustrate a fundamental difference between centralized and decentralized control of Markov processes. In contrast to the MDP and POMDP problems, the problems we consider provably do not admit polynomial-time algorithms and most likely require doubly exponential time to solve in the worst case. We have thus provided mathematical evidence corresponding to the intuition that decentralized planning problems cannot easily be reduced to centralized problems and solved exactly using established techniques.
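
As a rough illustration of where the doubly exponential figure comes from, the sketch below counts the deterministic joint policies that a brute-force search would have to consider in a finite-horizon decentralized problem. It is not the paper's proof technique, only a count of the search space under stated assumptions: each agent's policy is a tree that maps its own observation history to an action, and every agent has the same number of actions and observations. The function names and parameters are hypothetical and chosen for this example.

```python
def policy_tree_nodes(num_observations: int, horizon: int) -> int:
    """Decision nodes in one agent's policy tree of depth `horizon`.

    Assumption: the agent acts once per step and branches on each
    possible local observation, so level t has num_observations**t nodes.
    """
    return sum(num_observations ** t for t in range(horizon))


def joint_policy_count(num_agents: int, num_actions: int,
                       num_observations: int, horizon: int) -> int:
    """Number of deterministic joint policies (one tree per agent)."""
    # Each node of a tree independently picks one of num_actions actions.
    per_agent = num_actions ** policy_tree_nodes(num_observations, horizon)
    # Agents choose their trees independently of one another.
    return per_agent ** num_agents


if __name__ == "__main__":
    # Two agents, two actions, two observations: the count is
    # exponential in the tree size, which is itself exponential
    # in the horizon.
    for horizon in range(1, 6):
        print(horizon, joint_policy_count(2, 2, 2, horizon))
```

The count grows as the number of actions raised to a power that is itself exponential in the horizon, which is why exhaustive enumeration stops being practical after a handful of steps even for two agents.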

representative citing papers

Quantum Advantage in Multi Agent Reinforcement Learning

cs.LG · 2026-05-14 · conditional · novelty 6.0

Entangled QMARL agents approach the Tsirelson bound of 0.854 in CHSH while unentangled versions match classical baselines, and hybrid quantum-classical setups outperform both in CoopNav.

citing papers explorer

Showing 1 of 1 citing paper.

Quantum Advantage in Multi Agent Reinforcement Learning

cs.LG · 2026-05-14 · conditional · none · ref 9 · internal anchor
