pith. sign in

arxiv: 2303.13539 · v1 · pith:75KAXS2Pnew · submitted 2023-03-16 · 💻 cs.LG · cs.GT

Decentralized Multi-Agent Reinforcement Learning for Continuous-Space Stochastic Games

classification 💻 cs.LG cs.GT
keywords gameslearningmarlmulti-agentstochasticdecentralizedgeneralpolicy
0
0 comments X
read the original abstract

Stochastic games are a popular framework for studying multi-agent reinforcement learning (MARL). Recent advances in MARL have focused primarily on games with finitely many states. In this work, we study multi-agent learning in stochastic games with general state spaces and an information structure in which agents do not observe each other's actions. In this context, we propose a decentralized MARL algorithm and we prove the near-optimality of its policy updates. Furthermore, we study the global policy-updating dynamics for a general class of best-reply based algorithms and derive a closed-form characterization of convergence probabilities over the joint policy space.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.