pith. machine review for the scientific record. sign in

arxiv: 1907.09467 · v1 · submitted 2019-07-20 · 💻 cs.LG · cs.AI· cs.MA

Recognition: unknown

Arena: a toolkit for Multi-Agent Reinforcement Learning

Authors on Pith no claims yet
classification 💻 cs.LG cs.AIcs.MA
keywords marlinterfacesarenaagentcompatiblecooperative-competitivelearningmulti-agent
0
0 comments X
read the original abstract

We introduce Arena, a toolkit for multi-agent reinforcement learning (MARL) research. In MARL, it usually requires customizing observations, rewards and actions for each agent, changing cooperative-competitive agent-interaction, and playing with/against a third-party agent, etc. We provide a novel modular design, called Interface, for manipulating such routines in essentially two ways: 1) Different interfaces can be concatenated and combined, which extends the OpenAI Gym Wrappers concept to MARL scenarios. 2) During MARL training or testing, interfaces can be embedded in either wrapped OpenAI Gym compatible Environments or raw environment compatible Agents. We offer off-the-shelf interfaces for several popular MARL platforms, including StarCraft II, Pommerman, ViZDoom, Soccer, etc. The interfaces effectively support self-play RL and cooperative-competitive hybrid MARL. Also, Arena can be conveniently extended to your own favorite MARL platform.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.