
arxiv: 1712.07305 · v1 · pith:GMDTLLTYnew · submitted 2017-12-20 · 💻 cs.AI

Revisiting the Master-Slave Architecture in Multi-Agent Deep Reinforcement Learning

classification 💻 cs.AI
keywords perspectives, agents, architecture, deep, idea, learning, many, master-slave

Many tasks in artificial intelligence require the collaboration of multiple agents. We examine deep reinforcement learning for multi-agent domains. Recent research efforts often take one of two seemingly conflicting perspectives: the decentralized perspective, where each agent is supposed to have its own controller, and the centralized perspective, where one assumes there is a larger model controlling all agents. In this regard, we revisit the idea of the master-slave architecture by incorporating both perspectives within one framework. Such a hierarchical structure naturally leverages the advantages of both. The idea of combining both perspectives is intuitive and can be well motivated by many real-world systems. However, out of a variety of possible realizations, we highlight three key ingredients: composed action representation, learnable communication, and independent reasoning. With network designs that facilitate these explicitly, our proposal consistently outperforms the latest competing methods both in synthetic experiments and when applied to challenging StarCraft micromanagement tasks.
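To make the three ingredients named in the abstract more concrete, here is a toy forward pass in plain Python. It is only an illustrative sketch and not the paper's network: the abstract does not specify the architecture, so the class name `MasterSlavePolicy`, the weight shapes, the mean-pooling of messages, the shared slave weights, and the additive combination of logits are all assumptions made for this example. Each slave produces its own action logits from its local observation (independent reasoning), sends a message to the master (communication, with weights that would be learned in practice), and the final policy adds the master's logits to the slave's (a composed action).

```python
import math
import random


def linear(weights, vec):
    """Multiply a weight matrix (a list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]


def rand_matrix(rows, cols, rng):
    """Random weights standing in for parameters that would be trained."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    top = max(logits)
    exps = [math.exp(v - top) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


class MasterSlavePolicy:
    """Hypothetical toy model; names and shapes are illustrative assumptions."""

    def __init__(self, obs_dim, msg_dim, n_actions, seed=0):
        rng = random.Random(seed)
        # Slave weights are shared across agents here (an assumption).
        self.w_msg = rand_matrix(msg_dim, obs_dim, rng)      # slave -> master message
        self.w_slave = rand_matrix(n_actions, obs_dim, rng)  # slave's own logits
        # The master sees the pooled message plus the message of the agent it advises.
        self.w_master = rand_matrix(n_actions, 2 * msg_dim, rng)

    def act(self, observations):
        """Return one action distribution per agent."""
        # Communication: every slave encodes its local observation as a message.
        messages = [[math.tanh(v) for v in linear(self.w_msg, obs)]
                    for obs in observations]
        # The master summarizes all messages (mean pooling is an assumption).
        pooled = [sum(col) / len(messages) for col in zip(*messages)]
        policies = []
        for obs, msg in zip(observations, messages):
            slave_logits = linear(self.w_slave, obs)             # independent reasoning
            master_logits = linear(self.w_master, pooled + msg)  # centralized view
            # Composed action: both contributions are summed before the softmax.
            composed = [s + m for s, m in zip(slave_logits, master_logits)]
            policies.append(softmax(composed))
        return policies


policy = MasterSlavePolicy(obs_dim=4, msg_dim=2, n_actions=5)
observations = [[0.1, 0.2, 0.3, 0.4], [0.0, -0.1, 0.5, 0.2], [0.3, 0.3, -0.2, 0.1]]
distributions = policy.act(observations)
```

In this sketch, `distributions` holds one probability vector over five actions for each of the three agents. Because the master's input includes the pooled message, an agent's policy can depend on what the other agents observe, while the slave term depends only on the agent's own observation.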


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Obstacle-aware navigation of smart microswimmers in a turbulent flow

    physics.flu-dyn 2026-03 unverdicted novelty 5.0

    Obstacle-aware adversarial Q-learning lets smart microswimmers navigate turbulent flows with obstacles better than naive swimmers or surfers.