Multi-Agent Generative Adversarial Imitation Learning

Dorsa Sadigh; Hongyu Ren; Jiaming Song; Stefano Ermon

arxiv: 1807.09936 · v1 · pith:FSD2ZS3Vnew · submitted 2018-07-26 · 💻 cs.LG · cs.AI· cs.MA· stat.ML

Multi-Agent Generative Adversarial Imitation Learning

Jiaming Song , Hongyu Ren , Dorsa Sadigh , Stefano Ermon This is my paper

classification 💻 cs.LG cs.AIcs.MAstat.ML

keywords learningmulti-agentimitationenvironmentsmultipleusedaccessactor-critic

0 comments

read the original abstract

Imitation learning algorithms can be used to learn a policy from expert demonstrations without access to a reward signal. However, most existing approaches are not applicable in multi-agent settings due to the existence of multiple (Nash) equilibria and non-stationary environments. We propose a new framework for multi-agent imitation learning for general Markov games, where we build upon a generalized notion of inverse reinforcement learning. We further introduce a practical multi-agent actor-critic algorithm with good empirical performance. Our method can be used to imitate complex behaviors in high-dimensional environments with multiple cooperative or competing agents.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

LLM-Enhanced Multi-Agent Reinforcement Learning with Expert Workflow for Real-Time P2P Energy Trading
cs.MA 2025-07 unverdicted novelty 6.0

An LLM-enhanced MARL system with differential attention critic produces lower economic costs and voltage violations than baselines in simulated real-time P2P electricity trading.