Learning to Cooperate via Policy Search

Leonid Peshkin, Kee-Eung Kim, Nicolas Meuleau, Leslie Pack Kaelbling · 2001 · cs.LG · arXiv cs/0105032

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

Cooperative games are those in which both agents share the same payoff structure. Value-based reinforcement-learning algorithms, such as variants of Q-learning, have been applied to learning cooperative games, but they only apply when the game state is completely observable to both agents. Policy search methods are a reasonable alternative to value-based methods for partially observable environments. In this paper, we provide a gradient-based distributed policy-search method for cooperative games and compare the notion of local optimum to that of Nash equilibrium. We demonstrate the effectiveness of this method experimentally in a small, partially observable simulated soccer domain.

representative citing papers

Cross-Modal Navigation with Multi-Agent Reinforcement Learning

cs.RO · 2026-05-07 · unverdicted · novelty 5.0

CRONA is a MARL framework that uses modality-specialized agents with auxiliary beliefs and a centralized multi-modal critic to achieve better performance and efficiency than single-agent baselines on visual-acoustic navigation tasks.

Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic

cs.AI · 2026-01-29

citing papers explorer

Showing 2 of 2 citing papers.

Cross-Modal Navigation with Multi-Agent Reinforcement Learning cs.RO · 2026-05-07 · unverdicted · none · ref 69
CRONA is a MARL framework that uses modality-specialized agents with auxiliary beliefs and a centralized multi-modal critic to achieve better performance and efficiency than single-agent baselines on visual-acoustic navigation tasks.
Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic cs.AI · 2026-01-29 · unreviewed · ref 23 · internal anchor

Learning to Cooperate via Policy Search

fields

years

verdicts

representative citing papers

citing papers explorer