Pareto actor-critic for equilibrium selection in multi-agent reinforcement learning

Christianos, F · 2023 · arXiv 2209.14344

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Centralized Adaptive Sampling for Reliable Co-Training of Independent Multi-Agent Policies

cs.LG · 2025-08-01 · unverdicted · novelty 5.0

CoSER adaptively samples joint actions in CTDE MARL to reduce sampling error relative to the joint on-policy distribution, empirically improving reliability of independent policy gradient convergence.

Fog of Love: Engineering Virtuous Agent Behavior with Affinity-based Reinforcement Learning in a Game Environment

cs.AI · 2026-06-03 · unverdicted · novelty 4.0

Localized affinity regularization improves multi-agent performance on both competitive and cooperative objectives in a Fog of Love environment compared to standard MADDPG.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Centralized Adaptive Sampling for Reliable Co-Training of Independent Multi-Agent Policies cs.LG · 2025-08-01 · unverdicted · none · ref 4
CoSER adaptively samples joint actions in CTDE MARL to reduce sampling error relative to the joint on-policy distribution, empirically improving reliability of independent policy gradient convergence.
Fog of Love: Engineering Virtuous Agent Behavior with Affinity-based Reinforcement Learning in a Game Environment cs.AI · 2026-06-03 · unverdicted · none · ref 50
Localized affinity regularization improves multi-agent performance on both competitive and cooperative objectives in a Fog of Love environment compared to standard MADDPG.

Pareto actor-critic for equilibrium selection in multi-agent reinforcement learning

fields

years

verdicts

representative citing papers

citing papers explorer