When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently ?

Ziang Song, Song Mei, Yu Bai · 2021 · arXiv 2110.04184

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

read on arXiv browse 5 citing papers

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

Taming the Curses of Multiagency in Robust Markov Games with Large State Space through Linear Function Approximation

cs.LG · 2026-05-04 · unverdicted · novelty 8.0

The work gives the first algorithms for general robust Markov games with linear function approximation whose sample complexity breaks the curse of multiagency for large state spaces in both generative and online settings.

Sample-efficient inductive matrix completion with noise and inexact side-information

stat.ML · 2026-05-16 · unverdicted · novelty 7.0

A projected gradient descent algorithm for noisy inductive matrix completion achieves linear convergence and stable recovery at sample complexity governed by side-information dimension, extending to inexact side-information with optimal error degradation.

Finite-Time Analysis of Q-Value Iteration for General-Sum Stackelberg Games

cs.LG · 2026-04-06 · unverdicted · novelty 7.0

Provides the first finite-time convergence guarantees for Q-value iteration in general-sum Stackelberg Markov games.

Corruption-robust Offline Multi-agent Reinforcement Learning From Human Feedback

cs.LG · 2026-03-30 · unverdicted · novelty 7.0

Introduces robust estimators for linear Markov games in offline MARLHF that achieve O(ε^{1-o(1)}) or O(√ε) bounds on Nash or CCE gaps under uniform or unilateral coverage.

Learning Strategic Value and Cooperation in Multi-Player Stochastic Games through Side Payments

cs.GT · 2023-03-09 · unverdicted · novelty 7.0

Introduces HS-S (aggregating dynamic threat powers) and Coco-S (fixed points of statewise HS Bellman operator) for stochastic games, proves they coincide for two players but disagree for three, shows uniqueness via extended axioms and topological degree theory, and gives sampling estimators.

citing papers explorer

Showing 5 of 5 citing papers.

Taming the Curses of Multiagency in Robust Markov Games with Large State Space through Linear Function Approximation cs.LG · 2026-05-04 · unverdicted · none · ref 19
The work gives the first algorithms for general robust Markov games with linear function approximation whose sample complexity breaks the curse of multiagency for large state spaces in both generative and online settings.
Sample-efficient inductive matrix completion with noise and inexact side-information stat.ML · 2026-05-16 · unverdicted · none · ref 157
A projected gradient descent algorithm for noisy inductive matrix completion achieves linear convergence and stable recovery at sample complexity governed by side-information dimension, extending to inexact side-information with optimal error degradation.
Finite-Time Analysis of Q-Value Iteration for General-Sum Stackelberg Games cs.LG · 2026-04-06 · unverdicted · none · ref 12
Provides the first finite-time convergence guarantees for Q-value iteration in general-sum Stackelberg Markov games.
Corruption-robust Offline Multi-agent Reinforcement Learning From Human Feedback cs.LG · 2026-03-30 · unverdicted · none · ref 12
Introduces robust estimators for linear Markov games in offline MARLHF that achieve O(ε^{1-o(1)}) or O(√ε) bounds on Nash or CCE gaps under uniform or unilateral coverage.
Learning Strategic Value and Cooperation in Multi-Player Stochastic Games through Side Payments cs.GT · 2023-03-09 · unverdicted · none · ref 22
Introduces HS-S (aggregating dynamic threat powers) and Coco-S (fixed points of statewise HS Bellman operator) for stochastic games, proves they coincide for two players but disagree for three, shows uniqueness via extended axioms and topological degree theory, and gives sampling estimators.

When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently ?

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer