arXiv preprint arXiv:2402.01704 , year=

Ian Gemp, Roma Patel, Yoram Bachrach, Marc Lanctot, Vibhavari Dasagi, Luke Marris, Georgios Piliouras, Siqi Liu, Karl Tuyls · 2024 · arXiv 2402.01704

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

representative citing papers

Communicate-Predict-Act: Evaluating Social Intelligence of Agents

cs.CY · 2026-04-09 · unverdicted · novelty 7.0

A new evaluation framework for LLM social intelligence finds that influence, transparency, and adaptability predict agent success in games better than theory of mind or deep planning, with metrics achieving AUC 0.82 in predicting pairwise outcomes.

Common-agency Games for Multi-Objective Test-Time Alignment

cs.GT · 2026-05-08 · unverdicted · novelty 6.0

CAGE uses common-agency games and an EPEC algorithm to compute equilibrium policies that balance multiple conflicting objectives for test-time LLM alignment.

Distilling Game Code World Model Generation into Lightweight Large Language Models

cs.AI · 2026-05-23 · unverdicted · novelty 4.0

SFT followed by RLVR on Qwen2.5-3B-Instruct raises syntactic and execution correctness when generating Game Code World Models across 30 games.

citing papers explorer

Showing 3 of 3 citing papers after filters.

Communicate-Predict-Act: Evaluating Social Intelligence of Agents cs.CY · 2026-04-09 · unverdicted · none · ref 11
A new evaluation framework for LLM social intelligence finds that influence, transparency, and adaptability predict agent success in games better than theory of mind or deep planning, with metrics achieving AUC 0.82 in predicting pairwise outcomes.
Common-agency Games for Multi-Objective Test-Time Alignment cs.GT · 2026-05-08 · unverdicted · none · ref 55
CAGE uses common-agency games and an EPEC algorithm to compute equilibrium policies that balance multiple conflicting objectives for test-time LLM alignment.
Distilling Game Code World Model Generation into Lightweight Large Language Models cs.AI · 2026-05-23 · unverdicted · none · ref 10
SFT followed by RLVR on Qwen2.5-3B-Instruct raises syntactic and execution correctness when generating Game Code World Models across 30 games.

arXiv preprint arXiv:2402.01704 , year=

fields

years

verdicts

representative citing papers

citing papers explorer