arXiv preprint arXiv:2505.07215 , year=

Vivek Verma, David Huang, William Chen, Dan Klein, Nicholas Tomlin · 2025 · arXiv 2505.07215

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

GENSTRAT: Toward a Science of Strategic Reasoning in Large Language Models

cs.AI · 2026-05-22 · unverdicted · novelty 7.0

GENSTRAT generates fresh imperfect-information card games and a six-axis capability profile plus jaggedness metric to evaluate LLM strategic competence with resistance to saturation.

HEALing Entropy Collapse: Enhancing Exploration in Few-Shot RLVR via Hybrid-Domain Entropy Dynamics Alignment

cs.LG · 2026-04-20 · unverdicted · novelty 6.0

HEAL mitigates entropy collapse in few-shot RLVR by selectively adding general-domain data and aligning trajectory-level entropy dynamics, matching full-shot performance with 32 target samples.

Scalable Environments Drive Generalizable Agents

cs.AI · 2026-05-18 · unverdicted · novelty 5.0

Generalizable agents require environment scaling via diverse executable rule-sets, distinguished from trajectory and task scaling in a new taxonomy.

citing papers explorer

Showing 3 of 3 citing papers.

GENSTRAT: Toward a Science of Strategic Reasoning in Large Language Models cs.AI · 2026-05-22 · unverdicted · none · ref 24
GENSTRAT generates fresh imperfect-information card games and a six-axis capability profile plus jaggedness metric to evaluate LLM strategic competence with resistance to saturation.
HEALing Entropy Collapse: Enhancing Exploration in Few-Shot RLVR via Hybrid-Domain Entropy Dynamics Alignment cs.LG · 2026-04-20 · unverdicted · none · ref 56
HEAL mitigates entropy collapse in few-shot RLVR by selectively adding general-domain data and aligning trajectory-level entropy dynamics, matching full-shot performance with 32 target samples.
Scalable Environments Drive Generalizable Agents cs.AI · 2026-05-18 · unverdicted · none · ref 31
Generalizable agents require environment scaling via diverse executable rule-sets, distinguished from trajectory and task scaling in a new taxonomy.

arXiv preprint arXiv:2505.07215 , year=

fields

years

verdicts

representative citing papers

citing papers explorer