arXiv preprint arXiv:2510.15414 , year =

Creative problem solving in knowledge-rich contexts · 2026 · arXiv 2510.15414

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Equilibrium Residuals Expose Three Regimes of Matrix-Game Strategic Reasoning in Language Models

cs.LG · 2026-05-11 · unverdicted · novelty 7.0

LLMs rely on semantic cues for matrix-game equilibria but can acquire approximate computation via residual training on small instances, with a Lipschitz proof enabling transfer to larger anonymous games.

Playing with Words, Improving with Rewards: Training Language Models for Creative Association

cs.CL · 2026-05-27 · unverdicted · novelty 6.0

LLMs trained on Codenames via RLVR exhibit scale-dependent effects: the 8B model gains on creativity benchmarks while smaller models gain on reasoning benchmarks.

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

cs.AI · 2026-04-20 · unverdicted · novelty 6.0

Agent-World autonomously synthesizes verifiable real-world tasks and uses continuous self-evolution to train 8B and 14B agents that outperform proprietary models on 23 benchmarks.

citing papers explorer

Showing 3 of 3 citing papers after filters.

Equilibrium Residuals Expose Three Regimes of Matrix-Game Strategic Reasoning in Language Models cs.LG · 2026-05-11 · unverdicted · none · ref 15
LLMs rely on semantic cues for matrix-game equilibria but can acquire approximate computation via residual training on small instances, with a Lipschitz proof enabling transfer to larger anonymous games.
Playing with Words, Improving with Rewards: Training Language Models for Creative Association cs.CL · 2026-05-27 · unverdicted · none · ref 3
LLMs trained on Codenames via RLVR exhibit scale-dependent effects: the 8B model gains on creativity benchmarks while smaller models gain on reasoning benchmarks.
Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence cs.AI · 2026-04-20 · unverdicted · none · ref 124
Agent-World autonomously synthesizes verifiable real-world tasks and uses continuous self-evolution to train 8B and 14B agents that outperform proprietary models on 23 benchmarks.

arXiv preprint arXiv:2510.15414 , year =

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer