Limited reasoning space: The cage of long-horizon reasoning in llms, 2026

Zhenyu Li, Guanlin Wu, Cheems Wang, Yongqiang Zhao · 2026 · arXiv 2602.19281

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

State commitment learning: training language models to distinguish computation from memory

cs.LG · 2026-05-22 · unverdicted · novelty 7.0

Introduces state commitment learning and Counterfactual Erasure RL (CERL) to train models to commit only persistent state, reducing answer dependence on hidden thoughts across math, logic, QA, and tool-use tasks without accuracy loss.

citing papers explorer

Showing 1 of 1 citing paper.

State commitment learning: training language models to distinguish computation from memory cs.LG · 2026-05-22 · unverdicted · none · ref 8
Introduces state commitment learning and Counterfactual Erasure RL (CERL) to train models to commit only persistent state, reducing answer dependence on hidden thoughts across math, logic, QA, and tool-use tasks without accuracy loss.

Limited reasoning space: The cage of long-horizon reasoning in llms, 2026

fields

years

verdicts

representative citing papers

citing papers explorer