pith:3RES4GZV
Reasoning Can Be Restored by Correcting a Few Decision Tokens
Base LLMs lose most reasoning ability at a few early planning tokens that can be fixed by brief stronger-model intervention.
arxiv:2605.16874 v1 · 2026-05-16 · cs.AI
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{3RES4GZVLQLXB7VYV6PPRKXDTJ}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Across benchmarks, the reasoning advantage is highly sparse and concentrates on a small set of early, planning-related decision tokens. For instance, on Qwen3-0.6B, only ~8% of generated tokens account for the salient disagreement, and these tokens concentrate early in the response, are strongly enriched in planning-related decisions (17x), and coincide with high base-model uncertainty.
That the positions of high likelihood-based distributional disagreement between base and reasoning models are precisely the causal points where the base model’s early planning errors derail the subsequent reasoning trajectory, rather than merely correlated symptoms.
Reasoning gaps between base LLMs and LRMs concentrate on ~8% of early planning tokens; intervening with the reasoning model only at high-disagreement positions recovers performance.
References
Formal links
Receipt and verification
| First computed | 2026-05-20T00:03:27.622281Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
dc492e1b355c1770feb8af9ef8aae39a46d3136310fa37c18d8d4f6af2791272
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/3RES4GZVLQLXB7VYV6PPRKXDTJ \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: dc492e1b355c1770feb8af9ef8aae39a46d3136310fa37c18d8d4f6af2791272
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "3eb0e59a986b0dd21b782a2e73c6c5252b1f6a3e0f75243f097c982a4f0a5045",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.AI",
"submitted_at": "2026-05-16T08:33:31Z",
"title_canon_sha256": "c0004652b2b530242bee41cf3e561ad9c0833ae09b31c751047edb1048d36ad9"
},
"schema_version": "1.0",
"source": {
"id": "2605.16874",
"kind": "arxiv",
"version": 1
}
}