pith:UDLFAG2B
Why Do Reasoning Models Lose Coverage? The Role of Data and Forks in the Road
Fine-tuning data with ambiguous decision points causes reasoning models to lose coverage.
arxiv:2605.17026 v1 · 2026-05-16 · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{UDLFAG2B7A46W3H7LB2RUCD3D4}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
We hypothesize that this behavior is driven by properties of the fine-tuning data, specifically related to decision points or 'forks in the road' scenarios where model faces indecipherable patterns with multiple valid reasoning paths.
The controlled case studies using graph branching and reasoning modes accurately capture the decision-point dynamics present in real fine-tuning datasets for reasoning models.
Coverage shrinkage after SFT in reasoning models correlates with prevalence of decision-point scenarios in data and can be partially mitigated by targeted data synthesis and diversity-aware decoding.
References
Formal links
Receipt and verification
| First computed | 2026-05-20T00:03:36.598466Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
a0d6501b41f839eb6cff58751a087b1f00e1514d87ce92f330b0564b43cfde37
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/UDLFAG2B7A46W3H7LB2RUCD3D4 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: a0d6501b41f839eb6cff58751a087b1f00e1514d87ce92f330b0564b43cfde37
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "d14dbf0a3b6d1cbe75b6f97fc24d666061829bdaafd3036adf4687a76ca40ddb",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.LG",
"submitted_at": "2026-05-16T14:55:12Z",
"title_canon_sha256": "d7a0d58a5af3dcdca97a65fc5a3285d611baf18f0803c7355c6a7bf183dc665c"
},
"schema_version": "1.0",
"source": {
"id": "2605.17026",
"kind": "arxiv",
"version": 1
}
}