pith:U62UDQVY
Knowing When to Quit: A Principled Framework for Dynamic Abstention in LLM Reasoning
Modeling abstention as an RL action lets LLMs stop unpromising reasoning when value drops below reward
arxiv:2604.18419 v4 · 2026-04-20 · cs.LG · cs.CL · stat.ML
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{U62UDQVYCT42ORT674ES3WOGT5}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
We show that abstaining when the value function falls below this reward strictly outperforms natural baselines under general conditions.
That the value function can be approximated accurately enough during generation to make the threshold rule reliable, and that the regularized RL formulation faithfully captures the token-by-token decision process in real LLMs.
A regularized RL framework for mid-generation abstention in LLMs shows that stopping when the value function falls below a reward threshold strictly improves selective accuracy over baselines.
Receipt and verification
| First computed | 2026-05-26T02:05:09.541440Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
a7b541c2b814f9a7467eff092dd9c69f70c64c238c36e3ac032cbc144bccd897
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/U62UDQVYCT42ORT674ES3WOGT5 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: a7b541c2b814f9a7467eff092dd9c69f70c64c238c36e3ac032cbc144bccd897
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "4a38d24792bc36c2a3fa74d9f3ec5de4e5130d01677fc8562d7f67b7db34990d",
"cross_cats_sorted": [
"cs.CL",
"stat.ML"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.LG",
"submitted_at": "2026-04-20T15:38:45Z",
"title_canon_sha256": "df6fafc24c5b84e0d6a5f7fd1acc530d806a9e6f39dc0dd6a16cbf20101f576c"
},
"schema_version": "1.0",
"source": {
"id": "2604.18419",
"kind": "arxiv",
"version": 4
}
}