pith:46ATTTLZ
Fitting Is Not Enough: Smoothness in Extremely Quantized LLMs
Extremely quantized LLMs lose generation quality from smoothness degradation in token predictions, beyond numerical accuracy loss alone.
arxiv:2605.08894 v2 · 2026-05-09 · cs.CL · cs.AI
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{46ATTTLZ3TGGDKJHWKXFEUBYZ6}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Extremely quantized LLMs suffer from systematic smoothness degradation beyond numerical precision loss; preserving smoothness via a simple principle in post-training quantization and quantization-aware training brings additional gains beyond numerical accuracy.
The smoothness proxy used in the paper accurately captures the degradation that matters for generation quality, and the observed reduction in effective token candidates is a direct causal consequence of that smoothness loss rather than an artifact of the proxy or the evaluation setup.
Extremely quantized LLMs exhibit systematic smoothness degradation that reduces effective token candidates and degrades generation; a smoothness-preserving principle in PTQ and QAT delivers gains beyond numerical accuracy.
References
Formal links
Receipt and verification
| First computed | 2026-05-20T00:00:41.694431Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
e78139cd79dccc61a927b2ae525038cf8292447e1628b3a8c823407e2577ce60
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/46ATTTLZ3TGGDKJHWKXFEUBYZ6 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: e78139cd79dccc61a927b2ae525038cf8292447e1628b3a8c823407e2577ce60
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "5d517aa534d4cec95f2c9559f81d0e05402905774f9862b7e56e0368fc6a989f",
"cross_cats_sorted": [
"cs.AI"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.CL",
"submitted_at": "2026-05-09T11:19:51Z",
"title_canon_sha256": "9ff425cadc9833ff754fc12594764fc53634ceadec8ea2ff6100303586bf7cfe"
},
"schema_version": "1.0",
"source": {
"id": "2605.08894",
"kind": "arxiv",
"version": 2
}
}