pith:ZW67IC5T
Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding
Diffusion LLMs can reach up to 27 times higher throughput by adding a reusable block-wise KV cache and decoding only high-confidence tokens in parallel.
arxiv:2505.22618 v3 · 2025-05-28 · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{ZW67IC5TVTEVM75APPZUYYMUZL}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Experimental results on LLaDA and Dream models across multiple LLM benchmarks demonstrate up to 27.6× throughput improvement with minimal accuracy loss, closing the performance gap with autoregressive models.
That the block-wise approximate KV cache introduces only negligible performance drop and that a single confidence threshold can be chosen to preserve generation quality across benchmarks without post-hoc per-task retuning.
Fast-dLLM adds reusable KV cache blocks and selective parallel decoding to diffusion LLMs, closing most of the speed gap with autoregressive models without retraining.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:49.113794Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
cdbdf40bb3acc9567fa07bf34c6194cac65e7a7edf1940d6d865ac759c9a4607
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/ZW67IC5TVTEVM75APPZUYYMUZL \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: cdbdf40bb3acc9567fa07bf34c6194cac65e7a7edf1940d6d865ac759c9a4607
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "e44c8d1bf337f28a9eef6d07f8920fbb29ab4a8cc9ed4ba42ac5d50a4bbe688a",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by-nc-nd/4.0/",
"primary_cat": "cs.CL",
"submitted_at": "2025-05-28T17:39:15Z",
"title_canon_sha256": "1d06872d1f805a26fa9fccdb0e1669fc666f60fd1dd3785d349b212af66fdf32"
},
"schema_version": "1.0",
"source": {
"id": "2505.22618",
"kind": "arxiv",
"version": 3
}
}