pith:G7JVWCF7
VeriCache: Turning Lossy KV Cache into Lossless LLM Inference
VeriCache achieves identical outputs to full-KV-cache decoding at up to 4 times higher throughput by drafting with compressed caches and verifying in parallel.
arxiv:2605.17613 v1 · 2026-05-17 · cs.AR · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{G7JVWCF7HAP27HN3I25TCYOTUM}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
VeriCache achieves up to 4X higher throughput than full-KV inference while producing identical outputs.
Compressed-KV decoding can be parallelized with full-KV swap because one is HBM-bandwidth-bound and the other is PCIe/network-bound, and the compressed KV cache often produces output similar to the full KV cache allowing a long drafting horizon to amortize each swap.
VeriCache turns lossy KV cache compression into lossless LLM inference by drafting with compressed cache and verifying drafts with full cache, achieving up to 4x throughput with identical outputs.
References
Formal links
Receipt and verification
| First computed | 2026-05-20T00:04:48.583788Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
37d35b08bf381faf9dbb46bb3161d3a309ffa9da2bfcdc609f0ac473be4a05f4
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/G7JVWCF7HAP27HN3I25TCYOTUM \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 37d35b08bf381faf9dbb46bb3161d3a309ffa9da2bfcdc609f0ac473be4a05f4
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "e959bc3e2213d0ef4b3f5827bce07ced4011e605cc285e908904109d420dc309",
"cross_cats_sorted": [
"cs.LG"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.AR",
"submitted_at": "2026-05-17T19:18:39Z",
"title_canon_sha256": "f17e8e43a9a9b82cf8e4f1ecc1104f52d5fe1fbd1ea5049711c2adf10a457ff9"
},
"schema_version": "1.0",
"source": {
"id": "2605.17613",
"kind": "arxiv",
"version": 1
}
}