pith:Z4MBO6YK
KV Cache Offloading for Context-Intensive Tasks
KV-cache offloading causes major accuracy losses on tasks that require extracting many details from long inputs, but a simpler alternative recovers performance across models.
arxiv:2604.08426 v4 · 2026-04-09 · cs.LG · cs.AI · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{Z4MBO6YKC3ON2JYN5WVCJJK75X}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
Existing KV-cache offloading techniques produce significant performance degradation on context-intensive tasks; a simpler alternative strategy significantly improves accuracy across multiple LLM families and benchmarks.
The analysis assumes that the observed accuracy drops are caused primarily by low-rank key projections and unreliable landmarks, rather than by other implementation details of the offloading systems or by the specific choice of evaluation prompts and metrics.
KV offloading degrades accuracy on context-intensive tasks due to low-rank key projections and unreliable landmarks; a simpler alternative improves results across models and benchmarks.
Receipt and verification
| First computed | 2026-05-20T00:01:41.123824Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
cf18177b0a16dcdd270dedaa24a55fede162bbe452ca8530d2a411ac612a2100
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/Z4MBO6YKC3ON2JYN5WVCJJK75X \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: cf18177b0a16dcdd270dedaa24a55fede162bbe452ca8530d2a411ac612a2100
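The same check can be written as a short Python script, which may be easier to read or reuse than the one-liner. This is a minimal sketch that mirrors the serialization in the command above (sorted keys, compact separators, no ASCII escaping, UTF-8 bytes); the names `canonical_hash`, `verify`, and `RECORD` are illustrative and not part of any Pith tooling. `RECORD` is copied from the canonical record shown on this page rather than fetched over the network.

```python
import hashlib
import json

# Canonical record as displayed on this page (copied by hand, not fetched).
RECORD = {
    "metadata": {
        "abstract_canon_sha256": "87d9201feef55e531e4c7771d4acb9404df4e293b23a2de9748b705a17c2adf4",
        "cross_cats_sorted": ["cs.AI", "cs.CL"],
        "license": "http://creativecommons.org/licenses/by/4.0/",
        "primary_cat": "cs.LG",
        "submitted_at": "2026-04-09T16:30:44Z",
        "title_canon_sha256": "a95638f90930d2a3d264e375f1f001c33755d2ca4274281faf5cafcd6bac51d3",
    },
    "schema_version": "1.0",
    "source": {"id": "2604.08426", "kind": "arxiv", "version": 4},
}


def canonical_hash(record: dict) -> str:
    """SHA-256 hex digest of the record, serialized as in the shell one-liner."""
    payload = json.dumps(
        record,
        sort_keys=True,          # key order must not affect the digest
        separators=(",", ":"),   # compact form, no whitespace
        ensure_ascii=False,      # keep non-ASCII characters unescaped
    ).encode()                   # str.encode() defaults to UTF-8
    return hashlib.sha256(payload).hexdigest()


def verify(record: dict, expected_hex: str) -> bool:
    """Compare the computed digest with an expected hex string, ignoring case."""
    return canonical_hash(record) == expected_hex.strip().lower()


# Prints the digest to compare by eye against the canonical hash on this page.
print(canonical_hash(RECORD))
```

If the record above matches what the service returns, the printed digest should equal the value in the `# expect:` line; a mismatch would suggest either a transcription difference or a different serialization on the server side.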
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "87d9201feef55e531e4c7771d4acb9404df4e293b23a2de9748b705a17c2adf4",
    "cross_cats_sorted": [
      "cs.AI",
      "cs.CL"
    ],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.LG",
    "submitted_at": "2026-04-09T16:30:44Z",
    "title_canon_sha256": "a95638f90930d2a3d264e375f1f001c33755d2ca4274281faf5cafcd6bac51d3"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2604.08426",
    "kind": "arxiv",
    "version": 4
  }
}