pith:HKNIHO4Q
EchoKV: Efficient KV Cache Compression via Similarity-Based Reconstruction
EchoKV compresses the KV cache by reconstructing discarded components from retained ones using attention head similarities.
arxiv:2603.22910 v2 · 2026-03-24 · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{HKNIHO4QFLZG2VZKX2YRGXOCFR}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
EchoKV consistently outperforms existing methods across multiple compression ratios and backbone models while preserving the throughput of full-cache inference in short-context scenarios.
The method assumes that intrinsic inter-layer and intra-layer similarities among attention heads are stable and informative enough for a lightweight network to reconstruct the discarded KV components accurately, without introducing errors that degrade downstream performance.
EchoKV compresses LLM KV caches by reconstructing missing components from partial data via inter- and intra-layer attention similarities, outperforming prior methods on LongBench and RULER while supporting on-demand full-cache inference.
Receipt and verification
| Field | Value |
|---|---|
| First computed | 2026-05-18T03:09:22.581986Z |
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) |
| Schema | pith-number/v1.0 |
Canonical hash
3a9a83bb902af26d572abeb1135dc22c45a496fe6d2f1aed316d8d677dd3a4a6
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/HKNIHO4QFLZG2VZKX2YRGXOCFR \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 3a9a83bb902af26d572abeb1135dc22c45a496fe6d2f1aed316d8d677dd3a4a6
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "83b6017948d1ddfa7fe6d32c85acbd709353e81c36033790db0a1379dd017175",
    "cross_cats_sorted": [],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2026-03-24T07:58:42Z",
    "title_canon_sha256": "0237fd64785f5c9872c9b505fb8aae054d36de633f147c5171a02e95736711a1"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2603.22910",
    "kind": "arxiv",
    "version": 2
  }
}
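The same check can be run offline against the record as printed on this page. The following is a minimal sketch, assuming the JSON above is exactly the `canonical_record` the endpoint returns. The function name `canonical_sha256` and the constants are illustrative and not part of any Pith tooling. The serialization settings (sorted keys, compact separators, no ASCII escaping, UTF-8) are copied from the one-liner in "Verify this Pith Number yourself".

```python
import hashlib
import json

# Canonical record as shown on this page (assumed to match the API response).
RECORD_JSON = """
{
  "metadata": {
    "abstract_canon_sha256": "83b6017948d1ddfa7fe6d32c85acbd709353e81c36033790db0a1379dd017175",
    "cross_cats_sorted": [],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2026-03-24T07:58:42Z",
    "title_canon_sha256": "0237fd64785f5c9872c9b505fb8aae054d36de633f147c5171a02e95736711a1"
  },
  "schema_version": "1.0",
  "source": {"id": "2603.22910", "kind": "arxiv", "version": 2}
}
"""

# Canonical hash listed on this page.
EXPECTED = "3a9a83bb902af26d572abeb1135dc22c45a496fe6d2f1aed316d8d677dd3a4a6"


def canonical_sha256(record: dict) -> str:
    """Serialize with sorted keys and compact separators, then SHA-256 the UTF-8 bytes."""
    payload = json.dumps(
        record,
        sort_keys=True,
        separators=(",", ":"),
        ensure_ascii=False,
    ).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


if __name__ == "__main__":
    digest = canonical_sha256(json.loads(RECORD_JSON))
    print(digest)
    print("match" if digest == EXPECTED else "mismatch")
```

Because keys are sorted before hashing, the digest does not depend on the key order or whitespace of the JSON text that was parsed. A mismatch would suggest that the record shown here differs from the one the hash was computed over.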