pith:VNFUHXCL
HeatKV: Head-tuned KV-cache Compression for Visual Autoregressive Modeling
Head-specific attention ranking doubles KV-cache compression in visual autoregressive image models while preserving quality.
arxiv:2605.14877 v1 · 2026-05-14 · cs.CV
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{VNFUHXCLJAPFGIGBHHUNERJ47Y}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Applied to the Infinity-2B model, HeatKV achieves 2× higher compression ratio in memory allocation for KV cache compared to existing methods, while maintaining similar or better image fidelity, prompt alignment and human perception score.
That a static pruning schedule derived from attention scores on a small offline calibration set will generalize to arbitrary prompts and generation lengths without measurable quality loss.
HeatKV ranks attention heads by their focus on prior scales using offline calibration data and applies a static per-head pruning schedule, delivering 2x higher KV-cache compression than prior methods on the Infinity-2B model with comparable image fidelity.
References
Formal links
Receipt and verification
| First computed | 2026-05-17T23:38:56.081842Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
ab4b43dc4b481e5320c139e8d2453cfe073be42803dbf4c2e4b9c684cad8d1ea
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/VNFUHXCLJAPFGIGBHHUNERJ47Y \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: ab4b43dc4b481e5320c139e8d2453cfe073be42803dbf4c2e4b9c684cad8d1ea
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "1ef38dabdc5d4752e11d311d9445b475b24b4c6ac345155aa2f530c37a22414c",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.CV",
"submitted_at": "2026-05-14T14:22:34Z",
"title_canon_sha256": "4b56ae184d7afff6d28f203935951a47bae3d761028d3320a9f5096242185147"
},
"schema_version": "1.0",
"source": {
"id": "2605.14877",
"kind": "arxiv",
"version": 1
}
}