HeatKV ranks attention heads by their focus on prior scales using offline calibration data and applies a static per-head pruning schedule, delivering 2x higher KV-cache compression than prior methods on the Infinity-2B model with comparable image fidelity.
XPSR: Cross-Modal Priors for Diffusion-Based Image Super-Resolution
1 Pith paper cite this work, alongside 19 external citations. Polarity classification is still indexing.
1
Pith paper citing it
19
external citations · Crossref
fields
cs.CV 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
HeatKV: Head-tuned KV-cache Compression for Visual Autoregressive Modeling
HeatKV ranks attention heads by their focus on prior scales using offline calibration data and applies a static per-head pruning schedule, delivering 2x higher KV-cache compression than prior methods on the Infinity-2B model with comparable image fidelity.