Collaborative decoding makes visual auto-regressive modeling efficient

· 2024 · arXiv 2411.17787

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

HACK++: Towards More Effective Head-Aware Key-Value Compression for Efficient Visual Autoregressive Modeling

cs.CV · 2026-06-06 · unverdicted · novelty 7.0

HACK++ is a head-aware KV cache compression framework for VAR models that decouples current-scale attention from historical cache under adaptive per-head budgets to achieve near-lossless generation at 30% attention and 10% cache budgets.

citing papers explorer

Showing 1 of 1 citing paper.

HACK++: Towards More Effective Head-Aware Key-Value Compression for Efficient Visual Autoregressive Modeling

cs.CV · 2026-06-06 · unverdicted · none · ref 54
