Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time

Liu, Z · 2023

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

LKV: End-to-End Learning of Head-wise Budgets and Token Selection for LLM KV Cache Eviction

cs.LG · 2026-04-22 · conditional · novelty 6.0

LKV learns task-optimized global budgets and intrinsic KV token importance without attention matrices, delivering near-lossless performance at 15% cache retention on LongBench.

citing papers explorer

Showing 1 of 1 citing paper.

LKV: End-to-End Learning of Head-wise Budgets and Token Selection for LLM KV Cache Eviction

cs.LG · 2026-04-22 · conditional · none · ref 22

LKV learns task-optimized global budgets and intrinsic KV token importance without attention matrices, delivering near-lossless performance at 15% cache retention on LongBench.
