Fast k-nearest neighbour search via prioritized dci

Ke Li, Jitendra Malik · 2081

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

IceCache: Memory-efficient KV-cache Management for Long-Sequence LLMs

cs.LG · 2026-04-12 · unverdicted · novelty 6.0

IceCache combines semantic token clustering with PagedAttention to keep only 25% of the KV cache tokens while retaining 99% accuracy on LongBench and matching or beating prior offloading methods in latency.

citing papers explorer

Showing 1 of 1 citing paper.

IceCache: Memory-efficient KV-cache Management for Long-Sequence LLMs cs.LG · 2026-04-12 · unverdicted · none · ref 6
IceCache combines semantic token clustering with PagedAttention to keep only 25% of the KV cache tokens while retaining 99% accuracy on LongBench and matching or beating prior offloading methods in latency.

Fast k-nearest neighbour search via prioritized dci

fields

years

verdicts

representative citing papers

citing papers explorer