Harsh Jhamtani and Taylor Berg-Kirkpatrick

Senqiao Yang, Yukang Chen, Zhuotao Tian, Chengyao Wang, Jingyao Li, Bei Yu, Jiaya Jia · 2018 · arXiv 2509.00419

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

CIVIC: End-to-End Sequence Compactness for Efficient Vision-Language Models

cs.AI · 2026-05-27 · unverdicted · novelty 6.0

CIVIC is a path-consistent compact visual inference framework that reduces KV-cache memory to approximately one-third and end-to-end latency in VLMs while preserving accuracy via text-aligned KL distillation and adaptive spatial retention.

HybridKV: Hybrid KV Cache Compression for Efficient Multimodal Large Language Model Inference

cs.AI · 2026-04-07 · unverdicted · novelty 6.0

HybridKV reduces KV cache memory by up to 7.9x and speeds decoding by 1.52x in MLLMs with almost no performance loss by classifying heads into static and dynamic types and compressing them differently.

citing papers explorer

Showing 2 of 2 citing papers after filters.

CIVIC: End-to-End Sequence Compactness for Efficient Vision-Language Models cs.AI · 2026-05-27 · unverdicted · none · ref 2
CIVIC is a path-consistent compact visual inference framework that reduces KV-cache memory to approximately one-third and end-to-end latency in VLMs while preserving accuracy via text-aligned KL distillation and adaptive spatial retention.
HybridKV: Hybrid KV Cache Compression for Efficient Multimodal Large Language Model Inference cs.AI · 2026-04-07 · unverdicted · none · ref 1
HybridKV reduces KV cache memory by up to 7.9x and speeds decoding by 1.52x in MLLMs with almost no performance loss by classifying heads into static and dynamic types and compressing them differently.

Harsh Jhamtani and Taylor Berg-Kirkpatrick

fields

years

verdicts

representative citing papers

citing papers explorer