Shiyi Cao, Shu Liu, Tyler Griggs, Peter Schafhalter, Xiaoxuan Liu, Ying Sheng, Joseph E. Gonzalez, Matei Zaharia, and Ion Stoica · 2025

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

KVDrive: A Holistic Multi-Tier KV Cache Management System for Long-Context LLM Inference

cs.CL · 2026-05-18 · unverdicted · novelty 6.0

KVDrive introduces a multi-tier KV cache management system that achieves up to 1.74x higher throughput for long-context LLM inference through adaptive cache placement, pipeline restructuring, and cross-tier coordination while preserving accuracy.

citing papers explorer

KVDrive: A Holistic Multi-Tier KV Cache Management System for Long-Context LLM Inference

cs.CL · 2026-05-18 · unverdicted · none · ref 4
