Title resolution pending

Bin Gao, Zhuomin He, Puru Sharma, Qingxuan Kang, Djordje Jevdjic, Junbo Deng, Xingkun Yang, Zhou Yu, Pengfei Zuo · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

TokenCake: A KV-Cache-centric Serving Framework for LLM-based Multi-Agent Applications

cs.DC · 2025-10-21 · unverdicted · novelty 6.0

TokenCake introduces agent-aware temporal and spatial schedulers for KV cache management in LLM multi-agent serving, claiming over 47% lower end-to-end latency and up to 16.9% better GPU memory utilization than vLLM on representative benchmarks.

citing papers explorer

Showing 1 of 1 citing paper.

TokenCake: A KV-Cache-centric Serving Framework for LLM-based Multi-Agent Applications cs.DC · 2025-10-21 · unverdicted · none · ref 4
TokenCake introduces agent-aware temporal and spatial schedulers for KV cache management in LLM multi-agent serving, claiming over 47% lower end-to-end latency and up to 16.9% better GPU memory utilization than vLLM on representative benchmarks.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer