The llama 3 herd of models,

· 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Comparative Characterization of KV Cache Management Strategies for LLM Inference

cs.AR · 2026-04-06 · unverdicted · novelty 3.0

Benchmarks of vLLM, InfiniGen, and H2O identify conditions under which each KV cache strategy delivers the best trade-off between memory consumption and inference performance.

citing papers explorer

Showing 1 of 1 citing paper.

Comparative Characterization of KV Cache Management Strategies for LLM Inference cs.AR · 2026-04-06 · unverdicted · none · ref 16
Benchmarks of vLLM, InfiniGen, and H2O identify conditions under which each KV cache strategy delivers the best trade-off between memory consumption and inference performance.

The llama 3 herd of models,

fields

years

verdicts

representative citing papers

citing papers explorer