A.2 Model Licenses We employ open-weight language models accessed via the Hugging Face Transformers library (Wolf et al., 2020)

GSM-∞:Released by the authors at Infini- AI-Lab (GitHub), this synthetic dataset is generated programmatically, is used in accordance with the terms specified in the repository · 2020 · arXiv 4470.0783

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

DepthKV: Layer-Dependent KV Cache Pruning for Long-Context LLM Inference

cs.CL · 2026-04-27 · unverdicted · novelty 5.0

DepthKV allocates a fixed global KV cache budget across LLM layers based on per-layer pruning sensitivity, outperforming uniform pruning at the same overall budget.

citing papers explorer

Showing 1 of 1 citing paper.

DepthKV: Layer-Dependent KV Cache Pruning for Long-Context LLM Inference cs.CL · 2026-04-27 · unverdicted · none · ref 7
DepthKV allocates a fixed global KV cache budget across LLM layers based on per-layer pruning sensitivity, outperforming uniform pruning at the same overall budget.

A.2 Model Licenses We employ open-weight language models accessed via the Hugging Face Transformers library (Wolf et al., 2020)

fields

years

verdicts

representative citing papers

citing papers explorer