Vidur: A large-scale simulation framework for llm inference.Proceedings of Machine Learning and Systems, 6:351–366

Amey Agrawal, Nitin Kedia, Jayashree Mohan, Ashish Panwar, Nipun Kwatra, Bhargav S Gulavani, Ramachandran Ramjee, Alexey Tumanov · 2024

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

CCL-Bench 1.0: A Trace-Based Benchmark for LLM Infrastructure

cs.DC · 2026-05-07 · unverdicted · novelty 7.0

CCL-Bench packages traces and metadata to compute detailed compute, memory, and communication efficiency metrics, surfacing performance insights unavailable from end-to-end benchmarks.

Flow-Controlled Scheduling for LLM Inference with Provable Stability Guarantees

cs.LG · 2026-04-13 · unverdicted · novelty 6.0

A flow-control framework for LLM inference derives necessary and sufficient stability conditions and experimentally improves throughput, latency, and KV cache stability over common baselines.

citing papers explorer

Showing 2 of 2 citing papers.

CCL-Bench 1.0: A Trace-Based Benchmark for LLM Infrastructure cs.DC · 2026-05-07 · unverdicted · none · ref 2
CCL-Bench packages traces and metadata to compute detailed compute, memory, and communication efficiency metrics, surfacing performance insights unavailable from end-to-end benchmarks.
Flow-Controlled Scheduling for LLM Inference with Provable Stability Guarantees cs.LG · 2026-04-13 · unverdicted · none · ref 2
A flow-control framework for LLM inference derives necessary and sufficient stability conditions and experimentally improves throughput, latency, and KV cache stability over common baselines.

Vidur: A large-scale simulation framework for llm inference.Proceedings of Machine Learning and Systems, 6:351–366

fields

years

verdicts

representative citing papers

citing papers explorer