Measurement study finds that LLM serving systems sacrifice 60-93% of throughput to meet human-centric TTFT/TPOT SLOs that are unnecessary for programmatic long-horizon tasks.
1 Pith paper cites this work. Polarity classification is still being indexed.
Fields: cs.NI (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
- Human-Less LLM Serving: Quantifying the Human Tax on Throughput