Spotweb: Running latency-sensitive distributed web services on transient cloud servers,

· 2019 · arXiv 7681.332539

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

ShuntServe: Cost-Efficient LLM Serving on Heterogeneous Spot GPU Clusters

cs.DC · 2026-06-17 · unverdicted · novelty 5.0

ShuntServe reports 1.42x and 1.35x higher throughput than baselines plus 31.9 percent and 31.2 percent cost-efficiency gains over on-demand instances for Llama-3.1-70B and Qwen3-32B on heterogeneous AWS spot clusters.

citing papers explorer

Showing 1 of 1 citing paper.

ShuntServe: Cost-Efficient LLM Serving on Heterogeneous Spot GPU Clusters cs.DC · 2026-06-17 · unverdicted · none · ref 19
ShuntServe reports 1.42x and 1.35x higher throughput than baselines plus 31.9 percent and 31.2 percent cost-efficiency gains over on-demand instances for Llama-3.1-70B and Qwen3-32B on heterogeneous AWS spot clusters.

Spotweb: Running latency-sensitive distributed web services on transient cloud servers,

fields

years

verdicts

representative citing papers

citing papers explorer