SAGA reduces AI agent task completion time by 1.64x on 64-GPU clusters by scheduling at the full workflow level with execution graphs, affinity batching, and completion-time fairness.
A New Presumed Commit Optimization for Two Phase Commit
4 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 4roles
background 2polarities
background 2representative citing papers
Second-generation serverless platforms using lightweight isolates and edge deployment achieve roughly 10 ms warm latency and negligible cold starts, according to architecture analysis of seven platforms and microbenchmarks totaling over 38 million function calls.
A roadmap and explanatory guide to seminal papers in computer systems.
citing papers explorer
-
SAGA: Workflow-Atomic Scheduling for AI Agent Inference on GPU Clusters
SAGA reduces AI agent task completion time by 1.64x on 64-GPU clusters by scheduling at the full workflow level with execution graphs, affinity batching, and completion-time fairness.
-
New Kids: An Architecture and Performance Investigation of Second-Generation Serverless Platforms
Second-generation serverless platforms using lightweight isolates and edge deployment achieve roughly 10 ms warm latency and negligible cold starts, according to architecture analysis of seven platforms and microbenchmarks totaling over 38 million function calls.
-
The Computer System Trail
A roadmap and explanatory guide to seminal papers in computer systems.
- Declarative Data Services: Structured Agentic Discovery for Composing Data Systems