SpecFaaS: Accelerating Serverless Applications with Speculative Function Execution
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.DC (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
B-PASTE: Beam-Aware Pattern-Guided Speculative Execution for Resource-Constrained LLM Agents
B-PASTE uses beam-aware speculation of tool-call branches ranked by critical-path reduction to deliver up to 1.4x end-to-end speedup in resource-constrained LLM agents.