Concurrency without Model Changes: Future-based Asynchronous Function Calling for LLMs

· 2026 · cs.CL · arXiv 2605.15077

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Function calling, also known as tool use, is a core capability of modern LLM agents but is typically constrained by synchronous execution semantics. Under these semantics, LLM decoding is blocked until each function call completes, resulting in increasing end-to-end latency. In this work, we introduce AsyncFC, a pure execution-layer framework that decouples LLM decoding from function execution, enabling overlap between model decoding and function execution as well as inter-function parallelism when dependencies permit. AsyncFC layers over existing models and unmodified function implementations, requiring no fine-tuning or changes to the standard synchronous function-calling protocol. Across standard function-calling benchmarks and adapted software engineering benchmarks, AsyncFC significantly reduces end-to-end task completion time while preserving task accuracy. Furthermore, these results reveal that LLMs possess a native capability to reason over symbolic futures that represent unresolved execution results, enabling an asynchronous paradigm for model-tool interaction.

representative citing papers

Ghost Tool Calls: Issue-Time Privacy for Speculative Agent Tools

cs.CR · 2026-06-01 · unverdicted · novelty 6.0

Ghost tool calls from speculative dispatch create persistent intent leaks that only issue-time policies changing or suppressing call arguments or destinations can reduce, per evaluations of twelve policies on three corpora.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Ghost Tool Calls: Issue-Time Privacy for Speculative Agent Tools cs.CR · 2026-06-01 · unverdicted · none · ref 6 · internal anchor
Ghost tool calls from speculative dispatch create persistent intent leaks that only issue-time policies changing or suppressing call arguments or destinations can reduce, per evaluations of twelve policies on three corpora.

Concurrency without Model Changes: Future-based Asynchronous Function Calling for LLMs

fields

years

verdicts

representative citing papers

citing papers explorer