SDOF combines an RLHF-trained intent router with a state-aware dispatcher using finite automata to constrain multi-agent orchestration, reporting 80.9% routing accuracy and 86.5% task completion on a recruitment platform while blocking unsafe actions.
MemoryArena: Benchmarking agent memory in inter- dependent multi-session agentic tasks.arXiv preprint, 2026
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.AI 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
SDOF: Taming the Alignment Tax in Multi-Agent Orchestration with State-Constrained Dispatch
SDOF combines an RLHF-trained intent router with a state-aware dispatcher using finite automata to constrain multi-agent orchestration, reporting 80.9% routing accuracy and 86.5% task completion on a recruitment platform while blocking unsafe actions.