Evoflux applies evolutionary search at inference time to repair executable tool workflows for compact agents, outperforming SFT and SFT+DPO on held-out MCP-Bench tasks with live servers and 250 tools.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics:
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
OCC-RAG develops task-specialized SLMs (0.6B and 1.7B) via a new synthetic data pipeline for multi-hop reasoning and context faithfulness, claiming to match or exceed 2-6x larger general models on HotpotQA, MuSiQue, TAT-QA, ConFiQA, and MuSiQue-Un.
citing papers explorer
-
Evoflux: Inference-Time Evolution of Executable Tool Workflows for Compact Agents
Evoflux applies evolutionary search at inference time to repair executable tool workflows for compact agents, outperforming SFT and SFT+DPO on held-out MCP-Bench tasks with live servers and 250 tools.