CORRECT: COndensed eRror RECognition via knowledge Transfer in multi-agent systems

Bryan Wang; Fan Lai; Jinmiao Fu; Moyan Li; Shaoyuan Xu; Xinhai Hou; Yifan Yu

arxiv: 2509.24088 · v2 · pith:KZ3N6RZA · submitted 2025-09-28 · 💻 cs.MA

Yifan Yu, Moyan Li, Shaoyuan Xu, Jinmiao Fu, Xinhai Hou, Fan Lai, Bryan Wang

classification 💻 cs.MA

keywords error, across, correct, failure, recognition, systems, trajectories, while

Multi-agent systems (MAS) are increasingly capable of tackling complex real-world tasks, yet their reliance on inter-agent coordination, tool use, and long-horizon reasoning makes error recognition particularly challenging. Minor errors can propagate across agents and escalate into task failures, producing long, intertwined execution trajectories that are costly for both human developers and automated systems to debug and analyze. Our key insight is that, despite surface differences in failure trajectories (e.g., logs), MAS errors often recur with similar structural patterns. This paper presents CORRECT, the first lightweight, training-free framework that leverages an online cache of distilled error schemata to recognize failure structures and transfer that knowledge to new requests. This cache-based reuse allows LLMs to perform targeted error localization at inference time, avoiding expensive retraining while adapting to dynamic MAS deployments in under a second. To support rigorous study in this domain, we also introduce CORRECT-Error, a large-scale dataset of over 2,000 annotated trajectories collected through a novel error-injection pipeline guided by real-world distributions and further validated through human evaluation to ensure alignment with natural failure patterns. Experiments across seven diverse MAS applications show that CORRECT improves step-level error localization by up to 19.8% over existing approaches at near-zero overhead, substantially narrowing the gap between automated and human-level error recognition.
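The cache-and-reuse idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `ErrorSchema` fields, the `SchemaCache` class, the token-overlap (Jaccard) similarity, and the prompt wording are all assumptions chosen for illustration, since the abstract does not specify how schemata are represented or matched.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class ErrorSchema:
    """Hypothetical distilled record of one past failure (fields are assumed)."""
    signature: str  # short description of the structural failure pattern
    guidance: str   # hint about where such errors tend to sit in a trajectory


def jaccard(a: str, b: str) -> float:
    """Token-set overlap; a stand-in for whatever matching the paper uses."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


@dataclass
class SchemaCache:
    """In-memory cache that grows online as new failures are distilled."""
    entries: list[ErrorSchema] = field(default_factory=list)

    def add(self, schema: ErrorSchema) -> None:
        self.entries.append(schema)

    def lookup(self, trajectory: str, k: int = 1) -> list[ErrorSchema]:
        """Return the k cached schemata most similar to the trajectory text."""
        ranked = sorted(
            self.entries,
            key=lambda s: jaccard(s.signature, trajectory),
            reverse=True,
        )
        return ranked[:k]


def build_prompt(trajectory: str, hints: list[ErrorSchema]) -> str:
    """Prepend retrieved schemata so an LLM can localize the faulty step."""
    hint_lines = "\n".join(f"- {h.signature}: {h.guidance}" for h in hints)
    return (
        "Known failure patterns:\n"
        f"{hint_lines}\n\n"
        "Trajectory:\n"
        f"{trajectory}\n\n"
        "Which step introduced the error?"
    )


cache = SchemaCache()
cache.add(ErrorSchema(
    "tool call returned empty result and agent continued",
    "check the step right after the tool call",
))
cache.add(ErrorSchema(
    "planner assigned subtask to wrong agent",
    "check the planning step",
))

trajectory = (
    "step 3: search tool call returned empty result; "
    "step 4: agent continued with stale data"
)
prompt = build_prompt(trajectory, cache.lookup(trajectory, k=1))
```

Because the lookup only ranks cached entries and edits a prompt, no model weights change, which is the sense in which such a scheme is training-free. A real system would likely replace the token-overlap score with a stronger similarity measure.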

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Conformal Agent Error Attribution
cs.LG · 2026-05 · unverdicted · novelty 6.0

A new filtration-based conformal prediction method attributes errors in multi-agent systems by producing contiguous sequence sets with finite-sample coverage guarantees, enabling rollback recovery.

The Workload-Router-Pool Architecture for LLM Inference Optimization: A Vision Paper from the vLLM Semantic Router Project
cs.LG · 2026-03 · unverdicted · novelty 5.0

The Workload-Router-Pool architecture is a 3D framework for LLM inference optimization that synthesizes prior vLLM work into a 3x3 interaction matrix and proposes 21 research directions at the intersections.