Securing Multi-Agent Systems Against Corruptions via Node Contribution Backpropagation

3 Pith papers cite this work. Polarity classification is still indexing.

abstract

Multi-Agent Systems (MAS) have become a prevalent paradigm for Large Language Model (LLM) applications. However, the complex multi-agent design in MAS introduces unique trustworthiness concerns: adversarial agents can inject misleading information that propagates contagiously through the system, corrupting benign agents and leading to false outputs. Existing graph-based defenses model agents as nodes and communications as edges, yet they are limited to static graphs. In this paper, we propose a dynamic defense paradigm that models MAS communication as a signed directed acyclic graph and computes each agent's contribution to the final decision via backward propagation, enabling accurate identification and isolation of malicious agents to secure multi-agent task collaboration. Experimental results in complex and dynamic MAS environments demonstrate that our method notably outperforms existing MAS defense mechanisms, providing an effective guardrail for trustworthy MAS deployment. Our code is available at https://github.com/ChengcanWu/BPD.
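
The abstract's core idea, scoring each agent by propagating contribution backward through a signed directed acyclic graph, can be illustrated with a toy sketch. This is not the paper's algorithm or the code in the linked repository. The function names, the edge weights, the sign convention (positive when the receiver adopted a message, negative when it opposed it), and the rule of flagging agents with a negative score are all assumptions made for illustration.

```python
from collections import defaultdict


def backpropagate_contributions(edges, topo_order, sink):
    """Toy contribution scores on a signed DAG (illustrative assumption only).

    edges: list of (sender, receiver, signed_weight) tuples.
    topo_order: all nodes in topological order (senders before receivers).
    sink: the node that emits the final decision.
    """
    incoming = defaultdict(list)
    for sender, receiver, weight in edges:
        incoming[receiver].append((sender, weight))

    contribution = defaultdict(float)
    contribution[sink] = 1.0  # the final decision carries unit credit

    # Walk from the sink back toward the sources. In reverse topological
    # order a node's score is final before it is passed to its senders.
    for node in reversed(topo_order):
        for sender, weight in incoming[node]:
            contribution[sender] += contribution[node] * weight
    return dict(contribution)


def flag_suspects(contribution, threshold=0.0):
    """Hypothetical rule: flag agents whose score falls below the threshold."""
    return sorted(n for n, score in contribution.items() if score < threshold)


# Hypothetical run: agents A and B feed C, and both B and C feed the decider D.
edges = [
    ("A", "C", 0.5),   # C adopted A's message
    ("B", "C", -0.5),  # C opposed B's message
    ("C", "D", 1.0),   # D adopted C's message
    ("B", "D", -0.5),  # D opposed B's message
]
scores = backpropagate_contributions(edges, ["A", "B", "C", "D"], "D")
suspects = flag_suspects(scores)
```

In this hypothetical run, agent B ends with a negative score because every path from B to the final decision passes through an opposing edge, so it is the only agent flagged.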

fields

cs.CR 3

years

2026 3

verdicts

UNVERDICTED 3

roles

background 1

polarities

unclear 1
