Wu and Aviv Tamar and Jean Harb and Pieter Abbeel and Igor Mordatch , title =

Ryan Lowe, Yi I

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Insider Attacks in Multi-Agent LLM Consensus Systems

cs.MA · 2026-05-08 · unverdicted · novelty 5.0

A malicious agent in multi-agent LLM consensus systems can be trained via a surrogate world model and RL to reduce consensus rates and prolong disagreement more effectively than direct prompt attacks.

citing papers explorer

Showing 1 of 1 citing paper.

Insider Attacks in Multi-Agent LLM Consensus Systems cs.MA · 2026-05-08 · unverdicted · none · ref 150
A malicious agent in multi-agent LLM consensus systems can be trained via a surrogate world model and RL to reduce consensus rates and prolong disagreement more effectively than direct prompt attacks.

Wu and Aviv Tamar and Jean Harb and Pieter Abbeel and Igor Mordatch , title =

fields

years

verdicts

representative citing papers

citing papers explorer