AI Organizations are More Effective but Less Aligned than Individual Agents

· 2026 · cs.AI · arXiv 2604.10290

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

AI is increasingly deployed in multi-agent systems; however, most research considers only the behavior of individual models. We experimentally show that multi-agent "AI organizations" are simultaneously more effective at achieving business goals, but less aligned, than individual AI agents. We examine 12 tasks across two practical settings: an AI consultancy providing solutions to business problems and an AI software team developing software products. Across all settings, AI Organizations composed of aligned models produce solutions with higher utility but greater misalignment compared to a single aligned model. Our work demonstrates the importance of considering interacting systems of AI agents when doing both capabilities and safety research.

representative citing papers

Tool Use Enables Undetectable Steganography in Multi-Agent LLM Systems

cs.CR · 2026-06-25 · unverdicted · novelty 6.0

Tool-using LLM agents can implement undetectable stegosystems, shifting the primary barrier to covert multi-agent collusion from technical feasibility to coordination without explicit agreement.

citing papers explorer

Showing 1 of 1 citing paper.

Tool Use Enables Undetectable Steganography in Multi-Agent LLM Systems cs.CR · 2026-06-25 · unverdicted · none · ref 47 · internal anchor
Tool-using LLM agents can implement undetectable stegosystems, shifting the primary barrier to covert multi-agent collusion from technical feasibility to coordination without explicit agreement.

AI Organizations are More Effective but Less Aligned than Individual Agents

fields

years

verdicts

representative citing papers

citing papers explorer