LLMs Corrupt Your Documents When You Delegate

· 2026 · cs.CL · arXiv 2604.15597

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Large Language Models (LLMs) are poised to disrupt knowledge work, with the emergence of delegated work as a new interaction paradigm (e.g., vibe coding). Delegation requires trust - the expectation that the LLM will faithfully execute the task without introducing errors into documents. We introduce DELEGATE-52 to study the readiness of AI systems in delegated workflows. DELEGATE-52 simulates long delegated workflows that require in-depth document editing across 52 professional domains, such as coding, crystallography, and music notation. Our large-scale experiment with 19 LLMs reveals that current models degrade documents during delegation: even frontier models (Gemini 3.1 Pro, Claude 4.6 Opus, GPT 5.4) corrupt an average of 25% of document content by the end of long workflows, with other models failing more severely. Additional experiments reveal that agentic tool use does not improve performance on DELEGATE-52, and that degradation severity is exacerbated by document size, length of interaction, or presence of distractor files. Our analysis shows that current LLMs are unreliable delegates: they introduce sparse but severe errors that silently corrupt documents, compounding over long interaction.

representative citing papers

When Summaries Distort Decisions: Information Fidelity in LLM-Compressed Financial Analysis

cs.AI · 2026-06-28 · unverdicted · novelty 5.0

LLM-based compression of financial source material can alter downstream investment decisions via decontextualization and model dependency, addressed by an agentic auditing approach that checks multiple compressions against the original.

citing papers explorer

Showing 1 of 1 citing paper.

When Summaries Distort Decisions: Information Fidelity in LLM-Compressed Financial Analysis cs.AI · 2026-06-28 · unverdicted · none · ref 3 · internal anchor
LLM-based compression of financial source material can alter downstream investment decisions via decontextualization and model dependency, addressed by an agentic auditing approach that checks multiple compressions against the original.

LLMs Corrupt Your Documents When You Delegate

fields

years

verdicts

representative citing papers

citing papers explorer