The illusion of role separation: Hidden shortcuts in llm role learning (and how to fix them).arXiv preprint arXiv:2505.00626,

Wang, Z · arXiv 2505.00626

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

VATS: Exploiting Implicit Authority in Error-Path Injection via Systematic Mutation

cs.AI · 2026-06-06 · unverdicted · novelty 6.0

Error messages in the Model Context Protocol can be systematically mutated across seven dimensions to triple indirect prompt injection success rates, reaching up to 100% compliance on four frontier models.

citing papers explorer

Showing 1 of 1 citing paper after filters.

VATS: Exploiting Implicit Authority in Error-Path Injection via Systematic Mutation cs.AI · 2026-06-06 · unverdicted · none · ref 14
Error messages in the Model Context Protocol can be systematically mutated across seven dimensions to triple indirect prompt injection success rates, reaching up to 100% compliance on four frontier models.

The illusion of role separation: Hidden shortcuts in llm role learning (and how to fix them).arXiv preprint arXiv:2505.00626,

fields

years

verdicts

representative citing papers

citing papers explorer