Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents

Zhan, Qiusi, Fang, Richard, Panchal, Henil Shalin, Kang, Daniel , year = · 2025 · arXiv 2503.00061

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Beyond Pattern Matching: Seven Cross-Domain Techniques for Prompt Injection Detection

cs.CR · 2026-04-20 · unverdicted · novelty 7.0 · 2 refs

The work introduces and partially evaluates seven cross-domain prompt injection detectors, reporting F1 gains on benchmarks like deepset/prompt-injections and indirect-injection sets via local alignment, stylometry, and fatigue tracking.

Adversarial Feeds Steer LLM Agent Decisions Against Their Defaults

cs.AI · 2026-05-30 · conditional · novelty 6.0

Controlled experiments show adversarial feeds can tip uncertain LLM agent decisions from 5% to 100% alignment with the feed while leaving firmly held defaults unchanged, following a dose-response pattern across multiple models and domains.

Prompt Injection as Role Confusion

cs.CL · 2026-02-22

citing papers explorer

Showing 1 of 1 citing paper after filters.

Beyond Pattern Matching: Seven Cross-Domain Techniques for Prompt Injection Detection cs.CR · 2026-04-20 · unverdicted · none · ref 24 · 2 links
The work introduces and partially evaluates seven cross-domain prompt injection detectors, reporting F1 gains on benchmarks like deepset/prompt-injections and indirect-injection sets via local alignment, stylometry, and fatigue tracking.

Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents

fields

years

verdicts

representative citing papers

citing papers explorer