Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents

Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents , author= · 2025 · arXiv 2503.00061

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

representative citing papers

Beyond Pattern Matching: Seven Cross-Domain Techniques for Prompt Injection Detection

cs.CR · 2026-04-20 · unverdicted · novelty 7.0 · 2 refs

The work introduces and partially evaluates seven cross-domain prompt injection detectors, reporting F1 gains on benchmarks like deepset/prompt-injections and indirect-injection sets via local alignment, stylometry, and fatigue tracking.

Security--Fidelity Tradeoffs: The Hidden Cost of Prompt Injection Defense

cs.CR · 2026-06-29 · unverdicted · novelty 6.0

Prompt injection defenses create a security-fidelity tradeoff with no model or defense achieving both high security and high fidelity on the SecFid benchmark across 1,168 examples.

Adversarial Feeds Steer LLM Agent Decisions Against Their Defaults

cs.AI · 2026-05-30 · conditional · novelty 6.0

Controlled experiments show adversarial feeds can tip uncertain LLM agent decisions from 5% to 100% alignment with the feed while leaving firmly held defaults unchanged, following a dose-response pattern across multiple models and domains.

Prompt Injection as Role Confusion

cs.CL · 2026-02-22

citing papers explorer

Showing 1 of 1 citing paper after filters.

Adversarial Feeds Steer LLM Agent Decisions Against Their Defaults cs.AI · 2026-05-30 · conditional · none · ref 14
Controlled experiments show adversarial feeds can tip uncertain LLM agent decisions from 5% to 100% alignment with the feed while leaving firmly held defaults unchanged, following a dose-response pattern across multiple models and domains.

Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents

fields

years

verdicts

representative citing papers

citing papers explorer