Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Adversarial Feeds Steer LLM Agent Decisions Against Their Defaults

cs.AI · 2026-05-30 · conditional · novelty 6.0

Controlled experiments show adversarial feeds can tip uncertain LLM agent decisions from 5% to 100% alignment with the feed while leaving firmly held defaults unchanged, following a dose-response pattern across multiple models and domains.

citing papers explorer

  • Adversarial Feeds Steer LLM Agent Decisions Against Their Defaults cs.AI · 2026-05-30 · conditional · none · ref 14
