Distractor injection attacks on large reasoning models: Characterization and defense,

· 2025 · arXiv 2510.16259

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction

cs.CL · 2026-06-01 · unverdicted · novelty 7.0

SkillHarm benchmark shows current AI agents are vulnerable to lifecycle-aware skill poisoning with success rates up to 86.3% for fixed-payload attacks and 69.3% for self-mutating attacks.

Overthink-Triggered Slowdown Attacks on LVLM-Based Robotic Systems

cs.CR · 2026-07-01 · unverdicted · novelty 6.0

Adversaries can use crafted scene text to trigger overthinking in LVLM-based robots, producing transferable slowdowns up to 6.96x latency amplification.

citing papers explorer

Showing 2 of 2 citing papers.

SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction cs.CL · 2026-06-01 · unverdicted · none · ref 23
SkillHarm benchmark shows current AI agents are vulnerable to lifecycle-aware skill poisoning with success rates up to 86.3% for fixed-payload attacks and 69.3% for self-mutating attacks.
Overthink-Triggered Slowdown Attacks on LVLM-Based Robotic Systems cs.CR · 2026-07-01 · unverdicted · none · ref 23
Adversaries can use crafted scene text to trigger overthinking in LVLM-based robots, producing transferable slowdowns up to 6.96x latency amplification.

Distractor injection attacks on large reasoning models: Characterization and defense,

fields

years

verdicts

representative citing papers

citing papers explorer