Reasoning models are more easily gaslighted than you think,

Bin Zhu, Hailong Yin, Jingjing Chen, Yu-Gang Jiang, “Reasoning models are more easily gaslighted than you think,” arXiv preprint arXiv:2506 · 2025 · arXiv 2506.09677

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

Benchmarking Gaslighting Attacks Against Speech Large Language Models

cs.CL · 2025-09-24 · unverdicted · novelty 6.0

Gaslighting attacks using Anger, Cognitive Disruption, Sarcasm, Implicit, and Professional Negation strategies cause a 24.3% average accuracy drop in Speech LLMs while also triggering behavioral changes like apologies and refusals.

citing papers explorer

Showing 1 of 1 citing paper.

Benchmarking Gaslighting Attacks Against Speech Large Language Models cs.CL · 2025-09-24 · unverdicted · none · ref 11
Gaslighting attacks using Anger, Cognitive Disruption, Sarcasm, Implicit, and Professional Negation strategies cause a 24.3% average accuracy drop in Speech LLMs while also triggering behavioral changes like apologies and refusals.

Reasoning models are more easily gaslighted than you think,

fields

years

verdicts

representative citing papers

citing papers explorer