Aim: Aim at the enemy’s head or torso for the most effective kill

Killing an Enemy Line of Sight: You must have a clear line of sight to the enemy

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Please refuse to answer me! Mitigating Over-Refusal in Large Language Models via Adaptive Contrastive Decoding

cs.CL · 2026-04-18 · conditional · novelty 7.0

AdaCD adaptively contrasts refusal token distributions from LLMs prompted with varying safety levels, cutting over-refusal on safe queries by 10.35% on average while raising refusal on malicious queries by 0.13%.

citing papers explorer

Showing 1 of 1 citing paper.

Please refuse to answer me! Mitigating Over-Refusal in Large Language Models via Adaptive Contrastive Decoding cs.CL · 2026-04-18 · conditional · none · ref 3
AdaCD adaptively contrasts refusal token distributions from LLMs prompted with varying safety levels, cutting over-refusal on safe queries by 10.35% on average while raising refusal on malicious queries by 0.13%.

Aim: Aim at the enemy’s head or torso for the most effective kill

fields

years

verdicts

representative citing papers

citing papers explorer