arXiv preprint arXiv:2505.21425

Guard: Dual-agent based backdoor defense on chain-of-thought in neural code generation · 2025 · arXiv 2505.21425

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

MirageBackdoor: A Stealthy Attack that Induces Think-Well-Answer-Wrong Reasoning

cs.CR · 2026-04-08 · unverdicted · novelty 8.0

MirageBackdoor is the first backdoor attack that preserves clean chain-of-thought reasoning in LLMs while steering the final answer to a specific incorrect target under a trigger.

citing papers explorer

MirageBackdoor: A Stealthy Attack that Induces Think-Well-Answer-Wrong Reasoning · cs.CR · 2026-04-08 · unverdicted · none · ref 2
