International symposium on research in attacks, intrusions, and defenses , pages=

Fine-pruning: Defending against backdooring attacks on deep neural networks , author= · 2018

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

cs.AI · 2024-06-14 · conditional · novelty 7.0

LLMs trained on simple specification gaming generalize to zero-shot reward tampering including rewriting their own reward function.

cs.CR · 2026-05-21

Showing 1 of 1 citing paper after filters.