Robust Anomaly Detection and Backdoor Attack Detection Via Differential Privacy

Dawn Song; Min Du; Ruoxi Jia

arxiv: 1911.07116 · v1 · pith:YZKSPKY4new · submitted 2019-11-16 · 💻 cs.LG · cs.AI· cs.CR

Robust Anomaly Detection and Backdoor Attack Detection Via Differential Privacy

Min Du , Ruoxi Jia , Dawn Song This is my paper

classification 💻 cs.LG cs.AIcs.CR

keywords detectiondatasetdifferentialprivacysamplesbackdoornoveltyoutlier

0 comments

read the original abstract

Outlier detection and novelty detection are two important topics for anomaly detection. Suppose the majority of a dataset are drawn from a certain distribution, outlier detection and novelty detection both aim to detect data samples that do not fit the distribution. Outliers refer to data samples within this dataset, while novelties refer to new samples. In the meantime, backdoor poisoning attacks for machine learning models are achieved through injecting poisoning samples into the training dataset, which could be regarded as "outliers" that are intentionally added by attackers. Differential privacy has been proposed to avoid leaking any individual's information, when aggregated analysis is performed on a given dataset. It is typically achieved by adding random noise, either directly to the input dataset, or to intermediate results of the aggregation mechanism. In this paper, we demonstrate that applying differential privacy can improve the utility of outlier detection and novelty detection, with an extension to detect poisoning samples in backdoor attacks. We first present a theoretical analysis on how differential privacy helps with the detection, and then conduct extensive experiments to validate the effectiveness of differential privacy in improving outlier detection, novelty detection, and backdoor attack detection.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Pmeta-TLA: Backdoor Attacks for Speech Classification Models via Meta-Learning with Timbre Leakage Attack
cs.CR 2026-07 unverdicted novelty 5.0

Pmeta-TLA combines a frame-level timbre leakage trigger with meta-learning and PCGrad to inject multiple backdoors into speech models in one training run, claiming better attack success, stealth, and lower cost than b...