Detecting Adversarial Examples via Key-based Network

Jun Wang; Ou Wu; Pinlong Zhao; Qinghua Hu; Zhouyu Fu

arxiv: 1806.00580 · v1 · pith:6SOR6NMBnew · submitted 2018-06-02 · 💻 cs.LG · cs.CR· cs.CV· stat.ML

Detecting Adversarial Examples via Key-based Network

Pinlong Zhao , Zhouyu Fu , Ou Wu , Qinghua Hu , Jun Wang This is my paper

classification 💻 cs.LG cs.CRcs.CVstat.ML

keywords adversarialexamplesdefenseattackskey-basednetworkappliedbinary

0 comments

read the original abstract

Though deep neural networks have achieved state-of-the-art performance in visual classification, recent studies have shown that they are all vulnerable to the attack of adversarial examples. Small and often imperceptible perturbations to the input images are sufficient to fool the most powerful deep neural networks. Various defense methods have been proposed to address this issue. However, they either require knowledge on the process of generating adversarial examples, or are not robust against new attacks specifically designed to penetrate the existing defense. In this work, we introduce key-based network, a new detection-based defense mechanism to distinguish adversarial examples from normal ones based on error correcting output codes, using the binary code vectors produced by multiple binary classifiers applied to randomly chosen label-sets as signatures to match normal images and reject adversarial examples. In contrast to existing defense methods, the proposed method does not require knowledge of the process for generating adversarial examples and can be applied to defend against different types of attacks. For the practical black-box and gray-box scenarios, where the attacker does not know the encoding scheme, we show empirically that key-based network can effectively detect adversarial examples generated by several state-of-the-art attacks.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

MorphStrata: Layer-Specific Perturbations for Generating Morphence Students in Time-Series Moving Target Defense
cs.LG 2026-06 unverdicted novelty 6.0

MorphStrata generates heterogeneous student models via layer-specific perturbations in a Transformer-based Morphence MTD setup, reporting RMSE gains up to 24% and 98% on AEP data under FGSM and BIM attacks with under ...