When Not to Classify: Detection of Reverse Engineering Attacks on DNN Image Classifiers

David J. Miller; George Kesidis; Yujia Wang

arxiv: 1811.02658 · v1 · pith:NLOPOBWLnew · submitted 2018-10-31 · 💻 cs.CV · cs.LG· stat.ML

When Not to Classify: Detection of Reverse Engineering Attacks on DNN Image Classifiers

Yujia Wang , David J. Miller , George Kesidis This is my paper

classification 💻 cs.CV cs.LGstat.ML

keywords attacksclassifierevasiontest-timedetectdetectioneffectiveengineering

0 comments

read the original abstract

This paper addresses detection of a reverse engineering (RE) attack targeting a deep neural network (DNN) image classifier; by querying, RE's aim is to discover the classifier's decision rule. RE can enable test-time evasion attacks, which require knowledge of the classifier. Recently, we proposed a quite effective approach (ADA) to detect test-time evasion attacks. In this paper, we extend ADA to detect RE attacks (ADA-RE). We demonstrate our method is successful in detecting "stealthy" RE attacks before they learn enough to launch effective test-time evasion attacks.

This paper has not been read by Pith yet.

When Not to Classify: Detection of Reverse Engineering Attacks on DNN Image Classifiers

discussion (0)