Adversarial Defense of Image Classification Using a Variational Auto-Encoder

Henry Pfister; Yi Luo

arxiv: 1812.02891 · v1 · pith:MMEOVNUOnew · submitted 2018-12-07 · 💻 cs.CV

keywords: adversarial, attacks, defense, auto-encoder, classification, image, potential, variational

Deep neural networks are known to be vulnerable to adversarial attacks. This exposes them to potential exploits in security-sensitive applications and highlights their lack of robustness. This paper uses a variational auto-encoder (VAE) to defend against adversarial attacks for image classification tasks. This VAE defense has a few nice properties: (1) it is quite flexible and its use of randomness makes it harder to attack; (2) it can learn disentangled representations that prevent blurry reconstruction; and (3) it uses a patch-wise strategy that does not require retraining for images of different sizes. For moderate to severe attacks, this system outperforms or closely matches the performance of JPEG compression with the best quality parameter. It also has more flexibility and potential for improvement via training.
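
The patch-wise idea in property (3) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the edge-padding choice, the non-overlapping tiling, and the `mean_fill` stand-in (used here in place of a trained VAE encoder/decoder) are all assumptions made for illustration. The point it shows is only that a model operating on fixed-size patches can be applied to an image of any size without retraining.

```python
from typing import Callable, List

Image = List[List[float]]  # grayscale image stored as rows of pixel values


def patchwise_apply(image: Image, patch: int,
                    reconstruct: Callable[[Image], Image]) -> Image:
    """Apply `reconstruct` to non-overlapping patch x patch tiles of `image`.

    The image is edge-padded up to a multiple of `patch`, processed tile by
    tile, and cropped back to its original size.
    """
    height, width = len(image), len(image[0])
    padded_h = -(-height // patch) * patch  # round up to a multiple of patch
    padded_w = -(-width // patch) * patch

    # Edge padding: repeat the last row / column (an assumed choice).
    padded = [
        [image[min(r, height - 1)][min(c, width - 1)] for c in range(padded_w)]
        for r in range(padded_h)
    ]

    output = [[0.0] * padded_w for _ in range(padded_h)]
    for top in range(0, padded_h, patch):
        for left in range(0, padded_w, patch):
            tile = [row[left:left + patch] for row in padded[top:top + patch]]
            restored = reconstruct(tile)  # a trained VAE would go here
            for dr in range(patch):
                output[top + dr][left:left + patch] = restored[dr]

    # Crop away the padding.
    return [row[:width] for row in output[:height]]


def mean_fill(tile: Image) -> Image:
    """Hypothetical stand-in for a lossy reconstruction: fill with tile mean."""
    mean = sum(sum(row) for row in tile) / (len(tile) * len(tile[0]))
    return [[mean] * len(tile[0]) for _ in tile]


if __name__ == "__main__":
    image = [[float(r * 7 + c) for c in range(7)] for r in range(5)]
    defended = patchwise_apply(image, patch=4, reconstruct=mean_fill)
    print(len(defended), len(defended[0]))  # same size as the input image
```

In an actual defense, `reconstruct` would encode each patch, sample a latent code, and decode it, and the reconstructed image would then be passed to the classifier.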

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Variational Autoencoder-Based Black-Box Adversarial Attack on Collaborative DNN Inference
cs.CR 2025-08 unverdicted novelty 6.0

AdVAR-DNN employs a variational autoencoder to create untraceable adversarial samples that compromise black-box collaborative DNN inference by exploiting model partitioning information exchange, achieving high misclas...