Generative poisoning attack method against neural networks

2017 · cs.CR · arXiv 1703.01340

2 Pith papers cite this work. Polarity classification is still indexing.

abstract

Poisoning attacks are recognized as a severe security threat to machine learning algorithms. In many applications, for example, deep neural network (DNN) models collect public data as inputs for re-training, and that input data can be poisoned. Although poisoning attacks against support vector machines (SVMs) have been studied extensively, there is still very limited knowledge about how such attacks can be implemented on neural networks (NNs), especially DNNs. In this work, we first examine the possibility of applying the traditional gradient-based method (named the direct gradient method) to generate poisoned data against NNs by leveraging the gradient of the target model w.r.t. the normal data. We then propose a generative method to accelerate the generation of poisoned data: an auto-encoder (generator) used to generate poisoned data is updated by a reward function of the loss, and the target NN model (discriminator) receives the poisoned data to calculate the loss w.r.t. the normal data. Our experimental results show that the generative method can speed up poisoned data generation by up to 239.38x compared with the direct gradient method, with slightly lower model accuracy degradation. A countermeasure is also designed to detect such poisoning attacks by checking the loss of the target model.
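
The abstract only says that the countermeasure works "by checking the loss of the target model" and gives no further detail. The following is a minimal sketch of one way such a check could look, not the authors' method. It assumes a toy 1-D logistic model, a small trusted hold-out set, and a fixed tolerance; the names `logistic_loss`, `sgd_step`, `accept_batch`, and `tolerance` are all hypothetical.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def logistic_loss(w, b, data):
    """Mean binary cross-entropy of a 1-D logistic model on (x, y) pairs."""
    total = 0.0
    for x, y in data:
        p = min(max(sigmoid(w * x + b), 1e-12), 1.0 - 1e-12)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(data)


def sgd_step(w, b, batch, lr=0.5):
    """One gradient step of the logistic model on a re-training batch."""
    gw = gb = 0.0
    for x, y in batch:
        err = sigmoid(w * x + b) - y
        gw += err * x
        gb += err
    n = len(batch)
    return w - lr * gw / n, b - lr * gb / n


def accept_batch(w, b, batch, trusted, tolerance=0.05):
    """Apply the update only if loss on trusted data does not jump.

    Assumption: a rise in trusted-set loss larger than `tolerance`
    is treated as a sign of poisoning and the update is discarded.
    """
    before = logistic_loss(w, b, trusted)
    new_w, new_b = sgd_step(w, b, batch)
    after = logistic_loss(new_w, new_b, trusted)
    if after - before > tolerance:
        return False, w, b  # reject: keep the old parameters
    return True, new_w, new_b


# Hypothetical data: trusted hold-out set, a clean batch, a label-flipped batch.
trusted = [(-2.0, 0), (-1.0, 0), (1.0, 1), (2.0, 1)]
clean = [(-1.5, 0), (1.5, 1)]
flipped = [(-1.5, 1), (1.5, 0)]

print(accept_batch(1.0, 0.0, clean, trusted)[0])
print(accept_batch(1.0, 0.0, flipped, trusted)[0])
```

In this toy setup the clean batch lowers the trusted-set loss and is accepted, while the label-flipped batch raises it well past the tolerance and is rejected. A real deployment would need a threshold tuned to the normal loss fluctuation of the target model.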

citation-role summary

baseline 1

citation-polarity summary

baseline 1

representative citing papers

Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning

cs.CR · 2017-12-15 · unverdicted · novelty 7.0

Injecting around 50 poisoned samples with a stealthy trigger creates backdoors in deep learning models achieving over 90% attack success under a weak threat model with no model or data knowledge required.

Differentiable Optimization Layers for Guaranteed Fairness in Deep Learning

cs.LG · 2026-05-16 · unverdicted · novelty 6.0

Introduces a fairness layer for deep learning models that guarantees output parity and an online primal-dual algorithm for aggregate fairness guarantees in streaming predictions with small batch sizes.

citing papers explorer

Showing 2 of 2 citing papers.

Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning

cs.CR · 2017-12-15 · unverdicted · none · ref 70

Injecting around 50 poisoned samples with a stealthy trigger creates backdoors in deep learning models achieving over 90% attack success under a weak threat model with no model or data knowledge required.

Differentiable Optimization Layers for Guaranteed Fairness in Deep Learning

cs.LG · 2026-05-16 · unverdicted · none · ref 13 · internal anchor

Introduces a fairness layer for deep learning models that guarantees output parity and an online primal-dual algorithm for aggregate fairness guarantees in streaming predictions with small batch sizes.
