Adversarially robust transfer learning

Ali Shafahi; Amin Ghiasi; Chen Zhu; Christoph Studer; David Jacobs; Parsa Saadatpanah; Tom Goldstein

arXiv: 1905.08232 · v2 · pith:EOIUSTR4new · submitted 2019-05-20 · cs.LG · cs.CR · cs.CV · stat.ML



keywords: robust, learning, network, produce, robustness, transfer, adversarially, data


Transfer learning, in which a network is trained on one task and re-purposed on another, is often used to produce neural network classifiers when data is scarce or full-scale training is too costly. When the goal is to produce a model that is not only accurate but also adversarially robust, data scarcity and computational limitations become even more cumbersome. We consider robust transfer learning, in which we transfer not only performance but also robustness from a source model to a target domain. We start by observing that robust networks contain robust feature extractors. By training classifiers on top of these feature extractors, we produce new models that inherit the robustness of their parent networks. We then consider the case of fine-tuning a network by re-training end-to-end in the target domain. When using lifelong learning strategies, this process preserves the robustness of the source network while achieving high accuracy. By using such strategies, it is possible to produce accurate and robust models with little data, and without the cost of adversarial training. Additionally, we can improve the generalization of adversarially trained models, while maintaining their robustness.
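
The first idea in the abstract, training a new classifier on top of a feature extractor taken from a source network, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the weights in `FROZEN_WEIGHTS`, the toy data, and every function name are hypothetical stand-ins, and a real setup would reuse the early layers of an adversarially trained deep network. The sketch only shows the mechanics of keeping the extractor fixed while fitting a new head; it does not measure adversarial robustness.

```python
import math

# Hypothetical stand-in for the feature-extracting layers of a robust source
# network. These weights are treated as frozen: training never modifies them.
FROZEN_WEIGHTS = [[1.0, -1.0], [0.5, 0.5]]


def extract_features(x):
    """Apply the frozen layer (linear map followed by ReLU)."""
    return [max(0.0, sum(w * xi for w, xi in zip(row, x)))
            for row in FROZEN_WEIGHTS]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_head(data, labels, epochs=200, lr=0.5):
    """Fit a logistic-regression head on the frozen features with plain SGD.

    Only head_w and head_b are updated; the extractor stays untouched.
    """
    head_w = [0.0] * len(FROZEN_WEIGHTS)
    head_b = 0.0
    for _ in range(epochs):
        for x, y in zip(data, labels):
            feats = extract_features(x)
            logit = sum(w * f for w, f in zip(head_w, feats)) + head_b
            err = sigmoid(logit) - y  # d(cross-entropy)/d(logit)
            head_w = [w - lr * err * f for w, f in zip(head_w, feats)]
            head_b -= lr * err
    return head_w, head_b


def predict(x, head_w, head_b):
    """Classify x with the frozen extractor plus the trained head."""
    feats = extract_features(x)
    logit = sum(w * f for w, f in zip(head_w, feats)) + head_b
    return 1 if logit > 0 else 0


# Toy target task (an assumption for illustration): label 1 when x[0] > x[1].
DATA = [(1.0, 0.0), (0.0, 1.0), (0.8, -0.2), (-0.2, 0.8),
        (0.5, -0.5), (-0.5, 0.5), (1.0, 0.2), (0.2, 1.0)]
LABELS = [1, 0, 1, 0, 1, 0, 1, 0]

head_w, head_b = train_head(DATA, LABELS)
```

Calling `predict(x, head_w, head_b)` on a target-domain input runs it through the unchanged extractor and the newly trained head. Because only the head is trained, the cost is that of fitting a small linear model rather than retraining the whole network.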


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Sample-wise Adaptive Weighting for Transfer Consistency in Adversarial Distillation
cs.CV · 2025-12 · conditional · novelty 6.0

SAAD adaptively weights adversarial training samples by their transferability to the teacher, yielding higher AutoAttack robustness than prior distillation methods on CIFAR and Tiny-ImageNet without extra compute.