Defending against Patch-Based and Texture-Based Adversarial Attacks with Spectral Decomposition
Pith reviewed 2026-05-10 15:40 UTC · model grok-4.3
The pith
Spectral decomposition via wavelets plus adversarial training defends image classifiers from patch and texture attacks even when the attacker adapts to the defense.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
ASD uses the multi-resolution and localization properties of the Discrete Wavelet Transform to capture both high-frequency fine-grained perturbations and low-frequency spatially pervasive perturbations from patch-based and texture-based attacks. When integrated with an off-the-shelf adversarial training model, this spectral analysis yields a comprehensive defense that achieves state-of-the-art robustness against strong adaptive adversaries specifically designed to counter the method.
What carries the argument
Adversarial Spectrum Defense (ASD), which applies Discrete Wavelet Transform (DWT) spectral decomposition to input images to isolate adversarial perturbations across frequency scales before feeding the result to an adversarially trained classifier.
If this is right
- Classifiers gain improved resistance to physically realizable attacks in applications such as person detection for surveillance and autonomous systems.
- The single framework addresses both localized high-frequency perturbations and spatially extensive low-frequency changes.
- Performance against adaptive adversaries exceeds that of prior defense methods.
- Off-the-shelf adversarially trained models can be augmented with spectral decomposition without requiring new network architectures.
Where Pith is reading between the lines
- The localization property of the wavelet decomposition could be used to identify the spatial location of a patch attack within an image.
- Similar frequency-domain checks might help against other input-manipulation threats where perturbations have characteristic scale signatures.
- Combining spectral analysis with training-based robustness suggests that hybrid defenses can compound protection when each component targets different aspects of the attack.
Load-bearing premise
The spectral signatures of perturbations from patch and texture attacks differ enough from natural image content that DWT decomposition can separate them reliably, and this separation survives even when the attacker knows the defense and optimizes against it.
What would settle it
An adaptive attack that produces adversarial images whose DWT coefficients closely match those of clean images yet still cause the defended model to misclassify would show that the spectral separation does not hold.
Figures
read the original abstract
Adversarial examples present significant challenges to the security of Deep Neural Network (DNN) applications. Specifically, there are patch-based and texture-based attacks that are usually used to craft physical-world adversarial examples, posing real threats to security-critical applications such as person detection in surveillance and autonomous systems, because those attacks are physically realizable. Existing defense mechanisms face challenges in the adaptive attack setting, i.e., the attacks are specifically designed against them. In this paper, we propose Adversarial Spectrum Defense (ASD), a defense mechanism that leverages spectral decomposition via Discrete Wavelet Transform (DWT) to analyze adversarial patterns across multiple frequency scales. The multi-resolution and localization capability of DWT enables ASD to capture both high-frequency (fine-grained) and low-frequency (spatially pervasive) perturbations. By integrating this spectral analysis with the off-the-shelf Adversarial Training (AT) model, ASD provides a comprehensive defense strategy against both patch-based and texture-based adversarial attacks. Extensive experiments demonstrate that ASD+AT achieved state-of-the-art (SOTA) performance against various attacks, outperforming the APs of previous defense methods by 21.73%, in the face of strong adaptive adversaries specifically designed against ASD. Code available at https://github.com/weiz0823/adv-spectral-defense .
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes Adversarial Spectrum Defense (ASD), which applies Discrete Wavelet Transform (DWT) to decompose images across multiple frequency scales and localize both high-frequency fine-grained and low-frequency pervasive perturbations from patch-based and texture-based adversarial attacks. ASD is integrated with off-the-shelf adversarial training (AT) to form a combined defense, with the abstract claiming state-of-the-art robustness that outperforms prior methods by 21.73% under strong adaptive adversaries specifically designed against ASD. Code is made available.
Significance. If the empirical results hold under properly constructed adaptive attacks that optimize against the full ASD+AT pipeline, the method could supply a practical, multi-resolution preprocessing step for defending physically realizable attacks in applications such as surveillance and autonomous systems. The explicit release of code supports reproducibility, which is a clear strength for an empirical defense paper.
major comments (2)
- [Abstract] Abstract: The headline claim of SOTA performance with a 21.73% gain over prior defenses is stated without any accompanying experimental details on datasets, attack implementations (including whether back-propagation occurs through DWT), evaluation metrics, baselines, or ablation studies. This omission renders the central empirical claim unverifiable and load-bearing for the paper's contribution.
- [Abstract] Abstract: The description of the adaptive adversaries does not specify the threat model (e.g., whether the attacker has white-box access to the DWT decomposition, whether DWT parameters are frozen, or how the combined loss is formulated). Without this, it is impossible to confirm that the reported robustness is not an artifact of an insufficiently strong adaptive attack that fails to target the spectral separation directly.
minor comments (1)
- [Abstract] The abstract refers to 'extensive experiments' yet supplies only a single aggregate percentage; a concise results summary or table reference would improve clarity for readers.
Simulated Author's Rebuttal
We thank the referee for the constructive comments on our manuscript. We agree that the abstract requires expansion to better support the central claims and will revise it in the next version. Below we respond point by point to the major comments.
read point-by-point responses
-
Referee: [Abstract] Abstract: The headline claim of SOTA performance with a 21.73% gain over prior defenses is stated without any accompanying experimental details on datasets, attack implementations (including whether back-propagation occurs through DWT), evaluation metrics, baselines, or ablation studies. This omission renders the central empirical claim unverifiable and load-bearing for the paper's contribution.
Authors: We acknowledge that the abstract, as a concise summary, omits the supporting experimental details, which are instead presented in the Experiments and Ablation sections of the manuscript. To address the concern directly, we will revise the abstract to include a brief summary of the setup: the datasets used, the evaluation metric (AP), that attacks are implemented with back-propagation through the differentiable DWT, and reference to the baselines and ablations. This change will make the SOTA claim more verifiable from the abstract itself while remaining within length constraints. revision: yes
-
Referee: [Abstract] Abstract: The description of the adaptive adversaries does not specify the threat model (e.g., whether the attacker has white-box access to the DWT decomposition, whether DWT parameters are frozen, or how the combined loss is formulated). Without this, it is impossible to confirm that the reported robustness is not an artifact of an insufficiently strong adaptive attack that fails to target the spectral separation directly.
Authors: We agree that explicit specification of the threat model strengthens the paper. In the work, adaptive attacks are white-box with full access to the ASD+AT pipeline, gradients flow through the DWT (which is differentiable), DWT parameters are fixed and non-learnable, and the loss is the standard adversarial objective on the defended model. We will revise the abstract to state: 'under white-box adaptive adversaries optimized against the full ASD pipeline with back-propagation through DWT.' This clarification confirms that attacks target the spectral decomposition and will be incorporated in the revised manuscript. revision: yes
Circularity Check
No circularity: empirical integration of DWT with standard AT
full rationale
The paper presents ASD as an empirical defense that applies off-the-shelf Discrete Wavelet Transform for multi-resolution spectral analysis and combines it with standard Adversarial Training. No mathematical derivations, equations, fitted parameters renamed as predictions, or self-citation chains appear in the provided text. The SOTA claim rests on experimental results under adaptive attacks rather than any self-referential reduction. The method is self-contained against external benchmarks with no load-bearing steps that collapse to inputs by construction.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Discrete Wavelet Transform provides multi-resolution localization that captures both high-frequency fine-grained and low-frequency spatially pervasive adversarial perturbations.
invented entities (1)
-
Adversarial Spectrum Defense (ASD)
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Explaining and harnessing adversarial examples,
I. J. Goodfellow, J. Shlens, and C. Szegedy, “Explaining and harnessing adversarial examples,” inInternational Conference on Learning Representations, 2015. 1, 2, 3
work page 2015
-
[2]
Towards deep learning models resistant to adversarial attacks,
A. Madry, A. Makelov, L. Schmidt, D. Tsipras, and A. Vladu, “Towards deep learning models resistant to adversarial attacks,” inInternational Conference on Learning Representations, 2018. 1, 2, 3, 6, 7
work page 2018
-
[3]
T. B. Brown, D. Mané, A. Roy, M. Abadi, and J. Gilmer, “Adversarial patch,” inNeurIPS 2017 Workshop on Machine Learning and Computer Security, 2017. 2, 11, 12
work page 2017
-
[4]
Adversarial examples in the physical world,
A. Kurakin, I. J. Goodfellow, and S. Bengio, “Adversarial examples in the physical world,” inArtificial Intelligence Safety and Security. Chapman and Hall/CRC, 2018, pp. 99–112. 1, 2
work page 2018
-
[5]
Fooling automated surveillance cameras: adversarial patches to attack person detection,
S. Thys, W. Van Ranst, and T. Goedemé, “Fooling automated surveillance cameras: adversarial patches to attack person detection,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019, pp. 1–7. 1, 2, 5, 7, 9
work page 2019
-
[6]
Phys- ically realizable natural-looking clothing textures evade person detectors via 3d modeling,
Z. Hu, W. Chu, X. Zhu, H. Zhang, B. Zhang, and X. Hu, “Phys- ically realizable natural-looking clothing textures evade person detectors via 3d modeling,” inProceedings of the IEEE/CVF 13 Conference on Computer Vision and Pattern Recognition, 2023, pp. 16 975–16 984. 1, 2, 7, 8, 9
work page 2023
-
[7]
Towards evaluating the robustness of neural networks,
N. Carlini and D. Wagner, “Towards evaluating the robustness of neural networks,” in2017 IEEE Symposium on Security and Privacy (SP). IEEE, 2017, pp. 39–57. 1, 2, 3
work page 2017
-
[8]
Adversarial t-shirt! evading person detectors in a physical world,
K. Xu, G. Zhang, S. Liu, Q. Fan, M. Sun, H. Chen, P.-Y . Chen, Y . Wang, and X. Lin, “Adversarial t-shirt! evading person detectors in a physical world,” inEuropean Conference on Computer Vision. Springer, 2020, pp. 665–681. 1, 2, 7, 9
work page 2020
-
[9]
Making an invisibility cloak: Real world adversarial attacks on object detec- tors,
Z. Wu, S.-N. Lim, L. S. Davis, and T. Goldstein, “Making an invisibility cloak: Real world adversarial attacks on object detec- tors,” inEuropean Conference on Computer Vision. Springer, 2020, pp. 1–17. 2
work page 2020
-
[10]
Naturalistic physical adversarial patch for object detectors,
Y .-C.-T. Hu, B.-H. Kung, D. S. Tan, J.-C. Chen, K.-L. Hua, and W.-H. Cheng, “Naturalistic physical adversarial patch for object detectors,” inProceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 7848–7857. 2, 10
work page 2021
-
[11]
Adver- sarial texture for fooling person detectors in the physical world,
Z. Hu, S. Huang, X. Zhu, F. Sun, B. Zhang, and X. Hu, “Adver- sarial texture for fooling person detectors in the physical world,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 13 307–13 316. 1, 2, 5, 7, 8, 9
work page 2022
-
[12]
Local gradients smoothing: Defense against localized adversarial attacks,
M. Naseer, S. Khan, and F. Porikli, “Local gradients smoothing: Defense against localized adversarial attacks,” inIEEE Winter Conference on Applications of Computer Vision, 2019, pp. 1300–
work page 2019
-
[13]
Defending physical adversarial attack on object detection via adversarial patch-feature energy,
T. Kim, Y . Yu, and Y . M. Ro, “Defending physical adversarial attack on object detection via adversarial patch-feature energy,” inProceedings of the 30th ACM International Conference on Multimedia, 2022, pp. 1905–1913. 3, 7, 9
work page 2022
-
[14]
J. Liu, A. Levine, C. P. Lau, R. Chellappa, and S. Feizi, “Segment and complete: Defending object detectors against adversarial patch attacks with robust patch detection,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 14 973–14 982. 3, 7, 9
work page 2022
-
[15]
Napguard: To- wards detecting naturalistic adversarial patches,
S. Wu, J. Wang, J. Zhao, Y . Wang, and X. Liu, “Napguard: To- wards detecting naturalistic adversarial patches,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 24 367–24 376. 1, 3, 7, 9
work page 2024
-
[16]
Jedi: Entropy-based localization and removal of adversarial patches,
B. Tarchoun, A. Ben Khalifa, M. A. Mahjoub, N. Abu-Ghazaleh, and I. Alouani, “Jedi: Entropy-based localization and removal of adversarial patches,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 4087–
work page 2023
-
[17]
Obfuscated gradients give a false sense of security: Circumventing defenses to adversarial examples,
A. Athalye, N. Carlini, and D. Wagner, “Obfuscated gradients give a false sense of security: Circumventing defenses to adversarial examples,” inInternational Conference on Machine Learning. PMLR, 2018, pp. 274–283. 1, 7, 8
work page 2018
-
[18]
On adaptive attacks to adversarial example defenses,
F. Tramer, N. Carlini, W. Brendel, and A. Madry, “On adaptive attacks to adversarial example defenses,”Advances in Neural Information Processing Systems, vol. 33, pp. 1633–1645, 2020. 1, 7
work page 2020
-
[19]
Continuous and discrete wavelet transforms,
C. E. Heil and D. F. Walnut, “Continuous and discrete wavelet transforms,”SIAM Review, vol. 31, no. 4, pp. 628–666, 1989. 2, 3
work page 1989
-
[20]
Human detection from images and videos: A survey,
D. T. Nguyen, W. Li, and P. O. Ogunbona, “Human detection from images and videos: A survey,”Pattern Recognition, vol. 51, pp. 148–175, 2016. 2
work page 2016
-
[21]
Physical adversarial attack meets computer vision: A decade survey,
H. Wei, H. Tang, X. Jia, Z. Wang, H. Yu, Z. Li, S. Satoh, L. Van Gool, and Z. Wang, “Physical adversarial attack meets computer vision: A decade survey,”IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024. 2
work page 2024
-
[22]
Numbod: A spatial-frequency fusion attack against object detectors,
Z. Zhou, B. Li, Y . Song, Z. Yu, S. Hu, W. Wan, L. Y . Zhang, D. Yao, and H. Jin, “Numbod: A spatial-frequency fusion attack against object detectors,” inProceedings of the AAAI Conference on Artificial Intelligence, vol. 39, no. 1, 2025, pp. 1201–1209. 2, 7, 9
work page 2025
-
[23]
Toward generic and controllable attacks against object detection,
G. Li, Y . Xu, J. Ding, and G.-S. Xia, “Toward generic and controllable attacks against object detection,”IEEE Transactions on Geoscience and Remote Sensing, vol. 62, pp. 1–12, 2024. 2, 7, 9
work page 2024
-
[24]
Adversarial yolo: Defense human detection patch attacks via detecting adversarial patches,
N. Ji, Y . Feng, H. Xie, X. Xiang, and N. Liu, “Adversarial yolo: Defense human detection patch attacks via detecting adversarial patches,”arXiv preprint arXiv:2103.08860, 2021. 3
-
[25]
Adversarial patch detection and mitigation by detecting high entropy regions,
N. Bunzel, A. Siwakoti, and G. Klause, “Adversarial patch detection and mitigation by detecting high entropy regions,” in2023 53rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W). IEEE, 2023, pp. 124–128
work page 2023
-
[26]
Patchzero: Defending against adversarial patch attacks by detecting and zeroing the patch,
K. Xu, Y . Xiao, Z. Zheng, K. Cai, and R. Nevatia, “Patchzero: Defending against adversarial patch attacks by detecting and zeroing the patch,” inIEEE Winter Conference on Applications of Computer Vision, 2023, pp. 4632–4641
work page 2023
-
[27]
Defending from physically-realizable adversarial attacks through internal over-activation analysis,
G. Rossolini, F. Nesti, F. Brau, A. Biondi, and G. Buttazzo, “Defending from physically-realizable adversarial attacks through internal over-activation analysis,” inProceedings of the AAAI Conference on Artificial Intelligence, vol. 37, no. 12, 2023, pp. 15 064–15 072. 3
work page 2023
-
[28]
Pad: Patch-agnostic defense against adversarial patch attacks,
L. Jing, R. Wang, W. Ren, X. Dong, and C. Zou, “Pad: Patch-agnostic defense against adversarial patch attacks,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 24 472–24 481. 3
work page 2024
-
[29]
Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks,
F. Croce and M. Hein, “Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks,” inInternational Conference on Machine Learning. PMLR, 2020, pp. 2206–2216. 3, 7
work page 2020
-
[30]
On the importance of backbone to the adversarial robustness of object detectors,
X. Li, H. Chen, and X. Hu, “On the importance of backbone to the adversarial robustness of object detectors,”IEEE Transactions on Information Forensics and Security, 2025. 3, 6, 7, 9
work page 2025
-
[31]
Feature distillation: Dnn-oriented jpeg compression against ad- versarial examples,
Z. Liu, Q. Liu, T. Liu, N. Xu, X. Lin, Y . Wang, and W. Wen, “Feature distillation: Dnn-oriented jpeg compression against ad- versarial examples,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, 2019, Conference Proceedings, pp. 860–868. 3, 7, 9, 11
work page 2019
-
[32]
G. Chen, P. Peng, L. Ma, J. Li, L. Du, and Y . Tian, “Amplitude- phase recombination: Rethinking robustness of convolutional neural networks in frequency domain,” inProceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 458–467
work page 2021
-
[33]
Y . Chen, Q. Ren, and J. Yan, “Rethinking and improving robustness of convolutional neural networks: a shapley value- based approach in frequency domain,”Advances in Neural Information Processing Systems, vol. 35, pp. 324–337, 2022. 3
work page 2022
-
[34]
Improving adversarial robustness of masked autoencoders via test-time frequency-domain prompting,
Q. Huang, X. Dong, D. Chen, Y . Chen, L. Yuan, G. Hua, W. Zhang, and N. Yu, “Improving adversarial robustness of masked autoencoders via test-time frequency-domain prompting,” inProceedings of the IEEE/CVF International Conference on Computer Vision, 2023, pp. 1600–1610
work page 2023
-
[35]
Adversarial example detection by predicting adversarial noise in the frequency domain,
S. Jung, M. Chung, and Y .-G. Shin, “Adversarial example detection by predicting adversarial noise in the frequency domain,”Multimedia Tools and Applications, vol. 82, no. 16, pp. 25 235–25 251, 2023. 3
work page 2023
-
[36]
Lpf- defense: 3d adversarial defense based on frequency analysis,
H. Naderi, K. Noorbakhsh, A. Etemadi, and S. Kasaei, “Lpf- defense: 3d adversarial defense based on frequency analysis,” PLOS One, vol. 18, no. 2, p. e0271388, 2023
work page 2023
-
[37]
Adversarial robustness of convolutional models learned in the frequency domain,
S. Chaudhury and T. Yamasaki, “Adversarial robustness of convolutional models learned in the frequency domain,” in2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2024, pp. 7455–7459. 3
work page 2024
-
[38]
Image transformation-based defense against adversarial perturbation on deep learning models,
A. Agarwal, R. Singh, M. Vatsa, and N. Ratha, “Image transformation-based defense against adversarial perturbation on deep learning models,”IEEE Transactions on Dependable and Secure Computing, vol. 18, no. 5, pp. 2106–2121, 2020
work page 2020
-
[39]
Defense against adversarial examples based on wavelet domain analysis,
A. Sarvar and M. Amirmazlaghani, “Defense against adversarial examples based on wavelet domain analysis,”Applied Intelli- gence, vol. 53, no. 1, pp. 423–439, 2023
work page 2023
-
[40]
Adviris: a hybrid approach to detecting adversarial iris examples using wavelet transform,
K. Meenakshi and G. Maragatham, “Adviris: a hybrid approach to detecting adversarial iris examples using wavelet transform,” 14 International Journal of Speech Technology, vol. 25, no. 2, pp. 435–441, 2022. 3
work page 2022
-
[41]
The haar-wavelet transform in digital image processing: its status and achievements,
P. Porwik and A. Lisowska, “The haar-wavelet transform in digital image processing: its status and achievements,”Machine Graphics and Vision, vol. 13, no. 1/2, pp. 79–98, 2004. 4
work page 2004
-
[42]
Faster r-cnn: Towards real-time object detection with region proposal networks,
S. Ren, K. He, R. Girshick, and J. Sun, “Faster r-cnn: Towards real-time object detection with region proposal networks,”Ad- vances in Neural Information Processing Systems, vol. 28, 2015. 4, 6, 9, 10
work page 2015
-
[43]
Histograms of oriented gradients for human detection,
N. Dalal and B. Triggs, “Histograms of oriented gradients for human detection,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, vol. 1. IEEE, 2005, pp. 886–893. 4, 7, 8
work page 2005
-
[44]
Fcos: Fully convolutional one-stage object detection,
Z. Tian, C. Shen, H. Chen, and T. He, “Fcos: Fully convolutional one-stage object detection,” inProceedings of the IEEE/CVF International Conference on Computer Vision, 2019, pp. 9627–
work page 2019
-
[45]
G. Jocher, A. Chaurasia, and J. Qiu, “Ultralytics yolov8,” 2023. [Online]. Available: https://github.com/ultralytics/ultralytics 6, 9
work page 2023
-
[46]
Microsoft coco: Common objects in context,
T.-Y . Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollár, and C. L. Zitnick, “Microsoft coco: Common objects in context,” inEuropean Conference on Computer Vision. Springer, 2014, pp. 740–755. 7, 8
work page 2014
-
[47]
Information distribution based defense against physical attacks on object detection,
G. Zhou, H. Gao, P. Chen, J. Liu, J. Dai, J. Han, and R. Li, “Information distribution based defense against physical attacks on object detection,” inIEEE International Conference on Multimedia and Expo Workshops, 2020, pp. 1–6. 9
work page 2020
-
[48]
Deep residual learning for image recognition,
K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2016, pp. 770–778. 12
work page 2016
-
[49]
Imagenet large scale visual recognition challenge,
O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernsteinet al., “Imagenet large scale visual recognition challenge,”International Journal of Computer Vision, vol. 115, pp. 211–252, 2015. 11, 12
work page 2015
-
[50]
Rf-detr: neural architecture search for real-time detection transformers,
I. Robinson, P. Robicheaux, M. Popov, D. Ramanan, and N. Peri, “Rf-detr: neural architecture search for real-time detection transformers,” inInternational Conference on Learning Representations, 2026. 11
work page 2026
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.