Metamorphic Detection of Adversarial Examples in Deep Learning Models With Affine Transformations
Pith reviewed 2026-05-24 23:43 UTC · model grok-4.3
The pith
Distance ratio preserving affine transformations detect adversarial examples by comparing deep learning model outputs on original and transformed images.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By applying metamorphic relations based on distance ratio preserving affine image transformations which compare the behavior of the original and transformed image, the proposed approach can determine whether or not the input image is adversarial with a high degree of accuracy.
What carries the argument
Metamorphic relations using distance ratio preserving affine image transformations to compare model predictions on original versus transformed inputs.
If this is right
- The method can identify adversarial manipulations that are imperceptible to humans.
- It provides a way to guard against attacks in safety-critical industries such as self-driving cars and face recognition.
- Detection works by checking consistency of model behavior under the transformations.
- The approach is automatic and does not require knowledge of the attack details.
Where Pith is reading between the lines
- This detection strategy might extend to other data types like audio or text if similar transformations can be defined.
- Combining it with existing defenses could improve overall robustness against adversarial attacks.
- The accuracy might vary depending on the specific deep learning model architecture used.
Load-bearing premise
Distance ratio preserving affine transformations produce consistent model behavior on non-adversarial images but inconsistent behavior on adversarial images.
What would settle it
Finding a set of clean images where the model changes its prediction after applying the affine transformations, or adversarial images where the prediction stays the same.
Figures
read the original abstract
Adversarial attacks are small, carefully crafted perturbations, imperceptible to the naked eye; that when added to an image cause deep learning models to misclassify the image with potentially detrimental outcomes. With the rise of artificial intelligence models in consumer safety and security intensive industries such as self-driving cars, camera surveillance and face recognition, there is a growing need for guarding against adversarial attacks. In this paper, we present an approach that uses metamorphic testing principles to automatically detect such adversarial attacks. The approach can detect image manipulations that are so small, that they are impossible to detect by a human through visual inspection. By applying metamorphic relations based on distance ratio preserving affine image transformations which compare the behavior of the original and transformed image; we show that our proposed approach can determine whether or not the input image is adversarial with a high degree of accuracy.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a metamorphic testing approach to detect adversarial examples in deep learning image classifiers. It defines metamorphic relations based on distance-ratio-preserving affine transformations (e.g., rotations, scalings) and claims that comparing model behavior on an original image versus its transformed version allows reliable identification of adversarial inputs, achieving high accuracy.
Significance. If empirically validated with supporting experiments, the method could provide a useful, attack-agnostic detection technique for adversarial examples that does not require model retraining or knowledge of the perturbation, which would be relevant for safety-critical CV applications.
major comments (2)
- [Abstract] Abstract: the assertion that the approach determines whether an input is adversarial 'with a high degree of accuracy' is unsupported, as the manuscript supplies no experimental results, datasets, baselines, error bars, or implementation details.
- [Abstract] Abstract (final sentence): the core assumption that distance-ratio-preserving affine transformations produce consistent predictions on clean images but inconsistent predictions on adversarial images lacks any theoretical argument, invariance proof, or preliminary evidence; standard CNNs are known to change outputs under modest affine transforms unless explicitly trained for invariance.
minor comments (2)
- [Abstract] Abstract: punctuation and sentence structure issues, e.g., 'imperceptible to the naked eye; that when added' disrupts readability and should be rephrased.
- [Abstract] Abstract: the industry list ('self-driving cars, camera surveillance and face recognition') lacks an Oxford comma and parallel construction.
Simulated Author's Rebuttal
Thank you for the constructive feedback on our manuscript. We appreciate the referee's identification of key issues in the abstract and will address them directly in our point-by-point response.
read point-by-point responses
-
Referee: [Abstract] Abstract: the assertion that the approach determines whether an input is adversarial 'with a high degree of accuracy' is unsupported, as the manuscript supplies no experimental results, datasets, baselines, error bars, or implementation details.
Authors: We agree that the current manuscript does not include any experimental results, datasets, or implementation details to support the claim of determining adversarial inputs 'with a high degree of accuracy.' The abstract will be revised to remove this unsupported assertion. The revised version will either qualify the claim or defer it until supporting experiments can be added. revision: yes
-
Referee: [Abstract] Abstract (final sentence): the core assumption that distance-ratio-preserving affine transformations produce consistent predictions on clean images but inconsistent predictions on adversarial images lacks any theoretical argument, invariance proof, or preliminary evidence; standard CNNs are known to change outputs under modest affine transforms unless explicitly trained for invariance.
Authors: The referee correctly identifies that the manuscript provides no theoretical argument, invariance proof, or evidence for the core assumption. While the approach is motivated by the geometric properties of distance-ratio-preserving transformations, we acknowledge that standard CNNs are not inherently invariant to affine transforms. In the revision, we will expand the abstract and manuscript to include a more detailed rationale for the metamorphic relations, explicitly discuss the limitations regarding CNN invariance, and note this as requiring further investigation or preliminary validation. revision: yes
Circularity Check
No circularity: empirical metamorphic testing method with no derivations or self-referential predictions
full rationale
The paper describes an application of metamorphic testing using distance-ratio-preserving affine transformations to detect adversarial examples by comparing model behavior on original and transformed images. No equations, derivations, fitted parameters presented as predictions, or load-bearing self-citations appear in the abstract or described approach. The central claim rests on an empirical assumption about model consistency under transforms, which is evaluated through experiments rather than reduced to a definition or prior self-citation by construction. This is a standard non-circular empirical method paper.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Distance ratio preserving affine transformations produce consistent classification behavior for non-adversarial images but inconsistent behavior for adversarial images.
Reference graph
Works this paper leans on
-
[1]
Imagenet classification with deep convolutional neural networks,
A. Krizhevsky, I. Sutskever, and G. E. Hinton ,“Imagenet classification with deep convolutional neural networks,” in Advances in neural information processing systems, 2012, pp. 1097– 1105
work page 2012
-
[2]
Very Deep Convolutional Networks for Large-Scale Image Recognition
K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” in arXiv:1409.1556, 2014
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[3]
Intriguing properties of neural networks
C. Szegedy, W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus, “Intriguing properties of neural networks,” in arXiv:1312.6199, 2014
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[4]
Practical Black -Box Attacks against Machine Learning
N. Papernot, P. McDaniel, I. Goodfellow, S. Jha, Z. B. Celik, and A. Swami. “Practical Black -Box Attacks against Machine Learning”, ACM Asia Conference on Computer and Communications Security (ASIACCS), April 2017
work page 2017
-
[6]
YOLOv3: An Incremental Improvement
J. Redmon and A . Farhadi. “YOLOv3: An Incremental Improvement.” CoRRabs/1804.02767, 2018
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[7]
Robust Physical -World Attacks on Deep Learning Visual Classification,
K. Eykholt, I. Evtimov, E. Fernandes, B. Li, A. Rahmati, C. Xiao, A. Prakash, T. Kohno, and D. Song, “Robust Physical -World Attacks on Deep Learning Visual Classification,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018, pp. 1625-1634
work page 2018
-
[8]
Black-box Adversarial Attacks with Limited Queries and Information
A. Ilyas, L. Engstrom, A. Athalye, and J. Lin, “Black-box Adversarial Attacks with Limited Queries and Information ”, in ICML, 2018
work page 2018
-
[9]
IEEE Std 1012-2016 - IEEE Standard for System, Software, and Hardware Verification and Validation
IEEE. “IEEE Std 1012-2016 - IEEE Standard for System, Software, and Hardware Verification and Validation ”, https://standards.ieee.org/findstds/standard/ 1012 -2016.html. [Online; accessed 7-Jan-2018]
work page 2016
-
[10]
ImageNet: a large -scale hierarchical image database
J. Deng, W. Dong, R. Socher, L-J. Li, K. Li, and L. Fei-Fei, “ImageNet: a large -scale hierarchical image database ”, in CVPR, 2009
work page 2009
-
[11]
A Survey on Metamorphic Testing
S. Segura, G. Fraser, A. Sánchez, and A. Ruiz-Cortés, “A Survey on Metamorphic Testing ”, in IEEE Transactions on Software Engineering. Vol 42(9), 2016, pp. 805-824
work page 2016
-
[12]
Deep Residual Learning for Image Recognition
K. He, X, Zhang, S. Ren, and J. Sun, “Deep Residual Learning for Image Recognition”, in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 770-778
work page 2016
-
[13]
A. Dwarakanath, M. Ahuja, S. Sikand, R. M. Rao, R. P. Bose, N. Dubash, and S. Podder, “Identifying implementation bugs in machine learning based image classifiers using metamorphic testing”, in Proceedings of the 27th ACM SIGSOFT International Symposium on Software Testing and Analysis, 2018, pp. 118-128
work page 2018
-
[14]
Detecting Adversarial Examples Through Image Transformation
S. Tian, G. Yang, a nd Y. Cai, “Detecting Adversarial Examples Through Image Transformation”, in AAAI, 2018
work page 2018
-
[15]
Adversarial Examples Are Not Easily Detected: Bypassing Ten Detection Methods
N. Carlini and D. Wagner, “Adversarial examples are not easily detected: Bypassing ten detection methods ”, in arXiv preprint arXiv:1705.07263, 2017
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[16]
Explaining and Harnessing Adversarial Examples
I. J. Goodfellow, J. Shlens, and C. Szegedy, “Explaining and Harnessing Adversarial Examples”, inCoRR, abs/1412.6572, 2014
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[17]
A S ystematic Way of Affine Transformation Using Image Registration
J. Singla and G. Raman, “A S ystematic Way of Affine Transformation Using Image Registration”, 2012
work page 2012
-
[18]
A. Krizhevsky, V. Nair, and G . Hinton, ‘CIFAR -10 and CIFAR - 100’. https://www.cs.toronto.edu/~kriz/cifar.html
-
[19]
MNIST handwritten digit database
Y. LeCun and C. Cortes, (2010), “MNIST handwritten digit database”, 2010. http://yann.lecun.com/exdb/mnist/
work page 2010
-
[20]
DeepFool: A Simple and Accurate Method to Fool Deep Neural Networks
S-M. Moosavi-Dezfooli, A. Fawzi and P. Frossard, “DeepFool: A Simple and Accurate Method to Fool Deep Neural Networks ”, in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 2574-2582
work page 2016
-
[21]
Metamorphic testing: a new approach for generating next test cases
T. Y. Chen, S. C. Cheung, and S. M. Yiu. “Metamorphic testing: a new approach for generating next test cases ”, Technical Report HKUST-CS98-01, Department of Computer Science, Hong Kong University of Science and Technology, Hong Kong, 1998
work page 1998
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.