Robustness of Vision Foundation Models to Common Perturbations
Pith reviewed 2026-05-10 11:02 UTC · model grok-4.3
The pith
Vision foundation models are generally non-robust to common perturbations like JPEG compression, brightness, and contrast adjustments.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We present the first systematic study on foundation models' robustness to common perturbations that alter embedding vectors. We propose three robustness metrics and formulate five desired mathematical properties for these metrics, analyzing which properties they satisfy or violate. Using these metrics, we evaluate six industry-scale foundation models across nine common perturbation categories, finding them generally non-robust. We also show that common perturbations degrade downstream application performance and that robustness values can predict performance impacts. Finally, we propose a fine-tuning approach to improve robustness without sacrificing utility.
What carries the argument
Three robustness metrics, each checked against five mathematical properties, that quantify changes in embedding vectors caused by image perturbations.
If this is right
- Downstream tasks experience measurable drops in accuracy when inputs undergo common perturbations.
- The numerical robustness values directly predict the size of those accuracy drops.
- Fine-tuning on perturbed examples raises robustness scores while leaving original task utility intact.
- Models from different providers exhibit similar patterns of sensitivity across the nine perturbation types.
- Embedding-based applications become less reliable unless robustness is explicitly addressed.
Where Pith is reading between the lines
- Applications that rely on these embeddings may need to preprocess inputs or adopt the fine-tuning step to maintain consistent behavior.
- The metrics could serve as a quick benchmark when comparing new training methods or model architectures for sensitivity to everyday image variation.
- Persistent non-robustness might point to deeper limitations in how current training data and objectives handle natural image variability.
Load-bearing premise
The three proposed robustness metrics accurately capture how perturbations affect performance in actual downstream applications.
What would settle it
A new model that scores high on the proposed robustness metrics but still shows large drops in downstream accuracy when the same perturbations are applied would falsify the claim that the metrics track practical impact.
Figures
read the original abstract
A vision foundation model outputs an embedding vector for an image, which can be affected by common editing operations (e.g., JPEG compression, brightness, contrast adjustments). These common perturbations alter embedding vectors and may impact the performance of downstream tasks using these embeddings. In this work, we present the first systematic study on foundation models' robustness to such perturbations. We propose three robustness metrics and formulate five desired mathematical properties for these metrics, analyzing which properties they satisfy or violate. Using these metrics, we evaluate six industry-scale foundation models (OpenAI, Meta) across nine common perturbation categories, finding them generally non-robust. We also show that common perturbations degrade downstream application performance (e.g., classification accuracy) and that robustness values can predict performance impacts. Finally, we propose a fine-tuning approach to improve robustness without sacrificing utility.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims to perform the first systematic study on the robustness of vision foundation models to common perturbations. It proposes three robustness metrics and analyzes their satisfaction of five mathematical properties. The evaluation covers six industry-scale models from OpenAI and Meta across nine perturbation categories, concluding they are generally non-robust. It demonstrates that these perturbations degrade downstream application performance like classification accuracy, that the robustness metrics can predict such performance impacts, and proposes a fine-tuning approach to improve robustness without sacrificing utility.
Significance. If the results hold, the paper makes a significant contribution by identifying vulnerabilities in widely-used vision foundation models to everyday perturbations, which is important for applications relying on their embeddings. The analysis of mathematical properties for the new metrics and the empirical demonstration of their predictive power for downstream tasks are notable strengths. The proposed fine-tuning method adds practical value. This could encourage the community to prioritize robustness in model development.
minor comments (2)
- The abstract outlines the contributions but would benefit from briefly specifying the exact number of models evaluated and perturbation categories to provide a more complete overview at a glance.
- In the evaluation results, include error bars, standard deviations, or statistical significance tests for the robustness metric values and downstream performance degradations to better support the 'generally non-robust' conclusion and the predictive claims.
Simulated Author's Rebuttal
We thank the referee for their positive summary of our work, recognition of its significance, and recommendation for minor revision. The referee's description accurately reflects the manuscript's contributions regarding robustness metrics, evaluation of vision foundation models, downstream impact analysis, and the proposed fine-tuning method. No specific major comments were provided in the report.
Circularity Check
No significant circularity; empirical study with independent metric validation
full rationale
The paper proposes three robustness metrics and five mathematical properties, then explicitly analyzes which properties each metric satisfies or violates. It evaluates six external foundation models on nine perturbation categories using direct measurements, demonstrates downstream task degradation (e.g., classification accuracy) via separate experiments, and shows observed correlations between robustness scores and performance drops. A fine-tuning method is proposed to improve robustness. No derivation reduces by construction to fitted parameters, self-definitions, or self-citation chains; all load-bearing claims rest on external data and explicit property checks rather than renaming or tautological prediction.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Are transformers more robust than cnns? InNeurIPS, 2021
Yutong Bai, Jieru Mei, Alan L Yuille, and Cihang Xie. Are transformers more robust than cnns? InNeurIPS, 2021. 8
work page 2021
-
[2]
Under- standing robustness of transformers for image classification
Srinadh Bhojanapalli, Ayan Chakrabarti, Daniel Glasner, Daliang Li, Thomas Unterthiner, and Andreas Veit. Under- standing robustness of transformers for image classification. InICCV, 2021. 8
work page 2021
-
[3]
Food-101 – mining discriminative components with random forests
Lukas Bossard, Matthieu Guillaumin, and Luc Van Gool. Food-101 – mining discriminative components with random forests. InECCV, 2014. 5
work page 2014
-
[4]
Towards evaluating the robustness of neural networks
Nicholas Carlini and David Wagner. Towards evaluating the robustness of neural networks. InS&P, 2017. 1
work page 2017
-
[5]
A simple framework for contrastive learning of visual representations
Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geof- frey Hinton. A simple framework for contrastive learning of visual representations. InICML, 2020. 8
work page 2020
-
[6]
Imagenet: A large-scale hierarchical image database
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. Imagenet: A large-scale hierarchical image database. InCVPR, 2009. 5
work page 2009
-
[7]
Lijie Fan, Sijia Liu, Pin-Yu Chen, Gaoyuan Zhang, and Chuang Gan. When does contrastive learning preserve ad- versarial robustness from pretraining to finetuning? 2021. 1
work page 2021
-
[8]
Momentum contrast for unsupervised visual repre- sentation learning
Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross Girshick. Momentum contrast for unsupervised visual repre- sentation learning. InCVPR, 2020. 8
work page 2020
-
[9]
Benchmarking neural network robustness to common corruptions and perturbations
Dan Hendrycks and Thomas Dietterich. Benchmarking neural network robustness to common corruptions and perturbations. InICLR, 2019. 5, 8
work page 2019
-
[10]
The many faces of robust- ness: A critical analysis of out-of-distribution generalization
Dan Hendrycks, Steven Basart, Norman Mu, Saurav Kada- vath, Frank Wang, Evan Dorundo, Rahul Desai, Tyler Zhu, Samyak Parajuli, Mike Guo, et al. The many faces of robust- ness: A critical analysis of out-of-distribution generalization. InICCV, 2021. 8
work page 2021
-
[11]
Baden- coder: Backdoor attacks to pre-trained encoders in self- supervised learning
Jinyuan Jia, Yupei Liu, and Neil Zhenqiang Gong. Baden- coder: Backdoor attacks to pre-trained encoders in self- supervised learning. InS&P, 2022. 1
work page 2022
-
[12]
Robust pre-training by adversarial contrastive learning
Ziyu Jiang, Tianlong Chen, Ting Chen, and Zhangyang Wang. Robust pre-training by adversarial contrastive learning. NeurIPS, 2020
work page 2020
-
[13]
Evading watermark based detection of ai-generated content
Zhengyuan Jiang, Jinghuai Zhang, and Neil Zhenqiang Gong. Evading watermark based detection of ai-generated content. InCCS, 2023
work page 2023
-
[14]
An embarrassingly simple backdoor attack on self-supervised learning
Changjiang Li, Ren Pang, Zhaohan Xi, Tianyu Du, Shouling Ji, Yuan Yao, and Ting Wang. An embarrassingly simple backdoor attack on self-supervised learning. InICCV, 2023. 1
work page 2023
-
[15]
Junnan Li, Dongxu Li, Caiming Xiong, and Steven Hoi. Blip: Bootstrapping language-image pre-training for unified vision- language understanding and generation. InICML, 2022. 8
work page 2022
-
[16]
Poi- sonedEncoder: Poisoning the unlabeled pre-training data in contrastive learning
Hongbin Liu, Jinyuan Jia, and Neil Zhenqiang Gong. Poi- sonedEncoder: Poisoning the unlabeled pre-training data in contrastive learning. InUSENIX Security Symposium, 2022. 1
work page 2022
-
[17]
Indoor segmentation and support inference from rgbd images
Pushmeet Kohli Nathan Silberman, Derek Hoiem and Rob Fergus. Indoor segmentation and support inference from rgbd images. InECCV, 2012. 5
work page 2012
-
[18]
Dinov2: Learning robust visual features without supervision.arXiv,
Maxime Oquab, Timoth´ee Darcet, Th´eo Moutakanni, Huy V o, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, et al. Dinov2: Learning robust visual features without supervision.arXiv,
-
[19]
Vision transformers are robust learners
Sayak Paul and Pin-Yu Chen. Vision transformers are robust learners. InAAAI, 2022. 8
work page 2022
-
[20]
Reaas: En- abling adversarially robust downstream classifiers via robust encoder as a service
Wenjie Qu, Jinyuan Jia, and Neil Zhenqiang Gong. Reaas: En- abling adversarially robust downstream classifiers via robust encoder as a service. InNDSS, 2023. 1
work page 2023
-
[21]
Learning transferable visual models from natural language supervision
Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et al. Learning transferable visual models from natural language supervision. InICML, 2021. 1, 4, 8
work page 2021
-
[22]
Backdoor attacks on self- supervised learning
Aniruddha Saha, Ajinkya Tejankar, Soroush Abbasi Kooh- payegani, and Hamed Pirsiavash. Backdoor attacks on self- supervised learning. InCVPR, 2022. 1
work page 2022
-
[23]
Intriguing properties of neural networks
Christian Szegedy, Wojciech Zaremba, Ilya Sutskever, Joan Bruna, Dumitru Erhan, Ian Goodfellow, and Rob Fergus. Intriguing properties of neural networks. InICLR, 2014. 1
work page 2014
-
[24]
Can cnns be more robust than transformers? InICLR, 2023
Zeyu Wang, Yutong Bai, Yuyin Zhou, and Cihang Xie. Can cnns be more robust than transformers? InICLR, 2023. 8
work page 2023
-
[25]
Smallest enclosing disks (balls and ellipsoids)
Emo Welzl. Smallest enclosing disks (balls and ellipsoids). In New Results and New Trends in Computer Science. Springer,
-
[26]
Promptbench: Towards evaluating the robustness of large language models on adversarial prompts
Kaijie Zhu, Jindong Wang, Jiaheng Zhou, Zichen Wang, Hao Chen, Yidong Wang, Linyi Yang, Wei Ye, Neil Zhenqiang Gong, Yue Zhang, et al. Promptbench: Towards evaluating the robustness of large language models on adversarial prompts. arXiv, 2023. 8 9 A. Impact Statements This work strengthens the reliability of vision foundation models by systematically eval...
work page 2023
-
[27]
Without loss of generality, we assume the subdomain K′ contains n discrete values k1,· · ·, k n. Then, we have the following: nX i=1 f(P(x, k i)) =0.(13) Based on Equation 10, we have the following equation group: ||f(P(x, k 1))||2 2 −2f T (P(x, k 1))·c+||c|| 2 2 ≤r 2 ||f(P(x, k 2))||2 2 −2f T (P(x, k 2))·c+||c|| 2 2 ≤r 2 · · · ||f(P(x, k n))||2...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.