Robust CLIP: Unsupervised adver- sarial fine-tuning of vision embeddings for robust large vision-language models

Christian Schlarmann, Naman Deep Singh, Francesco Croce, Matthias Hein · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Is the Modality Gap a Bug or a Feature? A Robustness Perspective

cs.CV · 2026-03-30 · unverdicted · novelty 7.0

Minimizing contrastive loss produces an orthogonal modality gap vector whose size is monotonically tied to robustness, so post-processing that reduces the gap improves robustness with no loss in clean accuracy.

citing papers explorer

Showing 1 of 1 citing paper.

Is the Modality Gap a Bug or a Feature? A Robustness Perspective cs.CV · 2026-03-30 · unverdicted · none · ref 24
Minimizing contrastive loss produces an orthogonal modality gap vector whose size is monotonically tied to robustness, so post-processing that reduces the gap improves robustness with no loss in clean accuracy.

Robust CLIP: Unsupervised adver- sarial fine-tuning of vision embeddings for robust large vision-language models

fields

years

verdicts

representative citing papers

citing papers explorer