Vlm-robustbench: A comprehensive benchmark for robustness of vision-language models

Rohit Saxena, Alessandro Suglia, Pasquale Minervini · 2026 · arXiv 2603.06148

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

TempGlitch: Evaluating Vision-Language Models for Temporal Glitch Detection in Gameplay Videos

cs.CV · 2026-05-20 · unverdicted · novelty 6.0

TempGlitch is a controlled benchmark showing that 12 evaluated VLMs perform near chance level on detecting five types of temporal glitches in gameplay videos, with denser sampling and larger models providing no reliable improvement.

RemoteShield: Enable Robust Multimodal Large Language Models for Earth Observation

cs.CV · 2026-04-19 · unverdicted · novelty 6.0

RemoteShield improves robustness of Earth observation MLLMs by training on semantic equivalence clusters of clean and perturbed inputs via preference learning to maintain consistent reasoning under noise.

citing papers explorer

Showing 2 of 2 citing papers.

TempGlitch: Evaluating Vision-Language Models for Temporal Glitch Detection in Gameplay Videos cs.CV · 2026-05-20 · unverdicted · none · ref 23
TempGlitch is a controlled benchmark showing that 12 evaluated VLMs perform near chance level on detecting five types of temporal glitches in gameplay videos, with denser sampling and larger models providing no reliable improvement.
RemoteShield: Enable Robust Multimodal Large Language Models for Earth Observation cs.CV · 2026-04-19 · unverdicted · none · ref 45
RemoteShield improves robustness of Earth observation MLLMs by training on semantic equivalence clusters of clean and perturbed inputs via preference learning to maintain consistent reasoning under noise.

Vlm-robustbench: A comprehensive benchmark for robustness of vision-language models

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer