Proposes Visual Fidelity and Contrastiveness scores for VLM explanations that improve user accuracy in judging prediction correctness by 11.1% without visual context on A-OKVQA, VizWiz, and MMMU-Pro.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Believing without Seeing: Quality Scores for Contextualizing Vision-Language Model Explanations
Proposes Visual Fidelity and Contrastiveness scores for VLM explanations that improve user accuracy in judging prediction correctness by 11.1% without visual context on A-OKVQA, VizWiz, and MMMU-Pro.