What Do VLMs NOTICE? A Mechanistic Interpretability Pipeline for Gaussian- Noise-free Text-Image Corruption and Evaluation.arXiv preprint, 2024

Michal Golovanevsky, William Rudman, Vedant Palit, Ritambhara Singh, Carsten Eickhoff · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Multimodal QUD: Inquisitive Questions from Scientific Figures

cs.CL · 2026-04-26 · unverdicted · novelty 7.0

Extending the linguistic theory of Questions Under Discussion to multimodal scientific figures produces the MQUD dataset of author-annotated questions and improves VLM generation of content-specific multimodal questions.

citing papers explorer

Showing 1 of 1 citing paper.

Multimodal QUD: Inquisitive Questions from Scientific Figures cs.CL · 2026-04-26 · unverdicted · none · ref 34
Extending the linguistic theory of Questions Under Discussion to multimodal scientific figures produces the MQUD dataset of author-annotated questions and improves VLM generation of content-specific multimodal questions.

What Do VLMs NOTICE? A Mechanistic Interpretability Pipeline for Gaussian- Noise-free Text-Image Corruption and Evaluation.arXiv preprint, 2024

fields

years

verdicts

representative citing papers

citing papers explorer