Look, listen and learn. 2017 IEEE International Conference on Computer Vision (ICCV), pages 609–617, 2017

Relja Arandjelović, Andrew Zisserman · 2017

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

cs.CV · 2026-05-13 · unverdicted · novelty 6.0

Video MLLMs show an audio-visual Clever Hans effect relying on visual-acoustic correlations rather than audio verification; Thud interventions diagnose it and a 10K-sample preference alignment improves intervention performance by 28 points.

citing papers explorer

Showing 1 of 1 citing paper.

When Vision Speaks for Sound · cs.CV · 2026-05-13 · unverdicted · none · ref 3
