Know-show: Bench- marking video-language models on spatio-temporal grounded reasoning.arXiv preprint arXiv:2512.05513

Chinthani Sugandhika, Chen Li, Deepu Rajan, Basura Fernando · arXiv 2512.05513

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Reasoning over Video: Evaluating How MLLMs Extract, Integrate, and Reconstruct Spatiotemporal Evidence

cs.CV · 2026-03-13 · unverdicted · novelty 7.0

VAEX-BENCH shows state-of-the-art MLLMs perform substantially worse on abstractive spatiotemporal reasoning tasks than on matched extractive tasks in video understanding.

citing papers explorer

Showing 1 of 1 citing paper.

Reasoning over Video: Evaluating How MLLMs Extract, Integrate, and Reconstruct Spatiotemporal Evidence cs.CV · 2026-03-13 · unverdicted · none · ref 15
VAEX-BENCH shows state-of-the-art MLLMs perform substantially worse on abstractive spatiotemporal reasoning tasks than on matched extractive tasks in video understanding.

Know-show: Bench- marking video-language models on spatio-temporal grounded reasoning.arXiv preprint arXiv:2512.05513

fields

years

verdicts

representative citing papers

citing papers explorer