Preprint, arXiv:2508.13968

RotBench: Evaluating multimodal large language models on identifying image rotation · 2025 · arXiv 2508.13968

2 Pith papers cite this work. Polarity classification is still being indexed.

representative citing papers

Why MLLMs Struggle to Determine Object Orientations

cs.CV · 2026-04-14 · accept · novelty 7.0

Orientation information is recoverable from MLLM visual encoder embeddings via linear regression, contradicting the hypothesis that failures originate in the encoders.

When Relations Break: Analyzing Relation Hallucination in Vision-Language Model Under Rotation and Noise

cs.CV · 2026-05-06 · unverdicted · novelty 4.0 · 2 refs

Mild rotations and noise significantly increase relation hallucinations in VLMs across models and datasets, with prompt and preprocessing fixes providing only partial relief.

citing papers explorer

Showing 2 of 2 citing papers.

Why MLLMs Struggle to Determine Object Orientations

cs.CV · 2026-04-14 · accept · none · ref 25

When Relations Break: Analyzing Relation Hallucination in Vision-Language Model Under Rotation and Noise

cs.CV · 2026-05-06 · unverdicted · none · ref 10 · 2 links
