Affordance benchmark for mllms.arXiv preprint arXiv:2506.00893, 2025

Junying Wang, Wenzhe Li, Yalun Wu, Yingji Liang, Yijin Guo, Chunyi Li, Haodong Duan, Zicheng Zhang, Guangtao Zhai · 2025 · arXiv 2506.00893

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

ChronoPhyBench: Do MLLMs Truly Understand the World or Merely Exploit Language Priors?

cs.CV · 2026-06-06 · unverdicted · novelty 7.0

ChronoPhyBench is a new benchmark and dataset for chronological physical dynamics reasoning that combines video-conditioned next-state prediction with VQA to reduce language bias in MLLM evaluation.

citing papers explorer

Showing 1 of 1 citing paper.

ChronoPhyBench: Do MLLMs Truly Understand the World or Merely Exploit Language Priors? cs.CV · 2026-06-06 · unverdicted · none · ref 29
ChronoPhyBench is a new benchmark and dataset for chronological physical dynamics reasoning that combines video-conditioned next-state prediction with VQA to reduce language bias in MLLM evaluation.

Affordance benchmark for mllms.arXiv preprint arXiv:2506.00893, 2025

fields

years

verdicts

representative citing papers

citing papers explorer