For each video, we cropped short 10-frame video clips around three tempo- rally separated core interaction events

Experimental Dataset S-Table 8 lists all 33 videos included in the experimental dataset, which was used in our experiments

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Action Without Interaction: Probing the Physical Foundations of Video LMMs via Contact-Release Detection

cs.CV · 2025-11-25 · unverdicted · novelty 7.0

Video LMMs name objects and actions reliably but fail to detect the precise frames and locations of contact and release events, revealing shortcut learning instead of physical grounding.

citing papers explorer

Showing 1 of 1 citing paper.

Action Without Interaction: Probing the Physical Foundations of Video LMMs via Contact-Release Detection cs.CV · 2025-11-25 · unverdicted · none · ref 42
Video LMMs name objects and actions reliably but fail to detect the precise frames and locations of contact and release events, revealing shortcut learning instead of physical grounding.

For each video, we cropped short 10-frame video clips around three tempo- rally separated core interaction events

fields

years

verdicts

representative citing papers

citing papers explorer