Review history

arXiv: 2604.06750 · 2 revisions

How Well Do Vision-Language Models Understand Sequential Driving Scenes? A Sensitivity Study

  1. 2026-05-21 · UNVERDICTED · LOW · v0.9.0 · novelty 6.0
    40306 ms · 5722 in · 1280 out · 2026-05-21T09:49:23.084404+00:00
  2. 2026-05-10 · CONDITIONAL · MODERATE · v0.9.0 · novelty 6.0
    30620 ms · 5489 in · 1120 out · 2026-05-10T18:41:46.404826+00:00