pith. sign in

← back to paper

Review history

arxiv: 2605.09904 · 2 revisions

TOC-Bench: A Temporal Object Consistency Benchmark for Video Large Language Models

  1. 2026-05-13 CONDITIONAL LOW v0.9.0 novelty 7.0
    86106 ms 5603 in 1279 out 2026-05-13T06:46:02.967083+00:00
  2. 2026-05-12 UNVERDICTED LOW v0.9.0 novelty 7.0
    30178 ms 5581 in 1225 out 2026-05-12T04:09:27.293358+00:00