pith. sign in

← back to paper

Review history

arxiv: 2605.09826 · 2 revisions

EnactToM: An Evolving Benchmark for Functional Theory of Mind in Embodied Agents

  1. 2026-05-20 CONDITIONAL LOW v0.9.0 novelty 6.0
    64624 ms 5738 in 1148 out 2026-05-20T23:24:28.788730+00:00
  2. 2026-05-12 UNVERDICTED LOW v0.9.0 novelty 7.0
    57987 ms 5507 in 1206 out 2026-05-12T04:54:30.363368+00:00