pith. sign in

← back to paper

Review history

arxiv: 2605.14504 · 2 revisions

When Robots Do the Chores: A Benchmark and Agent for Long-Horizon Household Task Execution

  1. 2026-05-20 UNVERDICTED LOW v0.9.0 novelty 8.0
    55625 ms 5742 in 1600 out 2026-05-20T21:21:04.353535+00:00
  2. 2026-05-15 CONDITIONAL LOW v0.9.0 novelty 6.0
    37951 ms 5511 in 1263 out 2026-05-15T01:55:07.807981+00:00