pith. sign in

← back to paper

Review history

arxiv: 2605.02503 · 2 revisions

DataClawBench: An Agent Benchmark for Exploratory Real-World Financial Data Analysis

  1. 2026-05-21 UNVERDICTED LOW v0.9.0 novelty 6.0
    48563 ms 5725 in 1276 out 2026-05-21T00:21:48.264268+00:00
  2. 2026-05-08 UNVERDICTED LOW v0.9.0 novelty 7.0
    53875 ms 5507 in 1284 out 2026-05-08T18:20:13.877925+00:00