pith. sign in

← back to paper

Review history

arxiv: 2605.07905 · 2 revisions

CoCoReviewBench: A Completeness- and Correctness-Oriented Benchmark for AI Reviewers

  1. 2026-05-20 UNVERDICTED LOW v0.9.0 novelty 7.0
    41243 ms 5713 in 1096 out 2026-05-20T22:44:50.084961+00:00
  2. 2026-05-11 UNVERDICTED LOW v0.9.0 novelty 7.0
    73565 ms 5482 in 1340 out 2026-05-11T03:35:25.842977+00:00