arxiv: 2605.07905 · 2 revisions
CoCoReviewBench: A Completeness- and Correctness-Oriented Benchmark for AI Reviewers