CoCoReviewBench curates 3,900 ICLR and NeurIPS papers into category-specific subsets with discussion-based annotations to evaluate AI reviewers on completeness and correctness rather than human review overlap.
These issues often require checking adherence to conference or legal policies and may involve ethical considerations.
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CL (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
- CoCoReviewBench: A Completeness- and Correctness-Oriented Benchmark for AI Reviewers