CoCoReviewBench: A Completeness- and Correctness-Oriented Benchmark for AI Reviewers
CoCoReviewBench curates 3,900 conference papers, organized into category subsets with expert discussion annotations, to evaluate AI reviewers on completeness and correctness. The results show that AI reviewers remain limited and prone to hallucination, though reasoning-oriented models perform better.
Agent Laboratory: Using LLM Agents as Research Assistants
Agent Laboratory is an autonomous LLM-based framework that carries out end-to-end research, from idea to report and code. Human feedback improves output quality, and the system reduces research costs by 84% while achieving competitive ML performance.