AI peer reviewers show excessive agreement across papers and give higher scores after simple LLM-based stylistic rewriting, so general-purpose LLMs should not automate reviews without rigorous evaluation.
eacl-long.119/
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.AI 1years
2026 1verdicts
ACCEPT 1representative citing papers
citing papers explorer
-
Stop Automating Peer Review Without Rigorous Evaluation
AI peer reviewers show excessive agreement across papers and give higher scores after simple LLM-based stylistic rewriting, so general-purpose LLMs should not automate reviews without rigorous evaluation.