An Open Review of OpenReview: A Critical Analysis of the Machine Learning Conference Review Process

Alex Valtchanov; David Tran; Eric Slud; Keshav Ganapathy; Micah Goldblum; Raymond Feng; Tom Goldstein

arxiv: 2010.05137 · v2 · pith:A576KJN2new · submitted 2020-10-11 · 💻 cs.LG · cs.CY

An Open Review of OpenReview: A Critical Analysis of the Machine Learning Conference Review Process

David Tran , Alex Valtchanov , Keshav Ganapathy , Raymond Feng , Eric Slud , Micah Goldblum , Tom Goldstein This is my paper

classification 💻 cs.LG cs.CY

keywords reviewacceptancedecisionslearningmachinescoresbiasconference

0 comments

read the original abstract

Mainstream machine learning conferences have seen a dramatic increase in the number of participants, along with a growing range of perspectives, in recent years. Members of the machine learning community are likely to overhear allegations ranging from randomness of acceptance decisions to institutional bias. In this work, we critically analyze the review process through a comprehensive study of papers submitted to ICLR between 2017 and 2020. We quantify reproducibility/randomness in review scores and acceptance decisions, and examine whether scores correlate with paper impact. Our findings suggest strong institutional bias in accept/reject decisions, even after controlling for paper quality. Furthermore, we find evidence for a gender gap, with female authors receiving lower scores, lower acceptance rates, and fewer citations per paper than their male counterparts. We conclude our work with recommendations for future conference organizers.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Are Non-English Papers Reviewed Fairly? Language-of-Study Bias in NLP Peer Reviews
cs.CL 2026-04 unverdicted novelty 8.0

Non-English papers face substantially higher rates of negative peer review bias than English-only papers in NLP, with demanding unjustified cross-lingual generalization as the dominant form.