An Open Review of OpenReview: A Critical Analysis of the Machine Learning Conference Review Process
read the original abstract
Mainstream machine learning conferences have seen a dramatic increase in the number of participants, along with a growing range of perspectives, in recent years. Members of the machine learning community are likely to overhear allegations ranging from randomness of acceptance decisions to institutional bias. In this work, we critically analyze the review process through a comprehensive study of papers submitted to ICLR between 2017 and 2020. We quantify reproducibility/randomness in review scores and acceptance decisions, and examine whether scores correlate with paper impact. Our findings suggest strong institutional bias in accept/reject decisions, even after controlling for paper quality. Furthermore, we find evidence for a gender gap, with female authors receiving lower scores, lower acceptance rates, and fewer citations per paper than their male counterparts. We conclude our work with recommendations for future conference organizers.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
Are Non-English Papers Reviewed Fairly? Language-of-Study Bias in NLP Peer Reviews
Non-English papers face substantially higher rates of negative peer review bias than English-only papers in NLP, with demanding unjustified cross-lingual generalization as the dominant form.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.