Stretching the Effectiveness of MLE from Accuracy to Bias for Pairwise Comparisons

· 2019 · cs.LG · arXiv 1906.04066

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

A number of applications (e.g., AI bot tournaments, sports, peer grading, crowdsourcing) use pairwise comparison data and the Bradley-Terry-Luce (BTL) model to evaluate a given collection of items (e.g., bots, teams, students, search results). Past work has shown that under the BTL model, the widely-used maximum-likelihood estimator (MLE) is minimax-optimal in estimating the item parameters, in terms of the mean squared error. However, another important desideratum for designing estimators is fairness. In this work, we consider fairness modeled by the notion of bias in statistics. We show that the MLE incurs a suboptimal rate in terms of bias. We then propose a simple modification to the MLE, which "stretches" the bounding box of the maximum-likelihood optimizer by a small constant factor from the underlying ground truth domain. We show that this simple modification leads to an improved rate in bias, while maintaining minimax-optimality in the mean squared error. In this manner, our proposed class of estimators provably improves fairness represented by bias without loss in accuracy.

representative citing papers

Learning What Evaluators Value: A Reliable Approach to Modeling Evaluator Preferences

cs.LG · 2026-05-15 · unverdicted · novelty 5.0 · 2 refs

Presents a robust algorithm for learning any coordinate-wise non-decreasing evaluator preference function, with theoretical guarantees that it matches linear performance when linearity holds.

citing papers explorer

Showing 1 of 1 citing paper.

Learning What Evaluators Value: A Reliable Approach to Modeling Evaluator Preferences cs.LG · 2026-05-15 · unverdicted · none · ref 23 · 2 links · internal anchor
Presents a robust algorithm for learning any coordinate-wise non-decreasing evaluator preference function, with theoretical guarantees that it matches linear performance when linearity holds.

Stretching the Effectiveness of MLE from Accuracy to Bias for Pairwise Comparisons

fields

years

verdicts

representative citing papers

citing papers explorer