pith. sign in

arxiv: 2606.07492 · v1 · pith:FYZ7LPVWnew · submitted 2026-06-05 · 💻 cs.IR · cs.LG· stat.ML

Bradley-Terry Rankings for Recommender Systems Across Dataset Taxonomies

classification 💻 cs.IR cs.LGstat.ML
keywords rankingalgorithmsbradley-terrydatasetmethodologydemonstrateintroducemodel
0
0 comments X
read the original abstract

The ranking of recommendation algorithms is a challenging problem since model performance is sensitive to dataset characteristics such as sparsity, sequential structure, and scale. This drives a demand for a proper methodology for fair comparison between algorithms. Naive aggregation of performance metrics (e.g., averaging NDCG over benchmarks) can yield misleading rankings, undermining practical selection. To address this problem, we introduce a novel, data-driven ranking methodology based on Bradley-Terry (BT) model. We demonstrate that the obtained ranking depends on key dataset statistics. Additionally, we propose a novel metric for evaluating ranking consistency and demonstrate robustness of our ranking to incomplete data. Finally, we introduce a dataset-specific methodology for ranking algorithms on unseen datasets without running the models, relying on extensions of the Bradley-Terry framework, including BT trees and BT models with covariates.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.