Bradley-Terry Rankings for Recommender Systems Across Dataset Taxonomies

Alexander Derevyagin; Anton Lysenko; Askar Tsyganov; Daria Korovaitceva; Ekaterina Grishina; Evgeny Frolov; Ilya Ivanov; Margarita Rusanova; Sergey Samsonov; Stepan Kuznetsov

arxiv: 2606.07492 · v1 · pith:FYZ7LPVWnew · submitted 2026-06-05 · 💻 cs.IR · cs.LG· stat.ML

Bradley-Terry Rankings for Recommender Systems Across Dataset Taxonomies

Ekaterina Grishina , Stepan Kuznetsov , Askar Tsyganov , Ilya Ivanov , Daria Korovaitceva , Margarita Rusanova , Uliana Parkina , Alexander Derevyagin

show 3 more authors

Evgeny Frolov Sergey Samsonov Anton Lysenko

This is my paper

classification 💻 cs.IR cs.LGstat.ML

keywords rankingalgorithmsbradley-terrydatasetmethodologydemonstrateintroducemodel

0 comments

read the original abstract

The ranking of recommendation algorithms is a challenging problem since model performance is sensitive to dataset characteristics such as sparsity, sequential structure, and scale. This drives a demand for a proper methodology for fair comparison between algorithms. Naive aggregation of performance metrics (e.g., averaging NDCG over benchmarks) can yield misleading rankings, undermining practical selection. To address this problem, we introduce a novel, data-driven ranking methodology based on Bradley-Terry (BT) model. We demonstrate that the obtained ranking depends on key dataset statistics. Additionally, we propose a novel metric for evaluating ranking consistency and demonstrate robustness of our ranking to incomplete data. Finally, we introduce a dataset-specific methodology for ranking algorithms on unseen datasets without running the models, relying on extensions of the Bradley-Terry framework, including BT trees and BT models with covariates.

This paper has not been read by Pith yet.

Bradley-Terry Rankings for Recommender Systems Across Dataset Taxonomies

discussion (0)