Predicting Qualification Thresholds in UEFA's incomplete round-robin tournaments

Christian Deutscher; David Winkelmann; Rouven Michels

arxiv: 2508.20075 · v4 · submitted 2025-08-27 · 💰 econ.GN · q-fin.EC

Predicting Qualification Thresholds in UEFA's incomplete round-robin tournaments

David Winkelmann , Rouven Michels , Christian Deutscher This is my paper

Pith reviewed 2026-05-18 20:54 UTC · model grok-4.3

classification 💰 econ.GN q-fin.EC

keywords UEFA Champions Leaguequalification thresholdsDixon-Coles modelsimulationincomplete round-robinElo ratingsplay-off qualification

0 comments

The pith

A statistical model using Elo ratings and an adjusted bivariate Dixon-Coles approach estimates the points needed for direct qualification and play-off entry in UEFA's new 36-team league format.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper aims to provide reliable estimates of how many points teams must collect to finish in the top eight for direct knockout advancement or in positions nine through twenty-four for play-off qualification under the single-table incomplete round-robin structure introduced in the 2024/25 Champions League and Europa League. It does so by fitting a bivariate Dixon-Coles model that incorporates the lower observed frequency of draws, which may stem from altered team incentives, and by using Elo ratings as proxies for relative team strength. Simulations drawn from this fitted model generate distributions of final standings, from which the authors derive threshold values that clubs can use for planning. A sympathetic reader cares because these estimates replace rough commercial benchmarks with a method that accounts for the specific dynamics of the new format, allowing more informed choices about squad management and match tactics amid outcome uncertainty.

Core claim

By proxying team strengths with Elo ratings and fitting a bivariate Dixon-Coles model that adjusts for the reduced rate of draws seen in the 2024/25 season, the authors generate simulated season outcomes under the incomplete round-robin format. These simulations produce estimated qualification thresholds that indicate the points totals required for direct advancement to the round of sixteen and for entry into the play-off round.

What carries the argument

The bivariate Dixon-Coles model adjusted for observed draw frequency, using Elo ratings as team-strength proxies to simulate match results and derive threshold distributions.

Load-bearing premise

The adjusted bivariate Dixon-Coles model combined with Elo ratings produces simulated outcomes whose derived thresholds accurately represent what teams need to qualify in the new incomplete round-robin structure.

What would settle it

Compare the model's predicted point thresholds against the actual final points of the top eight and the ninth-to-twenty-fourth placed teams in the completed 2024/25 season or in later seasons under the same format.

read the original abstract

For the 2024/25 season, the Union of European Football Associations (UEFA) introduced an incomplete round-robin format in the Champions League and Europa League, replacing the traditional group stage with a single league table of all 36 teams. Under this structure, the top eight teams advance directly to the round of 16, while teams ranked 9th-24th qualify for a play-off round. Simulation-based analyses, such as those by commercial data analyst Opta, provide indicative point thresholds for qualification but reveal deviations when compared with actual outcomes in the first season. To overcome these discrepancies, we employ a bivariate Dixon--Coles model that accounts for the lower frequency of draws observed in the 2024/25 Champions League season, potentially driven by reduced incentives for teams to play for a draw. We proxy team strengths by Elo ratings and fit the model to different settings. This enables us to simulate match outcomes and to estimate qualification thresholds for both direct advancement and play-off participation. Our results provide scientific guidance for clubs and managers, supporting strategic decision-making under uncertainty regarding their progression prospects in the new UEFA club competition formats.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper runs Dixon-Coles simulations with a draw-rate tweak to produce qualification thresholds for the new UEFA league phase, but supplies almost no checks that the tweak improves accuracy over baselines.

read the letter

The paper's main output is a set of simulated point thresholds for direct qualification (top 8) and play-off spots (9-24) in the 2024/25 UEFA Champions League and Europa League league phase. The authors fit a bivariate Dixon-Coles model, adjust the draw parameter downward to match the lower draw frequency observed that season, proxy strengths with Elo ratings, and simulate the incomplete round-robin to get the numbers. That is the concrete deliverable they offer clubs and managers. They do a clean job of taking a standard model and pointing out why the new format likely reduces draws, then turning that into usable thresholds. The practical framing is straightforward and the choice of Elo proxies is reasonable for this setting. What is new is simply the application to this exact format with the explicit draw adjustment; nothing in the method itself is novel. The soft spot is the missing validation. The abstract notes that Opta thresholds deviated from actual outcomes but gives no numbers on model fit, no held-out match predictions, no sensitivity runs on the draw adjustment, and no direct comparison showing that their version reduces error relative to an unadjusted baseline. Because the draw adjustment is calibrated on the same season whose results are being simulated, the thresholds sit close to post-hoc description rather than out-of-sample forecast. That circularity is the main limitation and it is not minor for anyone who wants to rely on the numbers for decisions. This is a paper for football analysts, club performance staff, and applied sports economists who need ballpark figures for the new structure. A reader looking for methodological innovation or strong statistical evidence will find little. It still deserves a serious referee because the question is timely, the model is transparent, and the gaps are fixable with added checks rather than fatal. I would send it out and ask for quantitative validation and perhaps a test on prior seasons.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes a bivariate Dixon-Coles model, adjusted for the lower draw frequency observed in the 2024/25 season and using Elo ratings as team-strength proxies, to simulate outcomes and derive point thresholds for direct qualification (top 8) and play-off qualification (9-24) under UEFA's new incomplete round-robin league phase in the Champions League and Europa League.

Significance. If the simulations prove well-calibrated against realized outcomes, the estimated thresholds could supply clubs and managers with probabilistic guidance superior to commercial baselines such as Opta. The approach is timely given the format change, but its practical value is currently limited by the absence of reported fit statistics, out-of-sample validation, or quantitative comparisons demonstrating improvement over existing simulations.

major comments (2)

Abstract: the claim that the adjusted model 'overcomes these discrepancies' with Opta is unsupported by any quantitative metric (e.g., MAE, calibration score, or Kolmogorov-Smirnov statistic) comparing simulated versus observed ranking distributions or thresholds in the 2024/25 season.
Methods/Results (model fitting and simulation sections): the draw-rate adjustment and Elo-proxy parameters are fitted to the same 2024/25 data whose qualification outcomes are being predicted, so the reported thresholds risk being in-sample fits rather than genuine forecasts for the incomplete round-robin structure; no sensitivity analysis or held-out validation is described.

minor comments (1)

Consider adding an explicit table or figure that reports the fitted Dixon-Coles parameters, the implied draw probability, and the resulting simulated qualification thresholds alongside Opta values and actual 2024/25 outcomes.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments on our manuscript. We address each major comment point by point below and indicate the revisions that will be incorporated.

read point-by-point responses

Referee: Abstract: the claim that the adjusted model 'overcomes these discrepancies' with Opta is unsupported by any quantitative metric (e.g., MAE, calibration score, or Kolmogorov-Smirnov statistic) comparing simulated versus observed ranking distributions or thresholds in the 2024/25 season.

Authors: We agree that the abstract statement requires quantitative support. In the revised manuscript we will add a dedicated comparison subsection reporting mean absolute error between simulated and observed qualification thresholds, as well as a calibration check (e.g., proportion of simulated rankings falling within observed bands). The abstract language will be moderated or strengthened according to the results of these metrics. revision: yes
Referee: Methods/Results (model fitting and simulation sections): the draw-rate adjustment and Elo-proxy parameters are fitted to the same 2024/25 data whose qualification outcomes are being predicted, so the reported thresholds risk being in-sample fits rather than genuine forecasts for the incomplete round-robin structure; no sensitivity analysis or held-out validation is described.

Authors: The parameters are estimated from 2024/25 matches, and the thresholds are generated by forward simulation of the league-phase schedule under the fitted model rather than by using realized scores. We will add a sensitivity analysis that varies the draw-rate adjustment factor and the Elo weighting coefficient, reporting how threshold distributions change. A full held-out validation within the 2024/25 season is not feasible without substantially reducing the estimation sample; we will explicitly note this data limitation and supplement with robustness checks that re-estimate the model on pre-2024/25 seasons where possible. revision: partial

Circularity Check

0 steps flagged

No significant circularity in derivation of qualification thresholds

full rationale

The paper fits a bivariate Dixon-Coles model (with Elo proxies and an explicit adjustment for the lower draw rate observed in the 2024/25 season) to match data and then runs Monte Carlo simulations to produce point-threshold distributions for direct qualification and play-offs. This is a standard forward simulation pipeline: inputs are observed frequencies and team ratings; outputs are simulated quantiles of the resulting league table. No step reduces the claimed thresholds to the inputs by definition, renames a fitted parameter as a prediction, or relies on a load-bearing self-citation or uniqueness theorem. The derivation remains self-contained as an application of an established scoring model rather than a tautological re-expression of its calibration data.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The claim rests on the adequacy of Elo ratings as team-strength proxies and on the Dixon-Coles model's ability to capture the adjusted draw probability after fitting; these are domain assumptions rather than derived results.

free parameters (1)

Dixon-Coles attack, defense, and draw parameters
Fitted to match data to account for team strengths and the observed lower draw rate.

axioms (1)

domain assumption Elo ratings serve as a sufficient proxy for relative team strengths in the new format
Used directly to initialize the bivariate model before fitting.

pith-pipeline@v0.9.0 · 5734 in / 1388 out tokens · 53999 ms · 2026-05-18T20:54:57.401655+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

we employ a bivariate Dixon-Coles model that accounts for the lower frequency of draws observed in the 2024/25 UCL season... proxy team strengths by Elo ratings
IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

simulation-based analyses... estimate qualification thresholds for both direct advancement and play-off participation

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.