A market-calibrated accelerated failure time model for in-play football forecasting
Pith reviewed 2026-05-19 18:53 UTC · model grok-4.3
The pith
Calibrating a Weibull accelerated failure time model to pre-match betting prices and adding post-shot expected goals nearly matches betting exchange accuracy for in-play football forecasts.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that jointly fitting team-strength parameters to 1X2 and over/under Betfair prices via squared-error minimisation, then incorporating post-shot expected goals in a Weibull AFT model, produces in-play forecasts that almost match betting exchange accuracy while retaining interpretability. Comparison with other calibrated models shows market calibration drives the accuracy gain. A betting simulation yields positive returns, indicating market inefficiency.
What carries the argument
Market calibration of team strength parameters to pre-match betting prices combined with post-shot expected goals as a time-varying covariate in a Weibull accelerated failure time model.
Load-bearing premise
That jointly fitting team-strength parameters to pre-match 1X2 and over/under Betfair prices via squared-error minimisation produces values that remain valid for in-play forecasting once post-shot expected goals are added as a time-varying covariate.
What would settle it
Testing the calibrated model forward on a new set of Premier League matches and checking whether classification accuracy remains near 70 percent and whether simulated betting returns against in-play odds stay positive.
Figures
read the original abstract
In-play football forecasting models have struggled to match the accuracy of betting exchange prices, which aggregate information from many market participants. We close this gap by combining two extensions to a Weibull accelerated failure time model: calibrating team strength parameters to Betfair Exchange prices at kick-off to capture pre-match market information, and including post-shot expected goals as a time-varying covariate to capture in-play information. The calibration approach, where we jointly fit team-strength parameters to 1X2 and over/under betting markets via squared-error minimisation, applies to any intensity-based goal arrival model and enables stronger in-play forecasting. Evaluated across 140 English Premier League matches at minute intervals, the calibrated model almost matches Betfair's classification accuracy (70.2% versus 70.6%) while retaining interpretable team-level parameters and covariate effects. A comparison with two alternative continuous-time scoring models, both calibrated to the same pre-match odds, confirms that market calibration is the dominant driver of predictive accuracy. A betting simulation against Betfair in-play odds yields a 4.5% return on investment (Sharpe ratio 5.94) over 17,458 bets, suggesting an inefficiency within in-play football markets.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces a Weibull accelerated failure time model for in-play football goal arrivals that jointly calibrates team-strength parameters to pre-match Betfair 1X2 and over/under prices via squared-error minimization and adds post-shot expected goals as a time-varying covariate. Evaluated at minute intervals across 140 English Premier League matches, the model reaches 70.2% classification accuracy (versus Betfair's 70.6%) and produces a 4.5% ROI (Sharpe ratio 5.94) over 17,458 simulated bets against in-play odds. Comparisons with two other continuous-time models calibrated to the same pre-match odds indicate that market calibration is the primary driver of accuracy gains.
Significance. If the pre-match calibrated parameters remain valid once realized post-shot xG trajectories are inserted, the work supplies a practical, interpretable route to combine market-implied information with in-play covariates. The concrete accuracy figures, direct comparison to betting-exchange benchmarks, and positive ROI constitute falsifiable, practically relevant evidence of both predictive performance and potential market inefficiency.
major comments (2)
- [Calibration section] Calibration section: Team-strength parameters are fitted by minimizing squared error to pre-match Betfair prices at kick-off (covariate = 0 or unconditional expectation). The manuscript provides no explicit invariance check showing that the same parameters, when driven by match-specific realized post-shot xG paths, continue to produce hazards consistent with the original market-implied probabilities. This leaves open the possibility that average future xG contributions are absorbed into the fitted strengths, undermining their use for in-play forecasting.
- [Results section] Results section: The claim that market calibration dominates accuracy gains rests on comparisons with two alternative continuous-time models, both calibrated to the same pre-match odds. Without reporting the accuracy of a non-calibrated Weibull AFT model that nevertheless includes the post-shot xG covariate, it is difficult to isolate whether the reported 70.2% accuracy stems primarily from calibration or from the addition of the time-varying covariate itself.
minor comments (2)
- [Abstract] Abstract and methods: The betting simulation reports a 4.5% ROI and Sharpe ratio 5.94 but does not specify stake sizing, whether odds are taken at the exact minute of each prediction, or how the 17,458 bets are distributed across matches.
- [Evaluation] Evaluation: Accuracy is stated as 70.2% versus 70.6% without accompanying standard errors, number of independent matches, or details on the precise classification rule (e.g., probability threshold for home-win calls).
Simulated Author's Rebuttal
We thank the referee for their constructive comments, which help clarify the interpretation of our calibration procedure and strengthen the evidence for our main claims. We respond to each major comment below and indicate the revisions we will make.
read point-by-point responses
-
Referee: [Calibration section] Calibration section: Team-strength parameters are fitted by minimizing squared error to pre-match Betfair prices at kick-off (covariate = 0 or unconditional expectation). The manuscript provides no explicit invariance check showing that the same parameters, when driven by match-specific realized post-shot xG paths, continue to produce hazards consistent with the original market-implied probabilities. This leaves open the possibility that average future xG contributions are absorbed into the fitted strengths, undermining their use for in-play forecasting.
Authors: We acknowledge the value of an explicit invariance check. The calibration is performed once at kick-off against unconditional market prices, after which the post-shot xG enters only as a time-varying covariate that shifts the hazard during the match. To address the concern that average future xG effects might be absorbed into the strength parameters, we will add a short verification in the revised Calibration section (or an appendix). This will show that, when the calibrated parameters are driven by the unconditional expectation of the xG process, the resulting integrated hazards recover the original pre-match market probabilities on average. We view this as a useful clarification rather than a change to the core method. revision: yes
-
Referee: [Results section] Results section: The claim that market calibration dominates accuracy gains rests on comparisons with two alternative continuous-time models, both calibrated to the same pre-match odds. Without reporting the accuracy of a non-calibrated Weibull AFT model that nevertheless includes the post-shot xG covariate, it is difficult to isolate whether the reported 70.2% accuracy stems primarily from calibration or from the addition of the time-varying covariate itself.
Authors: We agree that reporting the non-calibrated Weibull AFT model that still includes the post-shot xG covariate would make the attribution clearer. In the revised Results section we will therefore add the classification accuracy, log-likelihood, and betting-simulation metrics for this baseline specification. This addition will allow readers to quantify the incremental contribution of market calibration over the covariate alone and will support (or, if necessary, qualify) the statement that calibration is the dominant driver. revision: yes
Circularity Check
No significant circularity detected
full rationale
The paper's central derivation calibrates team-strength parameters to pre-match Betfair 1X2 and over/under prices via squared-error minimization at kick-off (covariate at baseline), then augments the Weibull AFT model with an independent time-varying post-shot expected goals covariate for in-play updates. Evaluation compares the resulting in-play forecasts against Betfair's own classification accuracy on the same 140 matches and uses two alternative models (also pre-match calibrated) to attribute gains to calibration. No equation or claim reduces the in-play output to the pre-match fit by construction; the time-varying covariate path supplies new information not present in the calibration target. External market benchmark and controlled comparisons keep the chain self-contained without self-definitional, fitted-prediction, or self-citation reductions.
Axiom & Free-Parameter Ledger
free parameters (1)
- team strength parameters
axioms (1)
- domain assumption Goal arrival times follow a Weibull distribution conditional on team strengths and covariates
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Weibull accelerated failure time model for goal inter-arrival times... logE[T_H] = μ + β_home + a_H + d_A + β^⊤ x_H
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
calibrating team strength parameters to Betfair Exchange prices at kick-off via squared-error minimisation
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
- [1]
-
[2]
Boshnakov, G., T. Kharrat, and I. G. McHale (2017). A bivariate weibull count model for forecasting association football scores. International Journal of Forecasting\/ 33\/ (2), 458--466
work page 2017
-
[3]
Bunker, R., C. Yeung, and K. Fujii (2024). Machine learning for soccer match result prediction. In Artificial Intelligence, Optimization, and Data Sciences in Sports , pp.\ 7--49. Springer
work page 2024
-
[4]
Capen, E. C., R. V. Clapp, and W. M. Campbell (1971). Competitive bidding in high-risk situations. Journal of Petroleum Technology\/ 23\/ (06), 641--653
work page 1971
-
[5]
Clegg, L. and J. Cartlidge (2025). Not feeling the buzz: Correction study of mispricing and inefficiency in online sportsbooks. International Journal of Forecasting\/ 41\/ (2), 798--802
work page 2025
-
[6]
Constantinou, A. C. and N. E. Fenton (2012). Solving the problem of inadequate scoring rules for assessing probabilistic football forecast models. Journal of Quantitative Analysis in Sports\/ 8\/ (1), 1--14
work page 2012
-
[7]
Dixon, M. and M. Robinson (1998). A birth process model for association football matches. Journal of the Royal Statistical Society: Series D (The Statistician)\/ 47\/ (3), 523--538
work page 1998
-
[8]
Dixon, M. J. and S. G. Coles (1997). Modelling association football scores and inefficiencies in the football betting market. Journal of the Royal Statistical Society: Series C (Applied Statistics)\/ 46\/ (2), 265--280
work page 1997
-
[9]
Egidi, L., F. Pauli, and N. Torelli (2018). Combining historical data and bookmakers’ odds in modelling football scores. Statistical Modelling\/ 18\/ (5-6), 436--459
work page 2018
-
[10]
Epstein, E. S. (1969). A scoring system for probability forecasts of ranked categories. Journal of Applied Meteorology (1962-1982)\/ 8\/ (6), 985--987
work page 1969
-
[11]
Forrest, D., J. Goddard, and R. Simmons (2005). Odds-setters as forecasters: The case of english football. International Journal of Forecasting\/ 21\/ (3), 551--564
work page 2005
-
[12]
Holmes, B. and I. G. McHale (2024). Forecasting football match results using a player rating based model. International Journal of Forecasting\/ 40\/ (1), 302--312
work page 2024
-
[13]
Hub \'a c ek, O. and G. S \' r (2023). Beating the market with a bad predictive model. International Journal of Forecasting\/ 39\/ (2), 691--719
work page 2023
-
[14]
Kalbfleisch, J. D. and R. L. Prentice (2011). The Statistical Analysis of Failure Time Data\/ (2nd ed.). Hoboken, NJ: John Wiley & Sons
work page 2011
-
[15]
Karlis, D. and I. Ntzoufras (2003). Analysis of sports data by using bivariate poisson models. Journal of the Royal Statistical Society: Series D (The Statistician)\/ 52\/ (3), 381--393
work page 2003
-
[16]
Kelly, J. L. (1956). A new interpretation of information rate. The Bell System Technical Journal\/ 35\/ (4), 917--926
work page 1956
-
[17]
Klaassen, F. J. and J. R. Magnus (2003). Forecasting the winner of a tennis match. European Journal of Operational Research\/ 148\/ (2), 257--267
work page 2003
-
[18]
Kleinbaum, D. G. and M. Klein (2012). Survival Analysis: A Self-Learning Text\/ (3rd ed.). Springer
work page 2012
-
[19]
Klemp, M., F. Wunderlich, and D. Memmert (2021). In-play forecasting in football using event and positional data. Scientific Reports\/ 11\/ (1), 24139
work page 2021
-
[20]
Kovalchik, S. and M. Reid (2019). A calibration method with dynamic updates for within-match forecasting of wins in tennis. International Journal of Forecasting\/ 35\/ (2), 756--766
work page 2019
-
[21]
Leriou, I. and I. Ntzoufras (2025). Survival modeling of goal arrival times in english premier league. Computational Statistics\/ 40\/ (4), 2109--2133
work page 2025
-
[22]
Maher, M. J. (1982). Modelling association football scores. Statistica Neerlandica\/ 36\/ (3), 109--118
work page 1982
-
[23]
Maia, L. F. G., T. Pennanen, M. A. da Silva, and R. S. Targino (2025). Stochastic modelling of football matches using dynamic regressors. International Journal of Forecasting\/ 42\/ (1), 181--202
work page 2025
-
[24]
Moroney, M. J. (1956). Facts from Figures\/ (3rd ed.). London: Penguin
work page 1956
-
[25]
Pollard, R. and C. Reep (1997). Measuring the effectiveness of playing strategies at soccer. Journal of the Royal Statistical Society Series D: The Statistician\/ 46\/ (4), 541--550
work page 1997
-
[26]
Powell, M. J. (1964). An efficient method for finding the minimum of a function of several variables without calculating derivatives. The Computer Journal\/ 7\/ (2), 155--162
work page 1964
-
[27]
Reep, C. and B. Benjamin (1968). Skill and chance in association football. Journal of the Royal Statistical Society. Series A (General)\/ 131\/ (4), 581--585
work page 1968
-
[28]
Robberechts, P., J. Van Haaren, and J. Davis (2021). A bayesian approach to in-game win probability in soccer. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining , pp.\ 3512--3521
work page 2021
-
[29]
Silva, R. M. and T. B. Swartz (2016). Analysis of substitution times in soccer. Journal of Quantitative Analysis in Sports\/ 12\/ (3), 113--122
work page 2016
-
[30]
Singh, K. (2019). Introducing expected threat ( xT ). Accessed: 2026-01-29
work page 2019
-
[31]
S trumbelj, E. and M. R. S ikonja (2010). Online bookmakers’ odds as forecasts: The case of european soccer leagues. International Journal of Forecasting\/ 26\/ (3), 482--488
work page 2010
-
[32]
S trumbelj, E. and P. Vra c ar (2012). Simulating a basketball match with a homogeneous markov model and forecasting the outcome. International Journal of Forecasting\/ 28\/ (2), 532--542
work page 2012
-
[33]
Titman, A., D. Costain, P. Ridall, and K. Gregory (2015). Joint modelling of goals and bookings in association football. Journal of the Royal Statistical Society Series A: Statistics in Society\/ 178\/ (3), 659--683
work page 2015
-
[34]
Volf, P. (2009). A random point process model for the score in sport matches. IMA Journal of Management Mathematics\/ 20\/ (2), 121--131
work page 2009
-
[35]
Watanabe, N. M., P. Wicker, and J. C. Reuter (2015). Determinants of stoppage time awarded to teams in the english premier league. International Journal of Sport Finance\/ 10\/ (4), 310--327
work page 2015
-
[36]
Wheatcroft, E. (2021). Evaluating probabilistic forecasts of football matches: the case against the ranked probability score. Journal of Quantitative Analysis in Sports\/ 17\/ (4), 273--287
work page 2021
-
[37]
Wolfers, J. and E. Zitzewitz (2004). Prediction markets. Journal of Economic Perspectives\/ 18\/ (2), 107--126
work page 2004
-
[38]
Wunderlich, F. (2025). Using the wisdom of crowds in sports: how performance analysis in football can benefit from the information enclosed in betting odds. International Journal of Performance Analysis in Sport\/ 25\/ (4), 687--706
work page 2025
-
[39]
Wunderlich, F., M. G. Caparr \'o s, and D. Memmert (2026). Does the consideration of market prices in model selection increase model profitability? evidence from theory, artificial data and real-world data. In press, International Journal of Forecasting
work page 2026
-
[40]
Wunderlich, F. and D. Memmert (2020). Are betting returns a useful measure of accuracy in (sports) forecasting? International Journal of Forecasting\/ 36\/ (2), 713--722
work page 2020
-
[41]
Zou, Q., K. Song, and J. Shi (2020). A bayesian in-play prediction model for association football outcomes. Applied Sciences\/ 10\/ (8), 2904
work page 2020
-
[42]
Survival modeling of goal arrival times in
Leriou, Ilias and Ntzoufras, Ioannis , journal=. Survival modeling of goal arrival times in. 2025 , publisher=
work page 2025
-
[43]
The Statistical Analysis of Failure Time Data , author=. 2011 , publisher=
work page 2011
-
[44]
Journal of Quantitative Analysis in Sports , volume=
Analysis of substitution times in soccer , author=. Journal of Quantitative Analysis in Sports , volume=. 2016 , doi=
work page 2016
-
[45]
Analysis of sports data by using bivariate
Karlis, Dimitris and Ntzoufras, Ioannis , journal=. Analysis of sports data by using bivariate. 2003 , publisher=
work page 2003
-
[46]
Statistica Neerlandica , volume=
Modelling association football scores , author=. Statistica Neerlandica , volume=. 1982 , publisher=
work page 1982
-
[47]
Moroney, M. J. , title =
-
[48]
Journal of the Royal Statistical Society
Skill and chance in association football , author=. Journal of the Royal Statistical Society. Series A (General) , volume=. 1968 , publisher=
work page 1968
-
[49]
Artificial Intelligence, Optimization, and Data Sciences in Sports , pages=
Machine learning for soccer match result prediction , author=. Artificial Intelligence, Optimization, and Data Sciences in Sports , pages=. 2024 , publisher=
work page 2024
-
[50]
Boshnakov, Georgi and Kharrat, Tarak and McHale, Ian G , journal=. A bivariate. 2017 , publisher=
work page 2017
-
[51]
Journal of the Royal Statistical Society: Series C (Applied Statistics) , volume=
Modelling association football scores and inefficiencies in the football betting market , author=. Journal of the Royal Statistical Society: Series C (Applied Statistics) , volume=. 1997 , publisher=
work page 1997
-
[52]
European Journal of Operational Research , volume=
Plus--minus player ratings for soccer , author=. European Journal of Operational Research , volume=. 2020 , publisher=
work page 2020
-
[53]
In-play forecasting in football using event and positional data , author=. Scientific Reports , volume=. 2021 , publisher=
work page 2021
-
[54]
International Journal of Forecasting , volume=
Stochastic modelling of football matches using dynamic regressors , author=. International Journal of Forecasting , volume=. 2026 , publisher=
work page 2026
-
[55]
International Journal of Forecasting , volume=
Forecasting football match results using a player rating based model , author=. International Journal of Forecasting , volume=. 2024 , publisher=
work page 2024
-
[56]
Zou, Qingrong and Song, Kai and Shi, Jian , journal=. A. 2020 , publisher=
work page 2020
-
[57]
Journal of the Royal Statistical Society: Series D (The Statistician) , volume=
A birth process model for association football matches , author=. Journal of the Royal Statistical Society: Series D (The Statistician) , volume=. 1998 , publisher=
work page 1998
-
[58]
IMA Journal of Management Mathematics , volume=
A random point process model for the score in sport matches , author=. IMA Journal of Management Mathematics , volume=. 2009 , publisher=
work page 2009
-
[59]
Journal of the Royal Statistical Society Series A: Statistics in Society , volume=
Joint modelling of goals and bookings in association football , author=. Journal of the Royal Statistical Society Series A: Statistics in Society , volume=. 2015 , publisher=
work page 2015
-
[60]
Robberechts, Pieter and Van Haaren, Jan and Davis, Jesse , booktitle=. A. 2021 , doi=
work page 2021
- [61]
-
[62]
We know who wins: graph-oriented approaches of passing networks for predictive football match outcomes , author=. Journal of Big Data , volume=. 2025 , publisher=
work page 2025
- [63]
-
[64]
Journal of Applied Meteorology (1962-1982) , volume=
A scoring system for probability forecasts of ranked categories , author=. Journal of Applied Meteorology (1962-1982) , volume=. 1969 , publisher=
work page 1962
-
[65]
Journal of Quantitative Analysis in Sports , volume=
Solving the problem of inadequate scoring rules for assessing probabilistic football forecast models , author=. Journal of Quantitative Analysis in Sports , volume=. 2012 , publisher=
work page 2012
-
[66]
Journal of Quantitative Analysis in Sports , volume=
Evaluating probabilistic forecasts of football matches: the case against the ranked probability score , author=. Journal of Quantitative Analysis in Sports , volume=. 2021 , publisher=
work page 2021
-
[67]
International Journal of Forecasting , volume=
Not feeling the buzz: Correction study of mispricing and inefficiency in online sportsbooks , author=. International Journal of Forecasting , volume=. 2025 , publisher=
work page 2025
-
[68]
Determinants of stoppage time awarded to teams in the
Watanabe, Nicholas M and Wicker, Pamela and Reuter, James C , journal=. Determinants of stoppage time awarded to teams in the. 2015 , publisher=
work page 2015
-
[69]
The Computer Journal , volume=
An efficient method for finding the minimum of a function of several variables without calculating derivatives , author=. The Computer Journal , volume=. 1964 , publisher=
work page 1964
-
[70]
Journal of Economic Perspectives , volume=
Prediction markets , author=. Journal of Economic Perspectives , volume=. 2004 , publisher=
work page 2004
-
[71]
Odds-setters as forecasters: The case of
Forrest, David and Goddard, John and Simmons, Robert , journal=. Odds-setters as forecasters: The case of. 2005 , publisher=
work page 2005
-
[72]
International Journal of Forecasting , volume=
Online bookmakers' odds as forecasts: The case of. International Journal of Forecasting , volume=. 2010 , publisher=
work page 2010
-
[73]
International Journal of Performance Analysis in Sport , volume=
Using the wisdom of crowds in sports: how performance analysis in football can benefit from the information enclosed in betting odds , author=. International Journal of Performance Analysis in Sport , volume=. 2025 , publisher=
work page 2025
-
[74]
International Journal of Forecasting , volume=
A calibration method with dynamic updates for within-match forecasting of wins in tennis , author=. International Journal of Forecasting , volume=. 2019 , publisher=
work page 2019
-
[75]
International Journal of Forecasting , volume=
Simulating a basketball match with a homogeneous. International Journal of Forecasting , volume=. 2012 , publisher=
work page 2012
-
[76]
European Journal of Operational Research , volume=
Forecasting the winner of a tennis match , author=. European Journal of Operational Research , volume=. 2003 , publisher=
work page 2003
-
[77]
Statistical Modelling , volume=
Combining historical data and bookmakers' odds in modelling football scores , author=. Statistical Modelling , volume=. 2018 , publisher=
work page 2018
-
[78]
The Bell System Technical Journal , volume=
A new interpretation of information rate , author=. The Bell System Technical Journal , volume=. 1956 , publisher=
work page 1956
-
[79]
Journal of Petroleum Technology , volume=
Competitive bidding in high-risk situations , author=. Journal of Petroleum Technology , volume=. 1971 , publisher=
work page 1971
-
[80]
Journal of the Royal Statistical Society Series D: The Statistician , volume=
Measuring the effectiveness of playing strategies at soccer , author=. Journal of the Royal Statistical Society Series D: The Statistician , volume=. 1997 , publisher=
work page 1997
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.