A Model-Based Restricted Shapley Value to Measure the Players' Contribution to Shot Actions in Football
Pith reviewed 2026-05-15 12:36 UTC · model grok-4.3
The pith
A restricted Shapley value computed only over observed passing coalitions and valued by xGA measures each football player's marginal contribution to shot actions.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By restricting Shapley-value coalitions to subsets of players connected through observed passes and using xGA as the coalition value function, the PRS statistic yields interpretable, action-specific marginal contributions that differ across teammates within the same club, as shown in the 2022/23 Serie A data for AC Milan and SSC Napoli.
What carries the argument
The Player's Restricted Shapley (PRS) statistic, which calculates marginal contributions over only the tactically admissible player subsets defined by the season's passing network and values each subset with the xGA cohesion function.
If this is right
- PRS supplies action-specific numbers that distinguish players who create shots through build-up from those who only finish them.
- Pairing PRS with a final efficiency score exposes gaps between a player's cooperative involvement and actual goal conversion.
- Team-level analysis of 8,421 shots shows measurable differences in contribution patterns inside the same squad.
- The framework supplies a quantitative input for scouting and squad-building decisions that already incorporate passing networks.
Where Pith is reading between the lines
- Transfer-market valuations could add a PRS term to better price players whose main value lies in facilitating shots rather than scoring them.
- The same restricted-coalition logic could be tested on defensive actions or set pieces once comparable interaction data exist.
- Coaches might experiment with line-ups that maximize high-PRS pairings identified from recent matches.
Load-bearing premise
Observed passing interactions in the data correctly define the tactically admissible coalitions whose xGA values reflect true individual impact without distortion from opponent pressure or missing tracking information.
What would settle it
Recomputing PRS rankings after replacing the passing-based coalitions with random subsets of the same size and checking whether the new rankings lose all correlation with independent video-coded impact ratings or future goal-creation rates.
read the original abstract
This paper proposes a novel framework to assess individual player contributions in football, explicitly accounting for the cooperative nature of shot-ending offensive actions. By incorporating team interaction into player evaluation, it also supports economically sustainable decision-making, with practical implications for performance analysis and player scouting. Extending the expected Goal (xG) paradigm, we propose the expected Goal Action (xGA), a measure of shot quality that incorporates build-up play and passing networks. Furthermore, we adapt cooperative game theory and introduce the Player's Restricted Shapley (PRS) statistic, a contribution metric based on restricted coalition structures derived from observed passing interactions, where xGA is adopted to compute the cohesion function. Unlike traditional Shapley approaches, the PRS one restricts coalitions to tactically admissible player subsets, offering action-specific, interpretable measures of marginal contribution in a cooperative context. We apply the framework to 8,421 shot-actions from the Italian League Serie A season 2022/23, and the case studies of AC Milan and SSC Napoli reveal some heterogeneity in contributions within teams. Furthermore, combining the PRS statistic with a final efficiency metric highlights the discrepancies between cooperative engagement and goal conversion.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes extending the expected goals (xG) framework with expected Goal Action (xGA), which incorporates build-up play and passing networks to evaluate shot quality. It introduces the Player's Restricted Shapley (PRS) statistic, adapting cooperative game theory by restricting coalitions to subsets observed in passing interactions and using xGA as the cohesion function to quantify individual marginal contributions to shot-ending actions. The approach is applied to 8,421 shot-actions from Serie A 2022/23, with case studies on AC Milan and SSC Napoli illustrating within-team heterogeneity in contributions and discrepancies between cooperative engagement and goal conversion.
Significance. If the framework is rigorously validated, PRS could advance player evaluation in football analytics by explicitly modeling cooperative interactions, offering more interpretable and action-specific metrics than standard xG or traditional Shapley values. This has potential value for scouting and performance analysis, particularly in distinguishing individual skill from team passing patterns, provided the restrictions and value function are shown to be unbiased.
major comments (3)
- [Abstract and framework description] Framework description (abstract and methods): No equations or algorithmic details are supplied for computing xGA from passing networks or for evaluating xGA on restricted coalitions; without these, it is impossible to verify how marginal contributions are obtained or whether re-simulation of build-ups is performed.
- [PRS definition and coalition restriction] PRS construction (methods): The restriction of coalitions to observed passing sequences is presented without justification or sensitivity analysis; the manuscript does not demonstrate that unobserved but tactically feasible combinations are correctly excluded rather than merely missing due to tracking gaps or opponent pressure, which directly affects the validity of the marginal contribution claims.
- [Application to Serie A data and case studies] Empirical application (results): The analysis of 8,421 actions reports case-study heterogeneity but supplies no validation of xGA against realized goals, no calibration metrics, and no error analysis; this leaves the reported PRS values and efficiency discrepancies without external grounding.
minor comments (1)
- [Notation and definitions] Notation: The relationship between standard xG and the proposed xGA would benefit from an explicit comparison table or equation highlighting the additional network terms.
Simulated Author's Rebuttal
We thank the referee for the thoughtful and constructive report. We address each major comment below and indicate the revisions we will make to strengthen the manuscript.
read point-by-point responses
-
Referee: Framework description (abstract and methods): No equations or algorithmic details are supplied for computing xGA from passing networks or for evaluating xGA on restricted coalitions; without these, it is impossible to verify how marginal contributions are obtained or whether re-simulation of build-ups is performed.
Authors: We agree that the current version lacks explicit equations. In the revised manuscript we will insert the full mathematical definition of xGA as a function of the passing network (including the precise cohesion value derived from build-up sequences) and the algorithm for evaluating it on the restricted coalitions. We will also state explicitly that no re-simulation of unobserved build-ups is performed; xGA is computed directly from the observed sequences only. revision: yes
-
Referee: PRS construction (methods): The restriction of coalitions to observed passing sequences is presented without justification or sensitivity analysis; the manuscript does not demonstrate that unobserved but tactically feasible combinations are correctly excluded rather than merely missing due to tracking gaps or opponent pressure, which directly affects the validity of the marginal contribution claims.
Authors: The restriction is intentional and data-driven: only coalitions that actually occurred in the recorded action are admitted, thereby reflecting the realized cooperative structure. We will add a dedicated paragraph in the methods section justifying this choice on substantive grounds (tactical admissibility within the specific action) and will include a sensitivity analysis that perturbs the observed coalitions (e.g., by adding or removing low-probability links) to quantify robustness of the PRS values. revision: yes
-
Referee: Empirical application (results): The analysis of 8,421 actions reports case-study heterogeneity but supplies no validation of xGA against realized goals, no calibration metrics, and no error analysis; this leaves the reported PRS values and efficiency discrepancies without external grounding.
Authors: We acknowledge that the present results section focuses on illustrative case studies rather than formal validation. In the revision we will add calibration plots of xGA versus realized goal outcomes, Brier scores, and a brief error analysis for the xGA model. These additions will provide the external grounding requested while preserving the paper’s primary emphasis on the PRS metric. revision: yes
Circularity Check
PRS marginal contributions reduce to xGA values on empirically observed coalitions by construction
specific steps
-
fitted input called prediction
[Abstract (PRS definition) and methods section on coalition restriction]
"we adapt cooperative game theory and introduce the Player's Restricted Shapley (PRS) statistic, a contribution metric based on restricted coalition structures derived from observed passing interactions, where xGA is adopted to compute the cohesion function. Unlike traditional Shapley approaches, the PRS one restricts coalitions to tactically admissible player subsets"
The paper first constructs xGA from the identical set of 8,421 shot-actions and their passing networks, then defines PRS by restricting coalitions to exactly the subsets that appear in those observed sequences and valuing them with the same xGA. The resulting marginal contributions are therefore computed directly from the empirical interaction graph and the xGA fit on the same data, reducing the 'prediction' of individual impact to a re-expression of the input quantities.
full rationale
The central derivation defines xGA from the same passing networks and shot-actions, then restricts Shapley coalitions exactly to the observed subsets of those actions and values them with the same xGA function. This makes the reported player contributions a direct algebraic function of the input data's interaction graph and xGA estimates, with no external benchmark or independent model for admissible coalitions. The paper's own description confirms the restriction is taken from observed sequences and xGA is adopted as the cohesion function, satisfying the fitted-input-called-prediction pattern at the core of the PRS statistic.
Axiom & Free-Parameter Ledger
free parameters (1)
- xGA model parameters
axioms (1)
- domain assumption Observed passing interactions define tactically admissible coalitions
invented entities (2)
-
xGA
no independent evidence
-
PRS statistic
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We adapt cooperative game theory and introduce the Player's Restricted Shapley (PRS) statistic, a contribution metric based on restricted coalition structures derived from observed passing interactions, where xGA is adopted to compute the cohesion function.
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
ϕ^R_i(υ) = sum_{C in C_i} w_i(C) [υ(C ∪ {i}) − υ(C)] with weights normalized over restricted support C_i
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Cefis, M. (2022). Football analytics: a bibliometric study about the last decade contributions.Electronic Journal of Applied Statistical Analysis, 15(1), 232–248
work page 2022
-
[2]
Leriou, I., Ntzoufras, I. (2025). Survival modeling of goal arrival times in English premier league.Computational Statistics, 40(4), 2109–2133
work page 2025
-
[3]
Carpita, M., Sandri, M., Simonetto, A., Zuccolotto, P. (2015). Discovering the drivers of football match outcomes with data mining.Quality Technology & Quantitative Management, 12(4), 561–577
work page 2015
-
[4]
Riboli, A., Nardi, F., Osti, M., Cefis, M., Tesoro, G., Mazzoni, S. (2025). Training load, official match locomotor demand, and their association in top-class soccer players during a full competitive season.The Journal of Strength & Conditioning Research, 39(2), 249–259
work page 2025
-
[5]
Carpita, M., Golia, S. (2021). Discovering associations between players’ perfor- mance indicators and matches’ results in the European Soccer Leagues.Journal of Applied Statistics, 48(9), 1696–1711. 18
work page 2021
-
[6]
Pappalardo, L., Cintia, P., Ferragina, P., Massucco, E., Pedreschi, D., Giannotti, F. (2019). PlayeRank: data-driven performance evaluation and player ranking in soccer via a machine learning approach.ACM Transactions on Intelligent Systems and Technology (TIST), 10(5), 1–27
work page 2019
-
[7]
Fairchild, A., Pelechrinis, K., Kokkodis, M. (2018). Spatial analysis of shots in MLS: a model for expected goals and fractal dimensionality.Journal of Sports Analytics, 4(3), 165–174
work page 2018
-
[8]
Robberechts, P., Davis, J. (2020). How Data Availability Affects the Ability to Learn Good xG Models. InMachine Learning and Data Mining for Sports Analytics, pp. 17–27. Springer
work page 2020
-
[9]
Cefis, M., Carpita, M. (2025). Accuracy and explainability of statistical and machine learning xG models in football.Statistics, 59(2), 426–445
work page 2025
-
[10]
Ruan, L., Ge, H., Shen, Y., Pu, Z., Zong, S., Cui, Y. (2022). Quantifying the effectiveness of defensive playing styles in the Chinese Football Super League. Frontiers in Psychology, 1–10
work page 2022
-
[11]
Karim, H., Lotfi, M. (2023). The Kos Angle, an optimizing parameter for football expected goals (xG) models.International Journal of Computer Science in Sport, 22(2), 49–61
work page 2023
-
[12]
Cefis, M., Carpita, M. (2024). A new xG model for football analytics.Journal of the Operational Research Society, 1–13
work page 2024
-
[13]
Barthelemy, B., Rav´ e, G., Govindasamy, K., Ali, A., Del Coso, J., Demeaux, J., Bideau, B., Zouha, H. (2024). Impact of technical-tactical and physical perfor- mance on the match outcome in professional soccer: A case study.Journal of Human Kinetics, 94, 203
work page 2024
-
[14]
Hassani, K., Ramdani, M., Lotfi, M. (2025). Dynamic Expected Threat (DxT) Model: Addressing the Deficit of Realism in Football Action Evaluation.Applied Sciences, 15(8), 4151
work page 2025
-
[15]
Chakraborty, K. (n.d.). Expected Threat (xT): The Value of a Soccer Possession. Available at:https://karun.in/blog/expected-threat.html. Accessed: 2025- 07-30
work page 2025
-
[16]
Shapley, L. S. (1953). A value forn-person games. In H. W. Kuhn and A. W. Tucker (Eds.),Contributions to the Theory of Games, pp. 307–317. Princeton University Press
work page 1953
-
[17]
Myerson, R. B. (1977). Graphs and cooperation in games.Mathematics of Operations Research, 2(3), 225–229
work page 1977
-
[18]
Metulini, R., Gnecco, G. (2023). Measuring players’ importance in basketball using the generalized Shapley value.Annals of Operations Research, 325(1), 441– 465
work page 2023
-
[19]
Auer, B. R., Hiller, T. (2015). On the evaluation of soccer players: a comparison of a new game-theoretical approach to classic performance measures.Applied Economics Letters, 22(14), 1100–1107
work page 2015
-
[20]
Hiller, T. (2015). The importance of players in teams of the German Bundesliga in the season 2012/2013: a cooperative game theory approach.Applied Economics Letters, 22(4), 324–329
work page 2015
-
[21]
Metulini, R., Cefis, M. (2024). Some novelty on the XG model for Football 19 Analytics. InEURO 2024 Conference Handbook & Abstracts, pp. 248–249
work page 2024
-
[22]
Maschler, M., Zamir, S., Solan, E. (2020).Game Theory. Cambridge University Press
work page 2020
-
[23]
Castro, J., G´ omez, D., Tejada, J. (2009). Polynomial calculation of the Shapley value based on sampling.Computers & operations research, 36(5), 1726–1730
work page 2009
-
[24]
Derks, J., Peters, H. (1993). A Shapley value for games with restricted coalitions. International Journal of Game Theory, 21(4), 351–360
work page 1993
-
[25]
Breiman, L. (2001). Statistical modeling: The two cultures.Statistical Science, 16(3), 199–231
work page 2001
-
[26]
W., Lemeshow, S., Sturdivant, R
Hosmer, D. W., Lemeshow, S., Sturdivant, R. X. (2013).Applied Logistic Regression. John Wiley & Sons
work page 2013
-
[27]
Chen, T., Guestrin, C. (2016). XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD, pp. 785–794
work page 2016
-
[28]
Cavus, M., Biecek, P. (2022). Explainable expected goal models for performance analysis in football analytics. In2022 IEEE 9th DSAA, pp. 1–9
work page 2022
-
[29]
Chicco, D., Jurman, G. (2020). The advantages of the Matthews correlation coefficient (MCC).BMC Genomics, 21, 1–13
work page 2020
-
[30]
Brier, G. W. (1950). Verification of forecasts expressed in terms of probability. Monthly Weather Review, 78(1), 1–3
work page 1950
-
[31]
Cefis, M., Carpita, M. (2024). The higher-order PLS-SEM confirmatory approach for composite indicators of football performance quality.Computational Statistics, 39(1), 93–116
work page 2024
-
[32]
(2021).Explanatory Model Analysis
Biecek, P., Burzykowski, T. (2021).Explanatory Model Analysis. Chapman and Hall/CRC
work page 2021
-
[33]
Cefis, M., Metulini, R., Carpita, M. (2025). A New Dataset for Exploring Actions of Italian Football Matches. InIES 2025, pp. 624–630. 20
work page 2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.