pith. sign in

arxiv: 2603.11016 · v2 · submitted 2026-03-11 · 📊 stat.AP

A Model-Based Restricted Shapley Value to Measure the Players' Contribution to Shot Actions in Football

Pith reviewed 2026-05-15 12:36 UTC · model grok-4.3

classification 📊 stat.AP
keywords football analyticsShapley valueplayer contributionxGApassing networkscooperative gamessports performanceSerie A
0
0 comments X

The pith

A restricted Shapley value computed only over observed passing coalitions and valued by xGA measures each football player's marginal contribution to shot actions.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper adapts cooperative game theory to football by defining coalitions strictly from real passing interactions rather than all possible player combinations. It extends the expected goals idea into xGA to score the full build-up sequence leading to a shot. The resulting Player's Restricted Shapley statistic then assigns credit only within those data-derived groups, producing action-level numbers that reflect actual team play. A reader would care because standard individual stats ignore the cooperative setup of shots and therefore misattribute value to scorers versus creators.

Core claim

By restricting Shapley-value coalitions to subsets of players connected through observed passes and using xGA as the coalition value function, the PRS statistic yields interpretable, action-specific marginal contributions that differ across teammates within the same club, as shown in the 2022/23 Serie A data for AC Milan and SSC Napoli.

What carries the argument

The Player's Restricted Shapley (PRS) statistic, which calculates marginal contributions over only the tactically admissible player subsets defined by the season's passing network and values each subset with the xGA cohesion function.

If this is right

  • PRS supplies action-specific numbers that distinguish players who create shots through build-up from those who only finish them.
  • Pairing PRS with a final efficiency score exposes gaps between a player's cooperative involvement and actual goal conversion.
  • Team-level analysis of 8,421 shots shows measurable differences in contribution patterns inside the same squad.
  • The framework supplies a quantitative input for scouting and squad-building decisions that already incorporate passing networks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Transfer-market valuations could add a PRS term to better price players whose main value lies in facilitating shots rather than scoring them.
  • The same restricted-coalition logic could be tested on defensive actions or set pieces once comparable interaction data exist.
  • Coaches might experiment with line-ups that maximize high-PRS pairings identified from recent matches.

Load-bearing premise

Observed passing interactions in the data correctly define the tactically admissible coalitions whose xGA values reflect true individual impact without distortion from opponent pressure or missing tracking information.

What would settle it

Recomputing PRS rankings after replacing the passing-based coalitions with random subsets of the same size and checking whether the new rankings lose all correlation with independent video-coded impact ratings or future goal-creation rates.

read the original abstract

This paper proposes a novel framework to assess individual player contributions in football, explicitly accounting for the cooperative nature of shot-ending offensive actions. By incorporating team interaction into player evaluation, it also supports economically sustainable decision-making, with practical implications for performance analysis and player scouting. Extending the expected Goal (xG) paradigm, we propose the expected Goal Action (xGA), a measure of shot quality that incorporates build-up play and passing networks. Furthermore, we adapt cooperative game theory and introduce the Player's Restricted Shapley (PRS) statistic, a contribution metric based on restricted coalition structures derived from observed passing interactions, where xGA is adopted to compute the cohesion function. Unlike traditional Shapley approaches, the PRS one restricts coalitions to tactically admissible player subsets, offering action-specific, interpretable measures of marginal contribution in a cooperative context. We apply the framework to 8,421 shot-actions from the Italian League Serie A season 2022/23, and the case studies of AC Milan and SSC Napoli reveal some heterogeneity in contributions within teams. Furthermore, combining the PRS statistic with a final efficiency metric highlights the discrepancies between cooperative engagement and goal conversion.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 1 minor

Summary. The manuscript proposes extending the expected goals (xG) framework with expected Goal Action (xGA), which incorporates build-up play and passing networks to evaluate shot quality. It introduces the Player's Restricted Shapley (PRS) statistic, adapting cooperative game theory by restricting coalitions to subsets observed in passing interactions and using xGA as the cohesion function to quantify individual marginal contributions to shot-ending actions. The approach is applied to 8,421 shot-actions from Serie A 2022/23, with case studies on AC Milan and SSC Napoli illustrating within-team heterogeneity in contributions and discrepancies between cooperative engagement and goal conversion.

Significance. If the framework is rigorously validated, PRS could advance player evaluation in football analytics by explicitly modeling cooperative interactions, offering more interpretable and action-specific metrics than standard xG or traditional Shapley values. This has potential value for scouting and performance analysis, particularly in distinguishing individual skill from team passing patterns, provided the restrictions and value function are shown to be unbiased.

major comments (3)
  1. [Abstract and framework description] Framework description (abstract and methods): No equations or algorithmic details are supplied for computing xGA from passing networks or for evaluating xGA on restricted coalitions; without these, it is impossible to verify how marginal contributions are obtained or whether re-simulation of build-ups is performed.
  2. [PRS definition and coalition restriction] PRS construction (methods): The restriction of coalitions to observed passing sequences is presented without justification or sensitivity analysis; the manuscript does not demonstrate that unobserved but tactically feasible combinations are correctly excluded rather than merely missing due to tracking gaps or opponent pressure, which directly affects the validity of the marginal contribution claims.
  3. [Application to Serie A data and case studies] Empirical application (results): The analysis of 8,421 actions reports case-study heterogeneity but supplies no validation of xGA against realized goals, no calibration metrics, and no error analysis; this leaves the reported PRS values and efficiency discrepancies without external grounding.
minor comments (1)
  1. [Notation and definitions] Notation: The relationship between standard xG and the proposed xGA would benefit from an explicit comparison table or equation highlighting the additional network terms.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the thoughtful and constructive report. We address each major comment below and indicate the revisions we will make to strengthen the manuscript.

read point-by-point responses
  1. Referee: Framework description (abstract and methods): No equations or algorithmic details are supplied for computing xGA from passing networks or for evaluating xGA on restricted coalitions; without these, it is impossible to verify how marginal contributions are obtained or whether re-simulation of build-ups is performed.

    Authors: We agree that the current version lacks explicit equations. In the revised manuscript we will insert the full mathematical definition of xGA as a function of the passing network (including the precise cohesion value derived from build-up sequences) and the algorithm for evaluating it on the restricted coalitions. We will also state explicitly that no re-simulation of unobserved build-ups is performed; xGA is computed directly from the observed sequences only. revision: yes

  2. Referee: PRS construction (methods): The restriction of coalitions to observed passing sequences is presented without justification or sensitivity analysis; the manuscript does not demonstrate that unobserved but tactically feasible combinations are correctly excluded rather than merely missing due to tracking gaps or opponent pressure, which directly affects the validity of the marginal contribution claims.

    Authors: The restriction is intentional and data-driven: only coalitions that actually occurred in the recorded action are admitted, thereby reflecting the realized cooperative structure. We will add a dedicated paragraph in the methods section justifying this choice on substantive grounds (tactical admissibility within the specific action) and will include a sensitivity analysis that perturbs the observed coalitions (e.g., by adding or removing low-probability links) to quantify robustness of the PRS values. revision: yes

  3. Referee: Empirical application (results): The analysis of 8,421 actions reports case-study heterogeneity but supplies no validation of xGA against realized goals, no calibration metrics, and no error analysis; this leaves the reported PRS values and efficiency discrepancies without external grounding.

    Authors: We acknowledge that the present results section focuses on illustrative case studies rather than formal validation. In the revision we will add calibration plots of xGA versus realized goal outcomes, Brier scores, and a brief error analysis for the xGA model. These additions will provide the external grounding requested while preserving the paper’s primary emphasis on the PRS metric. revision: yes

Circularity Check

1 steps flagged

PRS marginal contributions reduce to xGA values on empirically observed coalitions by construction

specific steps
  1. fitted input called prediction [Abstract (PRS definition) and methods section on coalition restriction]
    "we adapt cooperative game theory and introduce the Player's Restricted Shapley (PRS) statistic, a contribution metric based on restricted coalition structures derived from observed passing interactions, where xGA is adopted to compute the cohesion function. Unlike traditional Shapley approaches, the PRS one restricts coalitions to tactically admissible player subsets"

    The paper first constructs xGA from the identical set of 8,421 shot-actions and their passing networks, then defines PRS by restricting coalitions to exactly the subsets that appear in those observed sequences and valuing them with the same xGA. The resulting marginal contributions are therefore computed directly from the empirical interaction graph and the xGA fit on the same data, reducing the 'prediction' of individual impact to a re-expression of the input quantities.

full rationale

The central derivation defines xGA from the same passing networks and shot-actions, then restricts Shapley coalitions exactly to the observed subsets of those actions and values them with the same xGA function. This makes the reported player contributions a direct algebraic function of the input data's interaction graph and xGA estimates, with no external benchmark or independent model for admissible coalitions. The paper's own description confirms the restriction is taken from observed sequences and xGA is adopted as the cohesion function, satisfying the fitted-input-called-prediction pattern at the core of the PRS statistic.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 2 invented entities

The framework rests on two new constructs (xGA and PRS) plus the domain assumption that passing data directly encodes admissible coalitions; no independent evidence is supplied for either construct.

free parameters (1)
  • xGA model parameters
    xGA extends xG and must incorporate fitted coefficients for build-up and passing features; these are not enumerated but are required to compute the cohesion function.
axioms (1)
  • domain assumption Observed passing interactions define tactically admissible coalitions
    The restriction step in PRS assumes that the passing network observed in each action accurately reflects which player subsets are allowed to cooperate.
invented entities (2)
  • xGA no independent evidence
    purpose: Shot-quality measure that incorporates build-up play and passing networks
    New quantity introduced to serve as the value function for the cooperative game.
  • PRS statistic no independent evidence
    purpose: Player contribution metric based on restricted Shapley values
    New statistic defined on top of xGA and the restricted coalitions.

pith-pipeline@v0.9.0 · 5506 in / 1421 out tokens · 31382 ms · 2026-05-15T12:36:01.872351+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

33 extracted references · 33 canonical work pages

  1. [1]

    Cefis, M. (2022). Football analytics: a bibliometric study about the last decade contributions.Electronic Journal of Applied Statistical Analysis, 15(1), 232–248

  2. [2]

    Leriou, I., Ntzoufras, I. (2025). Survival modeling of goal arrival times in English premier league.Computational Statistics, 40(4), 2109–2133

  3. [3]

    Carpita, M., Sandri, M., Simonetto, A., Zuccolotto, P. (2015). Discovering the drivers of football match outcomes with data mining.Quality Technology & Quantitative Management, 12(4), 561–577

  4. [4]

    Riboli, A., Nardi, F., Osti, M., Cefis, M., Tesoro, G., Mazzoni, S. (2025). Training load, official match locomotor demand, and their association in top-class soccer players during a full competitive season.The Journal of Strength & Conditioning Research, 39(2), 249–259

  5. [5]

    Carpita, M., Golia, S. (2021). Discovering associations between players’ perfor- mance indicators and matches’ results in the European Soccer Leagues.Journal of Applied Statistics, 48(9), 1696–1711. 18

  6. [6]

    Pappalardo, L., Cintia, P., Ferragina, P., Massucco, E., Pedreschi, D., Giannotti, F. (2019). PlayeRank: data-driven performance evaluation and player ranking in soccer via a machine learning approach.ACM Transactions on Intelligent Systems and Technology (TIST), 10(5), 1–27

  7. [7]

    Fairchild, A., Pelechrinis, K., Kokkodis, M. (2018). Spatial analysis of shots in MLS: a model for expected goals and fractal dimensionality.Journal of Sports Analytics, 4(3), 165–174

  8. [8]

    Robberechts, P., Davis, J. (2020). How Data Availability Affects the Ability to Learn Good xG Models. InMachine Learning and Data Mining for Sports Analytics, pp. 17–27. Springer

  9. [9]

    Cefis, M., Carpita, M. (2025). Accuracy and explainability of statistical and machine learning xG models in football.Statistics, 59(2), 426–445

  10. [10]

    Ruan, L., Ge, H., Shen, Y., Pu, Z., Zong, S., Cui, Y. (2022). Quantifying the effectiveness of defensive playing styles in the Chinese Football Super League. Frontiers in Psychology, 1–10

  11. [11]

    Karim, H., Lotfi, M. (2023). The Kos Angle, an optimizing parameter for football expected goals (xG) models.International Journal of Computer Science in Sport, 22(2), 49–61

  12. [12]

    Cefis, M., Carpita, M. (2024). A new xG model for football analytics.Journal of the Operational Research Society, 1–13

  13. [13]

    Barthelemy, B., Rav´ e, G., Govindasamy, K., Ali, A., Del Coso, J., Demeaux, J., Bideau, B., Zouha, H. (2024). Impact of technical-tactical and physical perfor- mance on the match outcome in professional soccer: A case study.Journal of Human Kinetics, 94, 203

  14. [14]

    Hassani, K., Ramdani, M., Lotfi, M. (2025). Dynamic Expected Threat (DxT) Model: Addressing the Deficit of Realism in Football Action Evaluation.Applied Sciences, 15(8), 4151

  15. [15]

    Chakraborty, K. (n.d.). Expected Threat (xT): The Value of a Soccer Possession. Available at:https://karun.in/blog/expected-threat.html. Accessed: 2025- 07-30

  16. [16]

    Shapley, L. S. (1953). A value forn-person games. In H. W. Kuhn and A. W. Tucker (Eds.),Contributions to the Theory of Games, pp. 307–317. Princeton University Press

  17. [17]

    Myerson, R. B. (1977). Graphs and cooperation in games.Mathematics of Operations Research, 2(3), 225–229

  18. [18]

    Metulini, R., Gnecco, G. (2023). Measuring players’ importance in basketball using the generalized Shapley value.Annals of Operations Research, 325(1), 441– 465

  19. [19]

    R., Hiller, T

    Auer, B. R., Hiller, T. (2015). On the evaluation of soccer players: a comparison of a new game-theoretical approach to classic performance measures.Applied Economics Letters, 22(14), 1100–1107

  20. [20]

    Hiller, T. (2015). The importance of players in teams of the German Bundesliga in the season 2012/2013: a cooperative game theory approach.Applied Economics Letters, 22(4), 324–329

  21. [21]

    Metulini, R., Cefis, M. (2024). Some novelty on the XG model for Football 19 Analytics. InEURO 2024 Conference Handbook & Abstracts, pp. 248–249

  22. [22]

    (2020).Game Theory

    Maschler, M., Zamir, S., Solan, E. (2020).Game Theory. Cambridge University Press

  23. [23]

    Castro, J., G´ omez, D., Tejada, J. (2009). Polynomial calculation of the Shapley value based on sampling.Computers & operations research, 36(5), 1726–1730

  24. [24]

    Derks, J., Peters, H. (1993). A Shapley value for games with restricted coalitions. International Journal of Game Theory, 21(4), 351–360

  25. [25]

    Breiman, L. (2001). Statistical modeling: The two cultures.Statistical Science, 16(3), 199–231

  26. [26]

    W., Lemeshow, S., Sturdivant, R

    Hosmer, D. W., Lemeshow, S., Sturdivant, R. X. (2013).Applied Logistic Regression. John Wiley & Sons

  27. [27]

    Chen, T., Guestrin, C. (2016). XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD, pp. 785–794

  28. [28]

    Cavus, M., Biecek, P. (2022). Explainable expected goal models for performance analysis in football analytics. In2022 IEEE 9th DSAA, pp. 1–9

  29. [29]

    Chicco, D., Jurman, G. (2020). The advantages of the Matthews correlation coefficient (MCC).BMC Genomics, 21, 1–13

  30. [30]

    Brier, G. W. (1950). Verification of forecasts expressed in terms of probability. Monthly Weather Review, 78(1), 1–3

  31. [31]

    Cefis, M., Carpita, M. (2024). The higher-order PLS-SEM confirmatory approach for composite indicators of football performance quality.Computational Statistics, 39(1), 93–116

  32. [32]

    (2021).Explanatory Model Analysis

    Biecek, P., Burzykowski, T. (2021).Explanatory Model Analysis. Chapman and Hall/CRC

  33. [33]

    Cefis, M., Metulini, R., Carpita, M. (2025). A New Dataset for Exploring Actions of Italian Football Matches. InIES 2025, pp. 624–630. 20