pith. sign in

arxiv: 2604.11200 · v1 · submitted 2026-04-13 · 💻 cs.LG · cs.AI· stat.ML

ShapShift: Explaining Model Prediction Shifts with Subgroup Conditional Shapley Values

Pith reviewed 2026-05-10 16:09 UTC · model grok-4.3

classification 💻 cs.LG cs.AIstat.ML
keywords prediction shiftShapley valuesdecision treesmodel monitoringexplainable AIdata driftsurrogate models
0
0 comments X

The pith

ShapShift attributes model prediction shifts to changes in conditional probabilities of subgroups defined by decision trees.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes ShapShift as a Shapley value method to explain shifts in average model predictions caused by changes in input distributions. It attributes the overall shift to alterations in the conditional probabilities of subgroups, with those subgroups coming from the structure of decision trees. This approach starts with exact attributions for single trees at each split node and extends to ensembles by picking the most explanatory tree while handling residuals. A model-agnostic version uses surrogate trees grown under a novel objective to apply the same idea to black-box models such as neural networks. The resulting explanations are presented as simple, faithful, and nearly complete, supporting ongoing monitoring of models in environments where data patterns evolve.

Core claim

ShapShift decomposes a model's prediction shift into contributions from changes in the conditional probabilities of interpretable subgroups. For a single decision tree the attributions are exact and follow directly from probability changes at each split node. For tree ensembles the method selects the single tree that best explains the shift and accounts for the remaining effects separately. For arbitrary models it grows surrogate trees with a custom objective function that defines the subgroups, then applies the same conditional Shapley attribution.

What carries the argument

Subgroup conditional Shapley values that attribute the total prediction shift to changes in conditional probabilities along paths defined by decision-tree splits.

If this is right

  • Model monitors can identify the specific subgroups whose probability changes drive a shift in average predictions.
  • Explanations remain interpretable even when the underlying model is a large ensemble or a neural network.
  • Approximation methods make the attributions practical for real-time monitoring despite the cost of exact computation.
  • The same subgroup decomposition can be applied across different model classes without retraining the original model.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If tree-defined subgroups consistently explain shifts, practitioners could use them as early-warning indicators for data drift before the shift becomes large.
  • The surrogate-tree construction suggests that any model whose behavior can be approximated by partitions might admit similar shift explanations without direct access to its internals.
  • The method implicitly treats the tree structure as a sufficient statistic for the relevant aspects of the input distribution.

Load-bearing premise

That subgroups defined by decision tree structures capture the dominant drivers of the observed prediction shift and that selecting one tree plus residual handling suffices for ensembles.

What would settle it

An observed prediction shift where adjusting the conditional probabilities of the attributed subgroups fails to reproduce most of the measured change in average model output.

Figures

Figures reproduced from arXiv: 2604.11200 by Emanuele Albini, Manuela Veloso, Salim I. Amoukou, Saumitra Mishra, Tom Bewley.

Figure 1
Figure 1. Figure 1: Using SHAPSHIFT to attribute an increase in the mean prediction of a loan approval model under a shift P → Q to subgroup conditional probabilities. The reduc￾tion of working people earning < $50k has the largest positive impact. The small unexplained term quantifies the shift not accounted for by the four given factors. explains the ensemble’s shift, account￾ing for residual effects. This requires adding a… view at source ↗
Figure 2
Figure 2. Figure 2: Calculation of SVs for subgroup conditionals to explain prediction shift in a decision tree. [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Application of SHAPSHIFT to a decision tree for house price prediction. 5 Extension #1: SHAPSHIFT for Tree Ensembles Tree ensembles, such as random forests [5] and gradient boosted trees [13], are far more commonly used than single decision trees. We now describe how our method can be extended to such models. Firstly, consider what would change if the 4-leaf tree from [PITH_FULL_IMAGE:figures/full_fig_p00… view at source ↗
Figure 4
Figure 4. Figure 4: SV analysis of one member of a tree ensemble to explain ensemble-wide prediction shift, [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: illustrates this selection process on a random forest of 100 trees, each with 8 leaves, for the Breast Cancer Wisconsin dataset (see Appendix B.1 for details). We define distribution P as all cases reported in January 1989, and Q as those from later dates. Shifting from P to Q induces a prediction shift of −0.253 i.e. a 25.3% reduction in the predicted malignancy rate [PITH_FULL_IMAGE:figures/full_fig_p00… view at source ↗
Figure 6
Figure 6. Figure 6: Comparison of naïve and optimised split criteria for surrogate growth. [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Aggregated performance metrics of SHAPSHIFT on 250 shifts across five Folktables datasets. ↓ / ↑ / ↕ = lower / higher / more separated is better. 7 [PITH_FULL_IMAGE:figures/full_fig_p007_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Impact of tree choice on PercentUnexplained. Impact of Tree Choice In [PITH_FULL_IMAGE:figures/full_fig_p008_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Impact of RF ensemble size on metrics. Comparison to Optimal Transport Baselines Prior shift explanation methods tackle somewhat different problems to ours, but with targeted ad￾ditions, we can adapt two methods proposed by Kulinski & Inouye [21] as baselines. Both begin by finding a pairing between data points from P and Q using optimal transport (OT). The first method (OT-C) then groups the paired points… view at source ↗
Figure 10
Figure 10. Figure 10: Quantitative comparison of SHAPSHIFT to adapted OT baselines on Folktables data. OT-S OT-C Ours [PITH_FULL_IMAGE:figures/full_fig_p009_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Qualitative shift representations used by our method and baselines. More qualitatively, we argue that SHAPSHIFT ex￾planations achieve a stronger balance between in￾terpretability and information content than the OT baselines. As shown in [PITH_FULL_IMAGE:figures/full_fig_p009_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Comparison of Kernel SHAP and pruning methods for handling large trees. Alternatively, the structure of our problem affords another way of handling large trees. Since the largest SVs are almost always for conditionals of split nodes near the root of a tree (which impact the largest pro￾portions of data), we can obtain very similar expla￾nations by applying the exact method to a heavily￾pruned version of t… view at source ↗
Figure 13
Figure 13. Figure 13: Illustration of a counterintuitive result when performing the SV analysis using subgroup [PITH_FULL_IMAGE:figures/full_fig_p014_13.png] view at source ↗
Figure 14
Figure 14. Figure 14: Interpretation of our method as interventions on a causal graph. [PITH_FULL_IMAGE:figures/full_fig_p016_14.png] view at source ↗
Figure 15
Figure 15. Figure 15: Empirical demonstration that P S⊆C w(S)ZS(l) ≈ P (l)+Q(l) 2 in expectation. 17 [PITH_FULL_IMAGE:figures/full_fig_p017_15.png] view at source ↗
Figure 16
Figure 16. Figure 16: PercentUnexplained results for random forest and gradient boosted tree ensembles. [PITH_FULL_IMAGE:figures/full_fig_p022_16.png] view at source ↗
read the original abstract

Changes in input distribution can induce shifts in the average predictions of machine learning models. Such prediction shifts may impact downstream business outcomes (e.g. a bank's loan approval rate), so understanding their causes can be crucial. We propose \ours{}: a Shapley value method for attributing prediction shifts to changes in the conditional probabilities of interpretable subgroups of data, where these subgroups are defined by the structure of decision trees. We initially apply this method to single decision trees, providing exact explanations based on conditional probability changes at split nodes. Next, we extend it to tree ensembles by selecting the most explanatory tree and accounting for residual effects. Finally, we propose a model-agnostic variant using surrogate trees grown with a novel objective function, allowing application to models like neural networks. While exact computation can be intensive, approximation techniques enable practical application. We show that \ours{} provides simple, faithful, and near-complete explanations of prediction shifts across model classes, aiding model monitoring in dynamic environments.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper proposes ShapShift, a Shapley-value-based method for attributing shifts in average model predictions to changes in conditional probabilities over interpretable subgroups defined by decision-tree structure. For single trees it claims exact computation from split-node probability changes; for ensembles it selects the single most explanatory tree and handles residuals separately; for black-box models it grows surrogate trees under a novel objective. The central claim is that the resulting attributions are simple, faithful, and near-complete across model classes and therefore useful for monitoring prediction shifts in dynamic environments.

Significance. If the near-complete claim can be substantiated, the method would offer a practical, axiom-grounded tool for diagnosing distribution-shift effects on deployed models, which is relevant to high-stakes monitoring tasks. The grounding in standard Shapley axioms and the tree-structured subgroup definition are clear strengths; however, the absence of residual bounds or quantitative validation for the ensemble case limits the immediate impact.

major comments (2)
  1. [ensemble extension] The ensemble extension (described after the single-tree case) selects one tree and attributes the remainder to a residual term. The manuscript provides neither an analytic bound on the residual fraction nor empirical measurements of its magnitude across the reported experiments. Because the central claim of 'near-complete' explanations across model classes rests on this residual being negligible, the lack of such quantification is load-bearing.
  2. [surrogate-tree variant] The surrogate-tree construction for black-box models relies on a novel objective function whose derivation is only sketched. Without an explicit statement of the objective, a proof that the resulting attributions remain faithful to the original model's conditional-probability shifts, or ablation results showing sensitivity to the surrogate, the model-agnostic claim cannot be evaluated.
minor comments (2)
  1. [abstract] The abstract states that 'exact computation can be intensive' and that 'approximation techniques enable practical application,' yet no concrete approximation algorithm, complexity analysis, or reference to the relevant section is supplied.
  2. [methods] Notation for the conditional Shapley values and the precise definition of 'subgroup' should be introduced with a single, self-contained equation early in the methods section to improve readability.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive and detailed review. The comments highlight important areas for strengthening the validation of the ensemble and surrogate extensions, which we address below. We plan to incorporate the suggested clarifications and additional analyses in a revised manuscript.

read point-by-point responses
  1. Referee: The ensemble extension (described after the single-tree case) selects one tree and attributes the remainder to a residual term. The manuscript provides neither an analytic bound on the residual fraction nor empirical measurements of its magnitude across the reported experiments. Because the central claim of 'near-complete' explanations across model classes rests on this residual being negligible, the lack of such quantification is load-bearing.

    Authors: We agree that explicit quantification of the residual is necessary to support the near-complete claim. Deriving a general analytic bound is difficult because the residual depends on the specific ensemble structure, data distribution, and tree selection heuristic. However, we will add empirical measurements of the residual fraction (as a percentage of the total shift) for all ensemble experiments in the revised manuscript. These measurements will be reported alongside the existing results to demonstrate that the residual is typically small in practice. We will also expand the description of the tree-selection criterion to clarify how it minimizes the residual. revision: partial

  2. Referee: The surrogate-tree construction for black-box models relies on a novel objective function whose derivation is only sketched. Without an explicit statement of the objective, a proof that the resulting attributions remain faithful to the original model's conditional-probability shifts, or ablation results showing sensitivity to the surrogate, the model-agnostic claim cannot be evaluated.

    Authors: We acknowledge that the surrogate-tree section requires a more complete presentation. In the revision we will state the novel objective function in explicit mathematical form, including the precise optimization criterion used to grow the surrogate. We will add a short theoretical argument showing that the conditional Shapley values computed on the surrogate remain faithful to the original model's subgroup probability shifts under the chosen objective, and we will include ablation experiments that vary the surrogate hyperparameters and report the resulting attribution stability. These changes will allow readers to evaluate the model-agnostic extension directly. revision: yes

Circularity Check

0 steps flagged

No circularity: derivation builds on standard Shapley axioms and independent extensions

full rationale

The paper defines ShapShift by applying Shapley values to attribute shifts in average predictions to changes in conditional subgroup probabilities, where subgroups are induced by decision tree splits. For single trees this yields exact attributions at nodes, following directly from the value function without redefining the target shift in terms of the attributions themselves. The ensemble extension (selecting one tree plus residuals) and surrogate-tree variant (with novel objective) are presented as practical constructions motivated by computational needs, not as outputs forced by fitting or self-referential definitions. No load-bearing self-citations, uniqueness theorems imported from prior author work, or ansatzes smuggled via citation appear in the derivation chain. The method remains self-contained against external Shapley axioms and tree structure, with no step reducing by construction to its own inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Abstract-only review yields no concrete free parameters, invented entities, or non-standard axioms; the method rests on the standard efficiency and additivity properties of Shapley values plus the assumption that tree splits define meaningful subgroups.

axioms (1)
  • standard math Shapley value axioms (efficiency, symmetry, dummy player, additivity)
    Invoked as the foundation for attributing the total prediction shift to subgroup probability changes.

pith-pipeline@v0.9.0 · 5485 in / 1337 out tokens · 82344 ms · 2026-05-10T16:09:56.331221+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

32 extracted references · 32 canonical work pages

  1. [1]

    Ackerman, P

    S. Ackerman, P. Dube, E. Farchi, O. Raz, and M. Zalmanovici. Machine learning model drift detection via weak data slices. In2021 IEEE/ACM Third International Workshop on Deep Learning for Testing and Testing for Deep Learning (DeepTest), pages 1–8, Los Alamitos, CA, USA, jun 2021. IEEE Computer Society

  2. [2]

    The lifecycle of a statistical model: Model failure detection, identification, and refitting.arXiv preprint arXiv:2202.04166,

    Alnur Ali, Maxime Cauchois, and John C Duchi. The lifecycle of a statistical model: Model failure detection, identification, and refitting.arXiv preprint arXiv:2202.04166, 2022

  3. [3]

    Shirke, B

    Varun Babbar, Zhicheng Guo, and Cynthia Rudin. What is different between these datasets? arXiv preprint arXiv:2403.05652, 2024

  4. [4]

    Evaluating and aggregating feature-based model explanations

    Umang Bhatt, Adrian Weller, and José MF Moura. Evaluating and aggregating feature-based model explanations. InProceedings of the Twenty-Ninth International Conference on Interna- tional Joint Conferences on Artificial Intelligence, pages 3016–3022, 2021

  5. [5]

    Random forests.Machine learning, 45:5–32, 2001

    Leo Breiman. Random forests.Machine learning, 45:5–32, 2001

  6. [6]

    Kailash Budhathoki, Dominik Janzing, Patrick Bloebaum, and Hoiyi Ng. Why did the distri- bution change? In Arindam Banerjee and Kenji Fukumizu, editors,Proceedings of The 24th International Conference on Artificial Intelligence and Statistics, volume 130 ofProceedings of Machine Learning Research, pages 1666–1674. PMLR, 13–15 Apr 2021

  7. [7]

    Diagnosing model perfor- mance under distribution shift.arXiv preprint arXiv:2303.02011, 2023

    Tiffany Tianhui Cai, Hongseok Namkoong, and Steve Yadlowsky. Diagnosing model perfor- mance under distribution shift.arXiv preprint arXiv:2303.02011, 2023

  8. [8]

    Human- interpretable model explainability on high-dimensional data.arXiv preprint arXiv:2010.07384, 2020

    Damien de Mijolla, Christopher Frye, Markus Kunesch, John Mansir, and Ilya Feige. Human- interpretable model explainability on high-dimensional data.arXiv preprint arXiv:2010.07384, 2020

  9. [9]

    The spotlight: A general method for discovering systematic errors in deep learning models

    Greg d’Eon, Jason d’Eon, James R Wright, and Kevin Leyton-Brown. The spotlight: A general method for discovering systematic errors in deep learning models. InProceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency, pages 1962–1981, 2022

  10. [10]

    Retiring adult: New datasets for fair machine learning.Advances in neural information processing systems, 34:6478–6490, 2021

    Frances Ding, Moritz Hardt, John Miller, and Ludwig Schmidt. Retiring adult: New datasets for fair machine learning.Advances in neural information processing systems, 34:6478–6490, 2021

  11. [11]

    A hierarchical decomposition for explaining ml performance discrepancies.arXiv preprint arXiv:2402.14254, 2024

    Jean Feng, Harvineet Singh, Fan Xia, Adarsh Subbaswamy, and Alexej Gossmann. A hierarchical decomposition for explaining ml performance discrepancies.arXiv preprint arXiv:2402.14254, 2024

  12. [12]

    FICO Explainable Machine Learning (xML) Challenge, 2018

    FICO. FICO Explainable Machine Learning (xML) Challenge, 2018. https://community.fico.com/s/explainable-machinelearning-challenge

  13. [13]

    Greedy function approximation: a gradient boosting machine.Annals of statistics, pages 1189–1232, 2001

    Jerome H Friedman. Greedy function approximation: a gradient boosting machine.Annals of statistics, pages 1189–1232, 2001. 10

  14. [14]

    Asymmetric shapley values: incorporating causal knowledge into model-agnostic explainability.Advances in neural information processing systems, 33:1229–1239, 2020

    Christopher Frye, Colin Rowat, and Ilya Feige. Asymmetric shapley values: incorporating causal knowledge into model-agnostic explainability.Advances in neural information processing systems, 33:1229–1239, 2020

  15. [15]

    Faircanary: Rapid continuous explainable fairness

    Avijit Ghosh, Aalok Shanbhag, and Christo Wilson. Faircanary: Rapid continuous explainable fairness. InProceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society, AIES ’22, page 307–316, New York, NY , USA, 2022. Association for Computing Machinery

  16. [16]

    One or two things we know about concept drift–a survey on monitoring evolving environments.arXiv preprint arXiv:2310.15826, 2023

    Fabian Hinder, Valerie Vaquet, and Barbara Hammer. One or two things we know about concept drift–a survey on monitoring evolving environments.arXiv preprint arXiv:2310.15826, 2023

  17. [17]

    Progressive inference: Explaining decoder-only sequence classification models using intermediate predictions

    Sanjay Kariyappa, Freddy Lecue, Saumitra Mishra, Christopher Pond, Daniele Magazzeni, and Manuela Veloso. Progressive inference: Explaining decoder-only sequence classification models using intermediate predictions. InForty-first International Conference on Machine Learning, 2024

  18. [18]

    Moni- toring and explainability of models in production.arXiv preprint arXiv:2007.06299, 2020

    Janis Klaise, Arnaud Van Looveren, Clive Cox, Giovanni Vacanti, and Alexandru Coca. Moni- toring and explainability of models in production.arXiv preprint arXiv:2007.06299, 2020

  19. [19]

    Towards explanatory model monitoring

    Alexander Koebler, Thomas Decker, Michael Lebacher, Ingo Thon, V olker Tresp, and Florian Buettner. Towards explanatory model monitoring. InXAI in Action: Past, Present, and Future Applications, 2023

  20. [20]

    Concept bottleneck models

    Pang Wei Koh, Thao Nguyen, Yew Siang Tang, Stephen Mussmann, Emma Pierson, Been Kim, and Percy Liang. Concept bottleneck models. InInternational conference on machine learning, pages 5338–5348. PMLR, 2020

  21. [21]

    Sean Kulinski and David I. Inouye. Towards explaining distribution shifts. In Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, and Jonathan Scarlett, editors,Proceedings of the 40th International Conference on Machine Learning, volume 202 of Proceedings of Machine Learning Research, pages 17931–17952. PMLR, 23–29 Jul 2023

  22. [22]

    On the need for a language describing distribution shifts: Illustrations on tabular datasets.Advances in Neural Information Processing Systems, 36, 2024

    Jiashuo Liu, Tianyu Wang, Peng Cui, and Hongseok Namkoong. On the need for a language describing distribution shifts: Illustrations on tabular datasets.Advances in Neural Information Processing Systems, 36, 2024

  23. [23]

    A unified approach to interpreting model predictions

    Scott M Lundberg and Su-In Lee. A unified approach to interpreting model predictions. Advances in neural information processing systems, 30, 2017

  24. [24]

    Cancer diagnosis via linear programming

    Olvi L Mangasarian and William H Wolberg. Cancer diagnosis via linear programming. Technical report, University of Wisconsin-Madison Department of Computer Sciences, 1990

  25. [25]

    Christoph Molnar.Interpretable machine learning. Lulu. com, 2020

  26. [26]

    Explanation shift: Detecting distribution shifts on tabular data via the explanation space

    Carlos Mougan, Klaus Broelemann, Gjergji Kasneci, Thanassis Tiropanis, and Steffen Staab. Explanation shift: Detecting distribution shifts on tabular data via the explanation space. In NeurIPS 2022 Workshop on Distribution Shifts: Connecting Methods and Applications, 2022

  27. [27]

    Sparse spatial autoregressions.Statistics & Probability Letters, 33(3):291–297, 1997

    R Kelley Pace and Ronald Barry. Sparse spatial autoregressions.Statistics & Probability Letters, 33(3):291–297, 1997

  28. [28]

    Higher-order explanations of graph neural networks via relevant walks.IEEE transactions on pattern analysis and machine intelligence, 44(11):7581– 7596, 2021

    Thomas Schnake, Oliver Eberle, Jonas Lederer, Shinichi Nakajima, Kristof T Schütt, Klaus- Robert Müller, and Grégoire Montavon. Higher-order explanations of graph neural networks via relevant walks.IEEE transactions on pattern analysis and machine intelligence, 44(11):7581– 7596, 2021

  29. [29]

    Unified shapley framework to explain prediction drift, 2021

    Aalok Shanbhag, Avijit Ghosh, and Josh Rubin. Unified shapley framework to explain prediction drift, 2021

  30. [30]

    A value for n-person games

    Lloyd S Shapley. A value for n-person games. 1953

  31. [31]

    Using the adap learning algorithm to forecast the onset of diabetes mellitus

    Jack W Smith, James E Everhart, WC Dickson, William C Knowler, and Robert Scott Johannes. Using the adap learning algorithm to forecast the onset of diabetes mellitus. InProceedings of the annual symposium on computer application in medical care, page 261. American Medical Informatics Association, 1988. 11

  32. [32]

    Why did the Model Fail?

    Haoran Zhang, Harvineet Singh, and Shalmali Joshi. “Why did the Model Fail?”: Attributing Model Performance Changes to Distribution Shifts. InICML 2022: Workshop on Spurious Correlations, Invariance and Stability, 2022. 12 A Methodological Details and Discussion A.1 Handling General Models via Scalarisation Although the notation in the main paper presents...