pith. sign in

arxiv: 2605.24084 · v1 · pith:J32PMKJZnew · submitted 2026-05-22 · 💻 cs.LG · cs.AI· cs.LO

Verified SHAP: Provable Bounds for Exact Shapley Values of Neural Networks

Pith reviewed 2026-06-30 16:38 UTC · model grok-4.3

classification 💻 cs.LG cs.AIcs.LO
keywords SHAPShapley valuesneural network verificationexact feature attributionprovable boundsexplainable AIscalabilitycoalition evaluation
0
0 comments X

The pith

Neural network verification can recover exact Shapley values by tightening bounds on feature contributions until they match.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that verification engines applied to neural networks can produce lower and upper bounds on Shapley values that converge to the exact values. This addresses the exponential cost of enumerating feature coalitions by replacing direct computation with certified queries. A reader would care because current exact SHAP methods are limited to tiny feature sets while approximations lack guarantees. The work shows the new bounds scale to search spaces orders of magnitude larger than earlier exact techniques. It therefore supplies a concrete route to trustworthy feature attributions for bigger models.

Core claim

The central claim is that an algorithm can formulate each coalition evaluation in the Shapley formula as a neural-network verification query. Solving these queries yields provably correct lower and upper bounds on the corresponding marginal contributions. The bounds are refined until the interval collapses to a single point, recovering the exact Shapley value for every feature. Experiments confirm that the procedure handles input dimensions far beyond the reach of previous exact SHAP algorithms while still terminating with the true values.

What carries the argument

The reduction of Shapley coalition marginals to neural-network verification problems that return certified bounds on the network output.

If this is right

  • Exact Shapley values become computable for neural networks whose feature count exceeds the limits of prior exact methods.
  • Statistical approximation techniques can now be benchmarked against ground-truth values on larger instances.
  • Certified bounds remain available even when exact values are still out of reach, supporting safety arguments that rely on contribution ranges.
  • The same verification machinery can be reused for related attribution scores that also require summing over coalitions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same bounding technique could be applied to other additive feature-attribution schemes that share the coalition structure of Shapley values.
  • Hybrid pipelines might use the certified bounds to decide when a cheaper approximation is already sufficiently accurate.
  • The method opens a path to certified fairness audits that treat exact feature contributions as the reference quantity.

Load-bearing premise

Existing verification engines can be applied to the sub-networks and queries that arise from Shapley coalition evaluations without prohibitive cost or loss of tightness.

What would settle it

On a small network whose exact Shapley values are known by exhaustive enumeration, the algorithm's bounds fail to converge to those known values after repeated refinement.

Figures

Figures reproduced from arXiv: 2605.24084 by David Boetius, Guy Katz, Shahaf Bassan, Stefan Leue, Tobias Sutter.

Figure 1
Figure 1. Figure 1: Splitting in VERISHAP. For Si = {j1, j2}, there are two options, and , for splitting the initial partition (left). For the split , the branches are B1 = {∅, {j2}} and B2 = {{j1}, {j1, j2}}, defined by I1 = E2 = ∅ and E1 = I2 = {j1}. This split allows computing the exact SHAP value directly since the contribution function ∆i(S) is linear for this split (right). branch Bk ∈ B (t−1) defined by Ik and Ek and s… view at source ↗
Figure 2
Figure 2. Figure 2: MNIST VERISHAP Results. SHAP value bounds for the class ‘4’ score of an MNIST CNN on a test set image of a ‘2’. The SHAP values are computed for an 8 × 8 grid of superpixels. The SHAP bounds φi, φi are visualised by their midpoints (φi + φi)/2 and half-ranges (HR) (φi − φi)/2 determining colour hue and lightness, respectively. Lighter colour means tighter bounds in this figure. In (c), the largest half-ran… view at source ↗
Figure 3
Figure 3. Figure 3: VERISHAP bounds for three features from [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: , KERNELSHAP requires 346s, 123s, and 351s on Mushroom, Default, and Auto, respectively, at the largest feasible sample sizes. Similarly, TREEMSR requires 895s, 132s, and 341s on Mushroom, Default, and Auto, respectively. For both estimators, this exceeds the overall runtime of VERISHAP on Mushroom and Auto. Overall, VERISHAP complements statistical estimators when high accuracy and precision are required.… view at source ↗
Figure 5
Figure 5. Figure 5: shows, LEVERAGESHAP (Musco & Witter, 2025) outperforms KERNELSHAP, confirming a trend emerging on smaller search spaces (Witter et al., 2025; Musco & Wit￾ter, 2025). TREEMSR shows volatile performance across datasets when applied to neural networks, a nuanced in￾sight that is not directly apparent from the results of Witter et al. (2025) for explaining tree-based models with large search spaces. This exemp… view at source ↗
Figure 6
Figure 6. Figure 6: Sources of Looseness in the VERISHAP Value Bounds. This plot decomposes the initial value function v bounds computed by VERISHAP into the true range of v over the discrete coalition space , the looseness introduced by the continuous relaxation , and the looseness introduced by CROWN-IBP . For each dataset, we normalise the bounds by the network output to attribute. Since the looseness introduced by the con… view at source ↗
Figure 7
Figure 7. Figure 7: Scalability in Network Size. We plot the runtime and the number of iterations VERISHAP requires to compute the exact SHAP values for neural networks trained on the Mushroom dataset of increasing size. We measure network size by the number of parameters a network possesses. The right plots (c) and (d) show cut-outs of the left plots (a) and (b) focussing on the results for small networks. 0 200 400 0 200 40… view at source ↗
Figure 8
Figure 8. Figure 8: Extended MNIST Experiment Analysis. This figure provides additional statistics for our MNIST experiment described in Section 6.1 and visualised in [PITH_FULL_IMAGE:figures/full_fig_p028_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: VERISHAP Results for Mean-Baseline SHAP on MNIST, FashionMNIST, CIFAR10, & GTSRB. For MNIST, we compute SHAP value bounds for the class ‘4’ score on a test set image of a ‘2’. For FashionMNIST, we attribute the ‘Ankle Boot’ score on a test set image of an ‘Ankle Boot’. For CIFAR10, we attribute the ‘Cat’ score on a test set image of a ‘Cat’. For GTSRB, we attribute the ‘Pass on the Right’ score on a test s… view at source ↗
Figure 10
Figure 10. Figure 10: VERISHAP Results for Zero-Baseline SHAP on FashionMNIST. We attribute the ‘Ankle Boot’ score on a test set image of an ‘Ankle Boot’. The SHAP bounds φi, φi are visualised by their midpoints (φi + φi)/2 and half-ranges (HR) (φi − φi)/2 determining colour hue and lightness, respectively. Lighter colour means tighter bounds in this figure. We write ‘p% HR’ to denote the iteration in which the largest half-ra… view at source ↗
Figure 11
Figure 11. Figure 11: VERISHAP vs. KERNELSHAP , LEVERAGESHAP , LINEARMSR & TREEMSR . We run KERNELSHAP, LEVERAGESHAP, LINEARMSR, and TREEMSR with increasing sample sizes until each exhausts the available GPU memory, repeating each run 100 times. For each sample size, we plot the range across repetitions of the largest error between the estimated SHAP value and the true SHAP value of a feature. For VERISHAP, we mark the half-ra… view at source ↗
Figure 12
Figure 12. Figure 12: KERNELSHAP vs. LEVERAGESHAP vs. TREEMSR (Extended). We report the mean squared error (MSE) of statistical SHAP estimators across 100 runs as in [PITH_FULL_IMAGE:figures/full_fig_p033_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: LEVERAGESHAP vs. LINEARMSR . We report the mean squared error (MSE) of the statistical SHAP estimators across 100 runs as in [PITH_FULL_IMAGE:figures/full_fig_p033_13.png] view at source ↗
Figure 14
Figure 14. Figure 14: Initial Bounds Half-Range vs. Number of Branchs. For each tabular network, we plot the largest initial SHAP bounds half-range (HR) maxi∈[n](φi (1) − φi (1))/2 against the number of branches required for VERISHAP to compute SHAP bounds with a maximum half-range of at-most 10% of the network output to attribute (10% HR). We normalise the initial bounds margin by the network output to attribute to enable the… view at source ↗
read the original abstract

Shapley additive explanations (SHAP) are widely recognised as computationally intractable for neural networks, since they induce an exponential search space over the input features. In this work, we take a first step towards scaling exact SHAP computation to larger search spaces by introducing an algorithm that leverages recent advances in neural network verification to compute arbitrarily tight exact lower and upper bounds on SHAP values for neural networks, ultimately recovering the exact SHAP values. We demonstrate that our approach scales to orders of magnitude larger search spaces than state-of-the-art exact methods. This provides an important first step towards exact SHAP computation and establishes a principled cornerstone for evaluating statistical approximation methods on larger search spaces.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 1 minor

Summary. The paper introduces Verified SHAP, an algorithm that applies recent neural network verification advances to compute arbitrarily tight lower and upper bounds on Shapley values for neural networks. By tightening these bounds sufficiently, the method recovers exact SHAP values and is claimed to scale to search spaces orders of magnitude larger than prior exact methods, providing a foundation for evaluating statistical approximations on bigger instances.

Significance. If the scaling and exact-recovery claims hold, the work is significant because exact SHAP computation has been intractable for neural networks due to the exponential coalition space; verification-based bounding offers a principled route to exact values without enumeration. The approach directly leverages external verification progress rather than inventing new parameters or self-referential definitions, which strengthens its potential as a cornerstone for XAI evaluation.

major comments (1)
  1. [Abstract and §4 (experiments)] The central claim requires that verification engines can be applied to the exponentially many coalition-specific queries (marginal contributions f(x_S ∪ {i}) − f(x_S) under input masks) while maintaining tightness sufficient to recover exact values and without prohibitive cost. The manuscript must supply concrete benchmarks (network widths/depths, feature counts, verification timeouts, and bound gaps) showing that coalition encodings do not introduce disjunctions that cause looseness on ReLU networks, as this assumption is load-bearing for both the 'arbitrarily tight' recovery and the reported scaling advantage.
minor comments (1)
  1. [Abstract] The abstract states that the method 'demonstrates' scaling but provides no quantitative details on network sizes, runtimes, or tightness metrics; moving a concise summary of the key experimental parameters into the abstract would improve clarity.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive review and the emphasis on substantiating the central claims with concrete experimental details. We address the major comment below.

read point-by-point responses
  1. Referee: [Abstract and §4 (experiments)] The central claim requires that verification engines can be applied to the exponentially many coalition-specific queries (marginal contributions f(x_S ∪ {i}) − f(x_S) under input masks) while maintaining tightness sufficient to recover exact values and without prohibitive cost. The manuscript must supply concrete benchmarks (network widths/depths, feature counts, verification timeouts, and bound gaps) showing that coalition encodings do not introduce disjunctions that cause looseness on ReLU networks, as this assumption is load-bearing for both the 'arbitrarily tight' recovery and the reported scaling advantage.

    Authors: We agree that the manuscript should provide more granular experimental evidence to support the claims of arbitrary tightness and scaling. In the revised version we will expand §4 with additional tables and text that report: (i) explicit network widths and depths for all evaluated models, (ii) feature counts ranging from the small instances used by prior exact methods up to the larger spaces claimed, (iii) per-query verification timeouts and total runtime, and (iv) the numerical bound gaps (upper minus lower) achieved on the marginal-contribution queries before exact recovery. We will also add a dedicated paragraph on the coalition-encoding construction, showing that the input-masking formulation introduces no new disjunctive constraints beyond the existing ReLU activations; the verifier therefore inherits the same tightness guarantees as standard ReLU verification, which we will illustrate with both a short formal argument and empirical bound-gap statistics confirming that gaps close to machine precision within the reported budgets. revision: yes

Circularity Check

0 steps flagged

No circularity; derivation applies external verification to SHAP queries

full rationale

The paper's central algorithm applies existing neural network verification engines to compute bounds on marginal contributions within the SHAP formula, recovering exact values when bounds become tight. No equations redefine SHAP in terms of the verification outputs, no fitted parameters are relabeled as predictions, and no load-bearing steps reduce to self-citations or author-specific uniqueness theorems. The method is self-contained against external benchmarks because it treats verification advances as independent primitives rather than deriving them from the SHAP application itself.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Abstract provides no explicit free parameters, invented entities, or additional axioms beyond the implicit assumption that verification engines can bound SHAP queries.

axioms (1)
  • domain assumption Neural network verification tools can produce arbitrarily tight bounds on the output of sub-queries corresponding to feature coalitions in SHAP computation.
    This is the load-bearing premise that allows the bounds to be tightened to exact values.

pith-pipeline@v0.9.1-grok · 5653 in / 1089 out tokens · 24652 ms · 2026-06-30T16:38:13.028706+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

14 extracted references · 12 canonical work pages · 1 internal anchor

  1. [1]

    URL https://doi.org /10.1609/aaai.v36i5.20484

    AAAI Press, 2022. URL https://doi.org /10.1609/aaai.v36i5.20484. Baniecki, H., Muschalik, M., Fumagalli, F., Hammer, B., H¨ullermeier, E., and Biecek, P. Explaining similarity in vision-language encoders with weighted Banzhaf inter- actions.CoRR, abs/2508.05430, 2025. URL https: //doi.org/10.48550/arXiv.2508.05430. Barcel´o, P., Cominetti, R., and Morgado...

  2. [2]

    Becker, B

    URL https://openreview.net/forum?i d=nOfSWmPYL5. Becker, B. and Kohavi, R. Adult. UCI Machine Learning Repository, 1996. URL https://doi.org /10.24432/C5XW20. Biradar, G., Izza, Y ., Lobo, E. A., Viswanathan, V ., and Zick, Y . Axiomatic aggregations of abductive explanations. In AAAI, pp. 11096–11104. AAAI Press, 2024. URL http s://doi.org/10.1609/aaai.v...

  3. [3]

    URL https://proceedings.m lr.press/v202/chen23s.html

    PMLR, 2023. URL https://proceedings.m lr.press/v202/chen23s.html. Chen, Y ., Hua, F., Jin, Y ., and Zhang, E. Z. BGPQ: A heap-based priority queue design for GPUs. InICPP, pp. 9:1–9:10. ACM, 2021. URL https://doi.org /10.1145/3472456.3472463. Chiu, H., Chen, H., Zhang, H., and Zhang, R. Y . SDP- CROWN: Efficient bound propagation for neural network verifi...

  4. [4]

    Fernandes, K., Vinagre, P., and Cortez, P

    URL https://openreview.net/forum?i d=ky7vVlBQBY. Fernandes, K., Vinagre, P., and Cortez, P. A pro- active intelligent decision support system for predict- ing the popularity of online news. InEPIA, volume 9273 ofLecture Notes in Computer Science, pp. 535–546. Springer, 2015. URL https://doi.org /10.1007/978-3-319-23485-4 53. Ferrari, C., M ¨uller, M. N., ...

  5. [5]

    Fryer, D

    URL https://openreview.net/forum?i d=l amHf1oaK. Fryer, D. V ., Str¨umke, I., and Nguyen, H. D. Shapley values for feature selection: The good, the bad, and the axioms. IEEE Access, 9:144352–144360, 2021. URL https: //doi.org/10.1109/ACCESS.2021.3119110. Fumagalli, F., Muschalik, M., Kolpaczki, P., H ¨uller- meier, E., and Hammer, B. SHAP-IQ: Unified appr...

  6. [6]

    Izza, Y ., Huang, X., Morgado, A., Planes, J., Ignatiev, A., and Marques-Silva, J

    URL https://doi.org/10.1609/aaai.v 33i01.33011511. Izza, Y ., Huang, X., Morgado, A., Planes, J., Ignatiev, A., and Marques-Silva, J. Distance-restricted explanations: Theoretical underpinnings & efficient implementation. InKR, 2024. URL https://doi.org/10.24963/k r.2024/45. Jethani, N., Sudarshan, M., Covert, I. C., Lee, S., and Ran- ganath, R. FastSHAP:...

  7. [7]

    Kotha, S., Brix, C., Kolter, J

    URL https://proceedings.mlr.press/v 238/kolpaczki24a.html. Kotha, S., Brix, C., Kolter, J. Z., Dvijotham, K., and Zhang, H. Provably bounding neural network preimages.NeurIPS, 36:80270–80290, 2023. URL http://papers.nips.cc/paper files/paper /2023/hash/fe061ec0ae03c5cf5b5323a2b 9121bfd-Abstract-Conference.html. Krizhevsky, A. Learning multiple layers of f...

  8. [8]

    Kumar, I

    URL https://www.cs.toronto.edu/˜kri z/learning-features-2009-TR.pdf. Kumar, I. E., Venkatasubramanian, S., Scheidegger, C., and Friedler, S. A. Problems with Shapley-value-based ex- planations as feature importance measures. InICML, volume 119 ofProceedings of Machine Learning Re- search, pp. 5491–5500. PMLR, 2020. URL http://p roceedings.mlr.press/v119/k...

  9. [9]

    Amortized Linear-time Exact Shapley Value for Product-Kernel Methods

    URL https://proceedings.neurips.c c/paper/2017/hash/8a20a8621978632d76c 43dfd28b67767-Abstract.html. Lundberg, S. M., Erion, G. G., Chen, H., DeGrave, A. J., Prutkin, J. M., Nair, B., Katz, R., Himmelfarb, J., Bansal, N., and Lee, S. From local explanations to global un- derstanding with explainable AI for trees.Nat. Mach. Intell., 2(1):56–67, 2020. URL h...

  10. [10]

    Musco, C

    URL https://openreview.net/forum?i d=9tKC0YM8sX. Musco, C. and Witter, R. T. Provably accurate Shapley value estimation via leverage score sampling. InICLR. OpenReview.net, 2025. URL https://openrevie w.net/forum?id=wg3rBImn3O. Mushroom. Mushroom Dataset. UCI Machine Learn- ing Repository, 1981. URL https://doi.org /10.24432/C5959T. Nasr, M., El-Bahnasy, ...

  11. [11]

    Neubert, P

    URL https://doi.org/10.1109/ICENC O.2017.8289800. Neubert, P. and Protzel, P. Compact watershed and preempt- ive SLIC: on improving trade-offs of superpixel segment- ation algorithms. InICPR, pp. 996–1001. IEEE Computer Society, 2014. URL https://doi.org/10.1109/I CPR.2014.181. Palechor, F. M. and de la Hoz Manotas, A. Dataset for estimation of obesity le...

  12. [12]

    Singh, G., Gehr, T., P ¨uschel, M., and Vechev, M

    URLhttps://doi.org/10.24432/C5T01Q. Singh, G., Gehr, T., P ¨uschel, M., and Vechev, M. T. An abstract domain for certifying neural networks.Proc. ACM Program. Lang., 3(POPL):41:1–41:30, 2019. URL https://doi.org/10.1145/3290354. Slack, D., Hilgard, S., Jia, E., Singh, S., and Lakka- raju, H. Fooling LIME and SHAP: Adversarial at- tacks on post hoc explana...

  13. [13]

    URL https://openreview.net/forum?i d=nVZtXBI6LNn. Yang, J. Fast TreeSHAP: Accelerating SHAP value com- putation for trees.CoRR, abs/2109.09847, 2021. URL https://arxiv.org/abs/2109.09847. Yeh, I. and Lien, C. The comparisons of data mining tech- niques for the predictive accuracy of probability of default of credit card clients.Expert Syst. Appl., 36(2):2...

  14. [14]

    Zhang, H., Chen, H., Xiao, C., Gowal, S., Stanforth, R., Li, B., Boning, D

    URL https://proceedings.neurips .cc/paper/2018/hash/d04863f100d59b3e b688a11f95b0ae60-Abstract.html. Zhang, H., Chen, H., Xiao, C., Gowal, S., Stanforth, R., Li, B., Boning, D. S., and Hsieh, C. Towards stable and efficient training of verifiably robust neural networks. In ICLR. OpenReview.net, 2020. URL https://openr eview.net/forum?id=Skxuk1rFwB. Zhou, ...