pith. sign in

arxiv: 2602.00208 · v3 · submitted 2026-01-30 · 💻 cs.LG · cs.AI· cs.IR· math.ST· stat.ML· stat.TH

Analyzing Shapley Additive Explanations to Understand Anomaly Detection Algorithm Behaviors and Their Complementarity

Pith reviewed 2026-05-16 09:38 UTC · model grok-4.3

classification 💻 cs.LG cs.AIcs.IRmath.STstat.MLstat.TH
keywords anomaly detectionSHAP explanationsensemble methodsmodel complementarityfeature attributionunsupervised learningdetector diversityexplanation similarity
0
0 comments X

The pith

SHAP attribution similarity identifies complementary anomaly detectors, offering a selection criterion distinct from raw output scores for building effective ensembles.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes a method to characterize unsupervised anomaly detectors by computing SHAP explanations of their feature attributions. Detectors whose attributions are similar tend to produce correlated anomaly scores and flag largely the same points, while divergent attributions reliably signal complementary detection. This supplies a new way to pick models for ensembles that capture different kinds of irregularities, improving on selections based only on the detectors' raw scores. The work further shows that explanation diversity is useful only when the individual detectors already perform well on their own. A reader would care because redundant models have long limited the practical gains from ensembles in unsupervised anomaly detection.

Core claim

Using SHAP to quantify feature attributions, the authors demonstrate that similarity in these attribution profiles between anomaly detectors corresponds to correlated anomaly scores and largely overlapping detected anomalies, whereas divergence in explanations reliably indicates complementary detection behavior. This allows explanation-driven metrics to serve as a distinct criterion for selecting ensemble members compared to raw outputs, resulting in more diverse and effective ensembles when combined with high individual model performance.

What carries the argument

SHAP attribution profiles used to quantify similarity between anomaly detectors' decision mechanisms via feature importance.

Load-bearing premise

SHAP attributions faithfully capture the decision mechanisms of the anomaly detectors, so that similarity or divergence in attributions directly corresponds to overlapping or complementary sets of detected anomalies.

What would settle it

An observation of two detectors that share nearly identical SHAP attribution profiles yet detect largely disjoint sets of anomalies would falsify the claimed correspondence.

Figures

Figures reproduced from arXiv: 2602.00208 by Benoit Gaudou, Jordan Levy, Moncef Garouani, Nicolas Verstaevel, Paul Saves.

Figure 1
Figure 1. Figure 1: Mean similarity between models across all datasets. [PITH_FULL_IMAGE:figures/full_fig_p008_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Relationship between ensemble diversity (given by [PITH_FULL_IMAGE:figures/full_fig_p010_2.png] view at source ↗
read the original abstract

Unsupervised anomaly detection is a challenging problem due to the diversity of data distributions and the lack of labels. Ensemble methods are often adopted to mitigate these challenges by combining multiple detectors, which can reduce individual biases and increase robustness. Yet building an ensemble that is genuinely complementary remains challenging, since many detectors rely on similar decision cues and end up producing redundant anomaly scores. As a result, the potential of ensemble learning is often limited by the difficulty of identifying models that truly capture different types of irregularities. To address this, we propose a methodology for characterizing anomaly detectors through their decision mechanisms. Using SHapley Additive exPlanations, we quantify how each model attributes importance to input features, and we use these attribution profiles to measure similarity between detectors. We show that detectors with similar explanations tend to produce correlated anomaly scores and identify largely overlapping anomalies. Conversely, explanation divergence reliably indicates complementary detection behavior. Our results demonstrate that explanation-driven metrics offer a different criterion than raw outputs for selecting models in an ensemble. However, we also demonstrate that diversity alone is insufficient; high individual model performance remains a prerequisite for effective ensembles. By explicitly targeting explanation diversity while maintaining model quality, we are able to construct ensembles that are more diverse, more complementary, and ultimately more effective for unsupervised anomaly detection.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper proposes using SHAP explanations to analyze the decision mechanisms of unsupervised anomaly detection algorithms. It demonstrates that models with similar SHAP attribution profiles tend to produce correlated anomaly scores and detect largely overlapping anomalies, while divergent profiles indicate complementary behaviors. This is leveraged to select models for ensembles, showing improved performance when combining explanation diversity with high individual model accuracy.

Significance. Should the findings hold under rigorous validation, this methodology offers a valuable new criterion for constructing complementary ensembles in unsupervised anomaly detection, distinct from traditional output-based diversity measures. It underscores that while diversity is important, it must be paired with strong base model performance, which could influence ensemble design practices in the field.

major comments (2)
  1. [Methodology and SHAP Setup] The manuscript does not provide an ablation study or justification for the background distribution used in computing SHAP values for the anomaly detectors. Given that anomaly scores are relative to normal data, different background choices could alter the attributions substantially, risking that the observed correlation between explanation similarity and anomaly overlap is not robust.
  2. [Experimental Results] The experimental results assert that explanation-driven metrics differ from raw outputs for ensemble selection, but lack direct head-to-head comparisons, statistical significance tests, or controls across multiple datasets to establish that explanation divergence reliably yields superior complementarity beyond output correlation.
minor comments (1)
  1. [Throughout] Ensure consistent terminology for 'explanation similarity' versus 'attribution profiles' to improve readability.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive feedback. The comments highlight important aspects of robustness and empirical validation that we will address through targeted revisions. Below we respond point by point to the major comments.

read point-by-point responses
  1. Referee: [Methodology and SHAP Setup] The manuscript does not provide an ablation study or justification for the background distribution used in computing SHAP values for the anomaly detectors. Given that anomaly scores are relative to normal data, different background choices could alter the attributions substantially, risking that the observed correlation between explanation similarity and anomaly overlap is not robust.

    Authors: We agree that the choice of background distribution merits explicit justification and sensitivity analysis. In the submitted manuscript we used the empirical distribution of the training data (standard for unsupervised anomaly detection to represent the normal baseline). To strengthen the work we will add a dedicated ablation subsection that evaluates alternative backgrounds, including the feature-wise mean, random subsamples from the training set, and synthetically generated normal points. We will report that the reported correlations between explanation similarity and anomaly overlap remain stable across these choices, thereby confirming robustness. revision: yes

  2. Referee: [Experimental Results] The experimental results assert that explanation-driven metrics differ from raw outputs for ensemble selection, but lack direct head-to-head comparisons, statistical significance tests, or controls across multiple datasets to establish that explanation divergence reliably yields superior complementarity beyond output correlation.

    Authors: We acknowledge that the current experimental section would benefit from more explicit comparative analysis. The manuscript already evaluates explanation-driven selection against random and output-correlation baselines on multiple datasets and shows gains in complementarity when high individual accuracy is preserved. In the revision we will add direct head-to-head tables, apply statistical significance tests (paired Wilcoxon signed-rank tests with Bonferroni correction), and include additional datasets with controlled variations in dimensionality and anomaly type. These additions will provide clearer quantitative evidence that explanation divergence supplies complementary information beyond output correlation alone. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical SHAP correlations are independent of target metrics

full rationale

The paper applies standard SHAP attribution to anomaly detectors, then computes empirical correlations between attribution similarity and anomaly-score overlap. No equations or steps reduce the measured complementarity to a fitted parameter or self-citation chain; the observed relationships are reported as data-driven findings rather than derived by construction from the inputs. The central claim therefore remains externally falsifiable and does not collapse into its own definitions.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The work relies on the standard assumption that SHAP values provide faithful local explanations for black-box models and that correlation of these explanations tracks correlation of anomaly detections. No new free parameters or invented entities are introduced in the abstract.

axioms (1)
  • domain assumption SHAP values faithfully reflect the decision mechanisms of the anomaly detection models
    Invoked when using attribution profiles to measure similarity and complementarity.

pith-pipeline@v0.9.0 · 5550 in / 1180 out tokens · 26442 ms · 2026-05-16T09:38:18.098435+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

23 extracted references · 23 canonical work pages · 1 internal anchor

  1. [1]

    Anomaly detection: A survey

    V. Chandola et al. “Anomaly detection: A survey”. In: ACM computing surveys (CSUR) 41.3 (2009), pp. 1–58

  2. [2]

    A unifying review of deep and shallow anomal y detection

    L. Ruff et al. “A unifying review of deep and shallow anomal y detection”. In: Proceedings of the IEEE 109.5 (2021), pp. 756–795

  3. [3]

    Anomalies detection by unsupervised lear ning using ex- plainable artificial intelligence in nuclear power plants

    S. W. Oh et al. “Anomalies detection by unsupervised lear ning using ex- plainable artificial intelligence in nuclear power plants” . In: Transactions of the Korean Nuclear Society Spring Meeting Jeju, Korea . 2022

  4. [4]

    C. C. Aggarwal. Outlier Analysis . Springer, 2016

  5. [5]

    No free lunch theorems f or optimiza- tion

    D. H. Wolpert and W. G. Macready. “No free lunch theorems f or optimiza- tion”. In: Transactions on evolutionary computation (2002)

  6. [6]

    ADBench: Anomaly detection benchmark

    S. Han et al. “ADBench: Anomaly detection benchmark”. In : NeurIPS 35 (2022), pp. 32142–32159

  7. [7]

    On evaluation of outlier rankings and outlier scores

    E. Schubert et al. “On evaluation of outlier rankings and outlier scores”. In: International conference on data mining . SIAM. 2012, pp. 1047–1058

  8. [8]

    The need for unsupervised outlier model se lection: A review and evaluation of internal evaluation strategies

    M. Q. Ma et al. “The need for unsupervised outlier model se lection: A review and evaluation of internal evaluation strategies”. In: ACM SIGKDD Explorations Newsletter 25.1 (2023), pp. 19–35. 14 J. Levy, P. Saves, M. Garouani, N. Verstaevel and B. Gaudou

  9. [9]

    A unified approach to interp reting model predictions

    S. M. Lundberg and S.-I. Lee. “A unified approach to interp reting model predictions”. In: NeurIPS 30 (2017)

  10. [10]

    PyOD: A Python Toolbox for Scalable Outli er Detection

    Y. Zhao et al. “PyOD: A Python Toolbox for Scalable Outli er Detection”. In: Journal of Machine Learning Research 20.96 (2019), pp. 1–7

  11. [11]

    How to Evaluate the Quality of Unsupervised Anomaly Detection Algorithms?

    N. Goix. “How to evaluate the quality of unsupervised an omaly detection algorithms?” In: arXiv preprint arXiv:1607.01152 (2016)

  12. [12]

    Internal evaluation of unsupervis ed outlier detection

    H. O. Marques et al. “Internal evaluation of unsupervis ed outlier detection”. In: TKDD 14.4 (2020), pp. 1–42

  13. [13]

    Unsupervised model selection for variat ional disentangled representation learning

    S. Duan et al. “Unsupervised model selection for variat ional disentangled representation learning”. In: International Conference on Learning Repre- sentations. 2019

  14. [14]

    InfoGAN-CR and ModelCentrality: Self-su pervised Model Training and Selection for Disentangling GANs

    Z. Lin et al. “InfoGAN-CR and ModelCentrality: Self-su pervised Model Training and Selection for Disentangling GANs”. In: International confer- ence on machine learning . PMLR. 2020, pp. 6127–6139

  15. [15]

    Less is more: Building selecti ve anomaly en- sembles

    S. Rayana and L. Akoglu. “Less is more: Building selecti ve anomaly en- sembles”. In: TKDD 10.4 (2016), pp. 1–33

  16. [16]

    Unsupervised time series outlier dete ction with diversity- driven convolutional ensembles

    D. Campos et al. “Unsupervised time series outlier dete ction with diversity- driven convolutional ensembles”. In: Proceedings of the VLDB Endowment 15.3 (2021), pp. 611–623

  17. [17]

    Interpretability needs a new paradigm

    A. Madsen et al. “Interpretability needs a new paradigm ”. In: arXiv (2024)

  18. [18]

    Surrogate Modeling and Explainable Art ificial Intelligence for Complex Systems: A Workflow for Automated Simulation Exp loration

    P. Saves et al. “Surrogate Modeling and Explainable Art ificial Intelligence for Complex Systems: A Workflow for Automated Simulation Exp loration”. In: arXiv preprint (2025)

  19. [19]

    XStacking: An effective and inherent ly explainable framework for stacked ensemble learning

    M. Garouani et al. “XStacking: An effective and inherent ly explainable framework for stacked ensemble learning”. In: Information Fusion (2025)

  20. [20]

    Learning to rank using gradient descen t

    C. Burges et al. “Learning to rank using gradient descen t”. In: Proceedings of the 22nd international conference on Machine learning . 2005, pp. 89–96

  21. [21]

    The detection of disease clustering and a ge neralized regression approach

    N. Mantel. “The detection of disease clustering and a ge neralized regression approach”. In: Cancer research 27 (1967), pp. 209–220

  22. [22]

    Beyond the single-best model: Rashomon partial depen- dence profile for trustworthy explanations in automl

    M. Cavus et al. “Beyond the single-best model: Rashomon partial depen- dence profile for trustworthy explanations in automl”. In: International Conference on Discovery Science . Springer. 2025, pp. 445–459

  23. [23]

    TimeCIEL: Contextual Interactive Ensem ble Learning for Time Series Classification

    J. Levy et al. “TimeCIEL: Contextual Interactive Ensem ble Learning for Time Series Classification”. In: International Conference on Practical Ap- plications of Agents and Multi-Agent Systems . Springer. 2025, pp. 316– 327