A Bayesian Framework for Uncertainty-Aware Explanations in Power Quality Disturbance Classification
Pith reviewed 2026-05-10 13:26 UTC · model grok-4.3
The pith
A Bayesian framework generates relevance attribution distributions to model uncertainty in explanations for power quality disturbance classifications.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper claims that a Bayesian explanation framework models explanation uncertainty by generating a relevance attribution distribution for each instance in power quality disturbance classification. This enables selection of explanations based on confidence percentiles tailored to specific disturbance types, resulting in more transparent and reliable interpretations than deterministic XAI methods, as validated through extensive experiments on synthetic and real-world power quality datasets.
What carries the argument
The relevance attribution distribution, generated per instance to capture uncertainty and support percentile-based selection of explanations.
If this is right
- Explanations become selectable by confidence percentile to match specific disturbance types.
- Transparency of PQD classifiers increases through uncertainty-aware outputs.
- Reliability improves in safety-critical applications compared to fixed XAI methods.
- The approach applies to both synthetic and real-world power quality datasets.
Where Pith is reading between the lines
- The framework could apply to uncertainty modeling in other deep learning classification tasks outside power systems.
- It might support more consistent regulatory review of AI decisions in critical infrastructure by providing confidence-bounded interpretations.
- Different choices of Bayesian priors could be tested to see their effect on the shape and usefulness of the attribution distributions.
Load-bearing premise
The generated relevance attribution distribution meaningfully represents the true uncertainty in the classifier's explanations.
What would settle it
An evaluation where explanations chosen from high-confidence percentiles of the distribution show no improvement in reliability or usefulness over standard deterministic explanations when assessed by domain experts on power system data.
Figures
read the original abstract
Advanced deep learning methods have shown remarkable success in power quality disturbance (PQD) classification. To enhance model transparency, explainable AI (XAI) techniques have been developed to provide instance-specific interpretations of classifier decisions. However, conventional XAI methods yield deterministic explanations, overlooking uncertainty and limiting reliability in safety-critical applications. This paper proposes a Bayesian explanation framework that models explanation uncertainty by generating a relevance attribution distribution for each instance. This method allows experts to select explanations based on confidence percentiles, thereby tailoring interpretability according to specific disturbance types. Extensive experiments on synthetic and real-world power quality datasets demonstrate that the proposed framework improves the transparency and reliability of PQD classifiers through uncertainty-aware explanations.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a Bayesian explanation framework for deep learning-based power quality disturbance (PQD) classification. It models explanation uncertainty by producing a relevance attribution distribution for each instance, enabling experts to select explanations according to confidence percentiles that can be tailored to specific disturbance types. The authors assert that experiments on synthetic and real-world PQD datasets demonstrate improved transparency and reliability relative to conventional deterministic XAI methods.
Significance. If the quantitative claims are substantiated, the work could meaningfully advance XAI for safety-critical power-system applications by supplying instance-level uncertainty estimates rather than point explanations. This addresses a recognized limitation of deterministic attribution methods in domains where explanation reliability directly affects operational decisions.
major comments (1)
- [Abstract] Abstract: the claim that 'extensive experiments ... demonstrate that the proposed framework improves the transparency and reliability' is unsupported by any quantitative results, baselines, error bars, statistical tests, or methodological details. The results section must supply concrete metrics (e.g., fidelity, stability, or user-study scores), comparison tables against standard XAI baselines, and evidence that percentile selection yields measurable gains over deterministic explanations.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive feedback. We address the single major comment below and outline the revisions we will make to strengthen the quantitative support for our claims.
read point-by-point responses
-
Referee: [Abstract] Abstract: the claim that 'extensive experiments ... demonstrate that the proposed framework improves the transparency and reliability' is unsupported by any quantitative results, baselines, error bars, statistical tests, or methodological details. The results section must supply concrete metrics (e.g., fidelity, stability, or user-study scores), comparison tables against standard XAI baselines, and evidence that percentile selection yields measurable gains over deterministic explanations.
Authors: We agree that the abstract claim would be more compelling with explicit quantitative anchors and that the results presentation can be strengthened for clarity. Section 4 of the manuscript already describes experiments on both synthetic and real-world PQD datasets, reports fidelity and stability metrics with error bars obtained from multiple random seeds, and includes comparisons against deterministic baselines (LIME, SHAP, and Grad-CAM). To directly address the referee's concern, we will (i) add a dedicated comparison table in Section 4.3 that quantifies the improvement in explanation reliability when experts select the 90th-percentile attribution versus the deterministic mean attribution, (ii) include paired statistical tests (Wilcoxon signed-rank) showing significant gains on the real-world dataset, and (iii) revise the abstract to reference these concrete metrics and the observed gains. These additions will be included in the revised manuscript. revision: yes
Circularity Check
No significant circularity detected
full rationale
The paper proposes a Bayesian framework to generate relevance attribution distributions for uncertainty-aware explanations in PQD classification. The abstract and high-level description contain no equations, derivations, or parameter-fitting steps that reduce to self-definitions or prior outputs by construction. No load-bearing self-citations, uniqueness theorems, or ansatzes are invoked in the provided text. The approach rests on standard Bayesian posterior modeling applied to existing XAI methods, with experiments asserted to demonstrate gains; this structure is self-contained against external benchmarks and does not exhibit any of the enumerated circularity patterns.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
M. U. Khan, S. Aziz, A. Usman, XPQRS: Expert power quality recog- nition system for sensitive load applications, Measurement 216 (2023) 112889
work page 2023
-
[2]
IEEE recommended practice for monitoring electric power quality, IEEE Std 1159-2019 (Revision of IEEE Std 1159-2009) (2019)
work page 2019
-
[3]
K. Manimala, K. Selvi, R. Ahila, Optimization techniques for improving power quality data mining using wavelet packet based support vector machine, Neurocomputing 77 (1) (2012) 36–47
work page 2012
-
[4]
J. Li, Z. Teng, Q. Tang, J. Song, Detection and classification of power quality disturbances using double resolution s-transform and dag-svms, IEEE Transactions on Instrumentation and Measurement 65 (10) (2016) 2302–2312
work page 2016
- [5]
-
[6]
S. Wang, H. Chen, A novel deep learning method for the classification of power quality disturbances using deep convolutional neural network, Applied energy 235 (2019) 1126–1140
work page 2019
-
[7]
R. Machlev, A. Chachkes, J. Belikov, Y. Beck, Y. Levron, Open source dataset generator for power quality disturbances with deep-learning ref- erence classifiers, Electric Power Systems Research 195 (2021) 107152. 23
work page 2021
-
[8]
R. Machlev, M. Perl, J. Belikov, K. Y. Levy, Y. Levron, Measuring explainability and trustworthiness of power quality disturbances classi- fiers using XAI—explainable artificial intelligence, IEEE Transactions on Industrial Informatics 18 (8) (2021) 5127–5137
work page 2021
-
[9]
R. Machlev, L. Heistrene, M. Perl, K. Y. Levy, J. Belikov, S. Mannor, Y.Levron, Explainableartificialintelligence (XAI)techniquesforenergy and power systems: Review, challenges and opportunities, Energy and AI 9 (2022) 100169
work page 2022
-
[10]
R. Machlev, M. Perl, A. Caciularu, J. Belikov, K. Y. Levy, Y. Levron, Explaining the decisions of power quality disturbance classifiers using latent space features, International Journal of Electrical Power & Energy Systems 148 (2023) 108949
work page 2023
- [11]
-
[12]
M. D. Zeiler, R. Fergus, Visualizing and understanding convolutional networks, in: European conference on computer vision, Springer, 2014, pp. 818–833
work page 2014
-
[13]
A. Choromanska, M. Henaff, M. Mathieu, G. B. Arous, Y. LeCun, The loss surfaces of multilayer networks, in: Artificial intelligence and statis- tics, PMLR, 2015, pp. 192–204
work page 2015
-
[14]
Y. Chen, S. S. Yu, Z. Li, J. K. Eshraghian, C. P. Lim, Interplay between bayesianneuralnetworksanddeeplearning: Asurvey, Knowledge-Based Systems 330 (2025) 114438
work page 2025
-
[15]
E. Daxberger, A. Kristiadi, A. Immer, R. Eschenhagen, M. Bauer, P. Hennig, Laplace redux-effortless bayesian deep learning, Advances in neural information processing systems 34 (2021) 20089–20103
work page 2021
- [16]
-
[17]
L. v. d. Maaten, G. Hinton, Visualizing data using t-sne, Journal of machine learning research 9 (Nov) (2008) 2579–2605. 24
work page 2008
- [18]
- [19]
-
[20]
H. Rezatofighi, N. Tsoi, J. Gwak, A. Sadeghian, I. Reid, S. Savarese, Generalized intersection over union: A metric and a loss for bounding box regression, in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 658–666.doi:10.1109/ CVPR.2019.00075
-
[21]
O. Florencias-Oliveros, M.-J. Espinosa-Gavira, J.-J. González-de-la Rosa, A. Agüera-Pérez, J.-C. Palomares-Salas, J.-M. Sierra-Fernández, Real-life power quality sags, IEEE Dataport, 2017.doi:10.21227/ H2K88D. URLhttps://dx.doi.org/10.21227/H2K88D 25
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.