pith. sign in

arxiv: 2407.20067 · v2 · submitted 2024-07-29 · 💻 cs.LG · cs.AI

xAI-Drop: Don't Use What You Cannot Explain

Pith reviewed 2026-05-23 22:57 UTC · model grok-4.3

classification 💻 cs.LG cs.AI
keywords graph neural networksexplainabilitydropping regularizationinterpretabilitynode droppinggeneralization
0
0 comments X

The pith

xAI-Drop drops nodes from GNNs based on how well the model can explain them, raising accuracy and explanation quality.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Graph Neural Networks struggle with generalization and interpretability when trained on graph data. The paper claims that explainability metrics, rather than random or heuristic rules, should determine which nodes to drop during training to reduce noise and complexity. xAI-Drop applies this idea by identifying nodes that contribute poorly to explanations and excluding them from the propagation step. On multiple real-world datasets the approach yields higher accuracy than existing dropping methods while also producing clearer model explanations.

Core claim

xAI-Drop is a topological-level dropping regularizer that leverages explainability to pinpoint noisy network elements to be excluded from the GNN propagation mechanism. An empirical evaluation on diverse real-world datasets demonstrates that our method outperforms current state-of-the-art dropping approaches in accuracy, and improves explanation quality.

What carries the argument

xAI-Drop, a dropping regularizer that selects nodes for exclusion using explainability scores instead of random or heuristic criteria.

Load-bearing premise

Explainability metrics can reliably identify nodes contributing to noise and over-complexity such that excluding them improves both accuracy and explanation quality without introducing new biases.

What would settle it

A real-world graph dataset on which xAI-Drop produces lower accuracy or worse explanation quality than standard dropping baselines.

Figures

Figures reproduced from arXiv: 2407.20067 by Andrea Passerini, Antonio Longa, Pietro Li\`o, Vincenzo Marco De Luca.

Figure 1
Figure 1. Figure 1: Illustration of the rationale behind XAI-DROP. Panel (a) shows a Barabási-Albert network with house-shaped motifs randomly attached. The task here is to classify nodes as either the top of a house (label 1) or otherwise (label 0). It is easy to see that a triangle is an approximate pattern for the positive class. The figure highlights three prototypical nodes (A, B, C) which are parts of a triangle, where … view at source ↗
Figure 2
Figure 2. Figure 2: A graphical representation of the node dropping strategy (XAI-DROPNODE) employed by the XAI-DROP algorithm in node classification tasks. Panel a) illustrates the confidence-based selection process, where nodes are selected if the model’s confidence is equal or greater than a specified threshold θ. Panel b) presents the computation of fidelity sufficiency scores and dropping probabilities for the nodes sele… view at source ↗
Figure 3
Figure 3. Figure 3: Test accuracy (left axis) and training time (right axis) when using different explainers for XAI-DROPNODE applied on Cora for node classification with GCN architecture. The dotted line represents the accuracy achieved when using the baseline DropNode random strategy. Baseline DropEdge DropMess. DropNode DropAgg Learn2Drop BBGDC ExPASS ENGAGE MATE xAI DropNode 0 200 400 600 800 1000 1200 Time (s) Cora Basel… view at source ↗
Figure 4
Figure 4. Figure 4: The histogram of the time in seconds required for training GCN on Cora, Citeseer, and PubMed with each regularization method used for node classification. In the analysis of the computational complexity, the number of parameters required by each method also plays a crucial role. As in [PITH_FULL_IMAGE:figures/full_fig_p016_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: The log-scale histogram of the parameters used for training GCN for node classification task with Cora, Citeseer, and Pubmed datasets. F Hyperparameter sensitivity F.1 Confidence One of the most important hyperparameters in our method is the confident threshold θ ∈ [0, 1]. This hyperparameter is necessary to decide whether a node is a candidate noisy node or not. To fully comprehend its rule, we can start … view at source ↗
Figure 6
Figure 6. Figure 6: Test Accuracy and standard deviations on some node classification datasets (Cora, Citeseer, and Pubmed) trained with GCN varying confidence threshold θ. F.2 Dropping probability The dropping probability p is a crucial hyperparameter for properly applying XAI-DROP. As with any dropping strategies, the tuning of this hyperparameter strongly depends on the input graph, and is also related to the design of the… view at source ↗
Figure 7
Figure 7. Figure 7: Test accuracy on Node classification with GCN on multiple datasets (Cora, Citeseer, Pubmed) varying dropping probability θ. G Evolution of node confidence and explainability, ablation studies The XAI-DROP method relies on a crucial intuition: the combination of confidence and explanation quality can be used as a proxy for pinpointing harmful nodes in a graph, the removal of which stabilizes training. To be… view at source ↗
Figure 8
Figure 8. Figure 8: Confusion matrices for an increasing number of training epochs, showing nodes with high confidence and good explanations (HC-GE), high confidence and poor explanations (HC-PE), low confidence and good explanations (LC-GE) and low confidence and poor explanations (LC-PE) [PITH_FULL_IMAGE:figures/full_fig_p018_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Histograms showing the average dropping probability of each node in a graph computed over all the training epochs for XAI-DROPNODE (left) and DROPNODE (right) respectively. nodes is especially detrimental, most likely because it destabilizes training removing instances that still need to be properly learned. Noisy node selection Criterion Cora Random 80.0±0.5 HighConfidence 80.6±0.4 LowConfidence 74.5±1.5 … view at source ↗
Figure 10
Figure 10. Figure 10: reports results of the different methods in terms of KL-Sufficiency and KL-Necessity, as only a reasonable trade-off between the two is an indicator of a good quality explanation. Results clearly indicate that XAI-DROP scores the best trade-off between the two metrics, thus achieving the best explanations for all datasets. 0.935 0.940 0.945 0.950 0.955 0.960 0.965 KLSufficiency 0.06 0.07 0.08 0.09 0.10 0.… view at source ↗
Figure 11
Figure 11. Figure 11: Examples of explanations generated using a saliency map on a GCN trained on the Cora network. default dropping probabilities into a range of probabilities p defined in the range [0, 1]. We have tested multiple mapping approaches, but two distributions better fill our needs: empirical cumulative distribution and Gaussian distribution (described in Equation 4). In [PITH_FULL_IMAGE:figures/full_fig_p022_11.png] view at source ↗
read the original abstract

Graph Neural Networks (GNNs) have emerged as the predominant paradigm for learning from graph-structured data, offering a wide range of applications from social network analysis to bioinformatics. Despite their versatility, GNNs face challenges such as lack of generalization and poor interpretability, which hinder their wider adoption and reliability in critical applications. Dropping has emerged as an effective paradigm for improving the generalization capabilities of GNNs. However, existing approaches often rely on random or heuristic-based selection criteria, lacking a principled method to identify and exclude nodes that contribute to noise and over-complexity in the model. In this work, we argue that explainability should be a key indicator of a model's quality throughout its training phase. To this end, we introduce xAI-Drop, a novel topological-level dropping regularizer that leverages explainability to pinpoint noisy network elements to be excluded from the GNN propagation mechanism. An empirical evaluation on diverse real-world datasets demonstrates that our method outperforms current state-of-the-art dropping approaches in accuracy, and improves explanation quality.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The paper proposes xAI-Drop, a topological-level dropping regularizer for Graph Neural Networks that uses explainability to identify and exclude nodes contributing to noise and over-complexity from the propagation mechanism. It claims this yields higher accuracy than state-of-the-art dropping methods and improved explanation quality, validated empirically on diverse real-world datasets.

Significance. If substantiated by detailed methods and results, the integration of explainability as a training-time quality indicator could advance reliable GNNs for applications like social network analysis and bioinformatics by addressing both generalization and interpretability in a principled manner rather than via random or heuristic dropping.

major comments (2)
  1. Abstract: The central empirical claim of outperforming SOTA dropping approaches in accuracy is presented without any equations, experimental protocol, baselines, datasets, metrics, or error analysis, rendering it impossible to assess whether the improvements are supported by evidence.
  2. Abstract: The method's soundness rests on the unexamined assumption that explainability metrics can reliably isolate nodes causing noise/over-complexity such that their exclusion simultaneously boosts accuracy and explanation quality without new biases; no derivation, validation procedure, or independence check for these metrics is supplied.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their thoughtful comments on the manuscript. We address each major comment below, providing clarifications based on the full paper content while noting where revisions may strengthen the presentation.

read point-by-point responses
  1. Referee: Abstract: The central empirical claim of outperforming SOTA dropping approaches in accuracy is presented without any equations, experimental protocol, baselines, datasets, metrics, or error analysis, rendering it impossible to assess whether the improvements are supported by evidence.

    Authors: We agree that the abstract is intentionally concise and high-level, as is conventional for the venue, and does not embed the full experimental details. The manuscript body supplies the requested elements: the xAI-Drop formulation (including the topological dropping equation based on explainability scores) appears in Section 3; the experimental protocol, baselines (e.g., DropEdge, GraphDrop), datasets (multiple real-world graphs from social and bioinformatics domains), metrics (accuracy plus explanation quality via fidelity and sparsity), and error analysis (means and standard deviations over 10 runs with statistical significance tests) are detailed in Sections 4 and 5 with tables and figures. This structure allows readers to evaluate the evidence. We will revise the abstract to include one additional sentence referencing the key evaluation setting and metrics for improved clarity. revision: partial

  2. Referee: Abstract: The method's soundness rests on the unexamined assumption that explainability metrics can reliably isolate nodes causing noise/over-complexity such that their exclusion simultaneously boosts accuracy and explanation quality without new biases; no derivation, validation procedure, or independence check for these metrics is supplied.

    Authors: The assumption is examined in the manuscript. Section 3 derives the dropping regularizer from the premise that low explainability scores indicate nodes contributing to noise or over-complexity, with the update rule explicitly formulated to exclude them from message passing. Validation occurs via the empirical results in Section 5, which demonstrate joint gains in accuracy and explanation quality (measured by post-hoc fidelity) across datasets, with ablation studies isolating the contribution of the explainability component. On potential new biases, the experiments compare explanation consistency before and after dropping and include controls for over-pruning. An explicit independence check between the explainability metric and the downstream task loss is not separately tabulated; we can add a short discussion or supplementary plot in revision if the referee deems it essential. revision: partial

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The manuscript presents an empirical regularizer (xAI-Drop) whose central claims rest on experimental comparisons across real-world datasets rather than any closed-form derivation, uniqueness theorem, or fitted parameter that is later renamed as a prediction. No equations, self-citations, or ansatzes are invoked in a load-bearing way that reduces the result to its own inputs by construction. The method is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract supplies no information on free parameters, axioms, or invented entities.

pith-pipeline@v0.9.0 · 5714 in / 892 out tokens · 17426 ms · 2026-05-23T22:57:54.077360+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

67 extracted references · 67 canonical work pages · 4 internal anchors

  1. [1]

    Graph neural networks for social recommendation

    Wenqi Fan, Yao Ma, Qing Li, Yuan He, Eric Zhao, Jiliang Tang, and Dawei Yin. Graph neural networks for social recommendation. In The world wide web conference, pages 417–426, 2019. 1, 7

  2. [2]

    Could graph neural networks learn better molecular representation for drug discovery? a comparison study of descriptor-based and graph-based models

    Dejun Jiang, Zhenxing Wu, Chang-Yu Hsieh, Guangyong Chen, Ben Liao, Zhe Wang, Chao Shen, Dongsheng Cao, Jian Wu, and Tingjun Hou. Could graph neural networks learn better molecular representation for drug discovery? a comparison study of descriptor-based and graph-based models. Journal of Cheminformatics, 13(1):12, Feb 2021. ISSN 1758-2946. doi: 10.1186/s...

  3. [3]

    Graph neural network for traffic forecasting: A survey

    Weiwei Jiang and Jiayun Luo. Graph neural network for traffic forecasting: A survey. Expert Systems with Applications, 207:117921, 2022. 1

  4. [4]

    M. Gori, G. Monfardini, and F. Scarselli. A new model for learning in graph domains. In Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005., volume 2, pages 729–734 vol. 2, 2005. doi: 10.1109/IJCNN.2005.1555942. 1

  5. [5]

    Semi-Supervised Classification with Graph Convolutional Networks

    Thomas N Kipf and Max Welling. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907, 2016. 7

  6. [6]

    Message passing neural networks

    Justin Gilmer, Samuel S Schoenholz, Patrick F Riley, Oriol Vinyals, and George E Dahl. Message passing neural networks. Machine learning meets quantum physics, pages 199–214, 2020

  7. [7]

    Zonghan Wu, Shirui Pan, Fengwen Chen, Guodong Long, Chengqi Zhang, and Philip S. Yu. A comprehensive survey on graph neural networks. IEEE Transactions on Neural Networks and Learning Systems, 32(1):4–24, 2021. 1

  8. [8]

    Neural message passing for quantum chemistry

    Justin Gilmer, Samuel S Schoenholz, Patrick F Riley, Oriol Vinyals, and George E Dahl. Neural message passing for quantum chemistry. In International conference on machine learning , pages 1263–1272. PMLR, 2017. 1

  9. [9]

    Graph convolutional neural networks for web-scale recommender systems

    Rex Ying, Ruining He, Kaifeng Chen, Pong Eksombatchai, William L Hamilton, and Jure Leskovec. Graph convolutional neural networks for web-scale recommender systems. In Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, pages 974–983, 2018

  10. [10]

    Graph neural networks for node classification

    Jian Tang and Renjie Liao. Graph neural networks for node classification. Graph Neural Networks: Foundations, Frontiers, and Applications, pages 41–61, 2022

  11. [11]

    The expressive power of pooling in graph neural networks

    Filippo Maria Bianchi and Veronica Lachi. The expressive power of pooling in graph neural networks. Advances in neural information processing systems, 36, 2024. 1

  12. [12]

    Collaborative representation learning for nodes and relations via heterogeneous graph neural network

    Weimin Li, Lin Ni, Jianjia Wang, and Can Wang. Collaborative representation learning for nodes and relations via heterogeneous graph neural network. Knowledge-Based Systems, 255: 109673, 2022. 1

  13. [13]

    Link prediction based on graph neural networks

    Muhan Zhang and Yixin Chen. Link prediction based on graph neural networks. Advances in neural information processing systems, 31, 2018

  14. [14]

    A simple and expressive graph neural network based method for structural link representation

    Veronica Lachi, Francesco Ferrini, Antonio Longa, Bruno Lepri, and Andrea Passerini. A simple and expressive graph neural network based method for structural link representation. In ICML 2024 Workshop on Geometry-grounded Representation Learning and Generative Modeling, 2024. URL https://openreview.net/forum?id=EGGSCLyVrz. 1

  15. [15]

    Graphrnn: Generat- ing realistic graphs with deep auto-regressive models

    Jiaxuan You, Rex Ying, Xiang Ren, William Hamilton, and Jure Leskovec. Graphrnn: Generat- ing realistic graphs with deep auto-regressive models. In International conference on machine learning, pages 5708–5717. PMLR, 2018. 1 10 xAI-Drop: Don’t Use What You Cannot Explain

  16. [16]

    Digress: Discrete denoising diffusion for graph generation

    Clement Vignac, Igor Krawczuk, Antoine Siraudin, Bohan Wang, V olkan Cevher, and Pas- cal Frossard. Digress: Discrete denoising diffusion for graph generation. arXiv preprint arXiv:2209.14734, 2022

  17. [17]

    A simple latent variable model for graph learning and inference

    Manfred Jaeger, Antonio Longa, Steve Azzolin, Oliver Schulte, and Andrea Passerini. A simple latent variable model for graph learning and inference. In Learning on Graphs Conference, pages 26–1. PMLR, 2024. 1

  18. [18]

    Graph neural networks: Methods, applications, and opportunities

    Lilapati Waikhom and Ripon Patgiri. Graph neural networks: Methods, applications, and opportunities. arXiv preprint arXiv:2108.10733, 2021. 1

  19. [19]

    A review of graph neural networks and their applications in power systems

    Wenlong Liao, Birgitte Bak-Jensen, Jayakrishnan Radhakrishna Pillai, Yuelong Wang, and Yusen Wang. A review of graph neural networks and their applications in power systems. Journal of Modern Power Systems and Clean Energy, 10(2):345–360, 2021

  20. [20]

    Survey of graph neural networks and applications

    Fan Liang, Cheng Qian, Wei Yu, David Griffith, and Nada Golmie. Survey of graph neural networks and applications. Wireless Communications and Mobile Computing, 2022(1):9261537,

  21. [21]

    Generalization and representational limits of graph neural networks

    Vikas Garg, Stefanie Jegelka, and Tommi Jaakkola. Generalization and representational limits of graph neural networks. In International Conference on Machine Learning, pages 3419–3430. PMLR, 2020. 1

  22. [22]

    Explaining the Explainers in Graph Neural Networks: a Comparative Study

    Antonio Longa, Steve Azzolin, Gabriele Santin, Giulia Cencetti, Pietro Liò, Bruno Lepri, and Andrea Passerini. Explaining the explainers in graph neural networks: a comparative study. arXiv preprint arXiv:2210.15304, 2022. 1

  23. [23]

    Shedding light on random dropping and oversmoothing

    Han Xuanyuan, Tianxiang Zhao, and Dongsheng Luo. Shedding light on random dropping and oversmoothing. In NeurIPS 2023 Workshop: New Frontiers in Graph Learning. 1, 2

  24. [24]

    A survey on oversmoothing in graph neural networks

    T Konstantin Rusch, Michael M Bronstein, and Siddhartha Mishra. A survey on oversmoothing in graph neural networks. arXiv preprint arXiv:2303.10993, 2023. 1

  25. [25]

    On the trade-off between over-smoothing and over-squashing in deep graph neural networks

    Jhony H Giraldo, Konstantinos Skianis, Thierry Bouwmans, and Fragkiskos D Malliaros. On the trade-off between over-smoothing and over-squashing in deep graph neural networks. In Proceedings of the 32nd ACM international conference on information and knowledge management, pages 566–576, 2023. 1

  26. [26]

    Deeper insights into graph convolutional networks for semi-supervised learning

    Qimai Li, Zhichao Han, and Xiao-Ming Wu. Deeper insights into graph convolutional networks for semi-supervised learning. In Proceedings of the AAAI conference on artificial intelligence, volume 32, 2018. 1

  27. [27]

    Dropmes- sage: Unifying random dropping for graph neural networks

    Taoran Fang, Zhiqing Xiao, Chunping Wang, Jiarong Xu, Xuan Yang, and Yang Yang. Dropmes- sage: Unifying random dropping for graph neural networks. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 37, pages 4267–4275, 2023. 1, 2, 6, 7, 14

  28. [28]

    Dropedge: Towards deep graph convolutional networks on node classification

    Yu Rong, Wenbing Huang, Tingyang Xu, and Junzhou Huang. Dropedge: Towards deep graph convolutional networks on node classification. arXiv preprint arXiv:1907.10903, 2019. 1, 2, 7

  29. [29]

    Dropgnn: Random dropouts increase the expressiveness of graph neural networks

    Pál András Papp, Karolis Martinkus, Lukas Faber, and Roger Wattenhofer. Dropgnn: Random dropouts increase the expressiveness of graph neural networks. Advances in Neural Information Processing Systems, 34:21997–22009, 2021. 1, 2, 7

  30. [30]

    Dropout: a simple way to prevent neural networks from overfitting

    Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research, 15(1):1929–1958, 2014. 2

  31. [31]

    Graph random neural networks for semi-supervised learning on graphs

    Wenzheng Feng, Jie Zhang, Yuxiao Dong, Yu Han, Huanbo Luan, Qian Xu, Qiang Yang, Evgeny Kharlamov, and Jie Tang. Graph random neural networks for semi-supervised learning on graphs. Advances in neural information processing systems, 33:22092–22103, 2020. 2, 14

  32. [32]

    Dropagg: Robust graph neural networks via drop aggregation

    Bo Jiang, Yong Chen, Beibei Wang, Haiyun Xu, and Bin Luo. Dropagg: Robust graph neural networks via drop aggregation. Neural Networks, 163:65–74, 2023. 2, 7, 14

  33. [33]

    Fairdrop: Biased edge dropout for enhancing fairness in graph representation learning

    Indro Spinelli, Simone Scardapane, Amir Hussain, and Aurelio Uncini. Fairdrop: Biased edge dropout for enhancing fairness in graph representation learning. IEEE Transactions on Artificial Intelligence, 3(3):344–354, 2021. 2

  34. [34]

    Learning to drop: Robust graph neural network via topological denoising

    Dongsheng Luo, Wei Cheng, Wenchao Yu, Bo Zong, Jingchao Ni, Haifeng Chen, and Xiang Zhang. Learning to drop: Robust graph neural network via topological denoising. InProceedings of the 14th ACM international conference on web search and data mining, pages 779–787, 2021. 2, 7, 14 11 xAI-Drop: Don’t Use What You Cannot Explain

  35. [35]

    Bayesian graph neural networks with adaptive con- nection sampling

    Arman Hasanzadeh, Ehsan Hajiramezanali, Shahin Boluki, Mingyuan Zhou, Nick Duffield, Krishna Narayanan, and Xiaoning Qian. Bayesian graph neural networks with adaptive con- nection sampling. In International conference on machine learning, pages 4094–4104. PMLR,

  36. [36]

    GNNInterpreter: A probabilistic generative model-level explanation for graph neural networks

    Xiaoqi Wang and Han Wei Shen. GNNInterpreter: A probabilistic generative model-level explanation for graph neural networks. In The Eleventh International Conference on Learning Representations, 2023. 3

  37. [37]

    Global explainability of GNNs via logic combination of learned concepts

    Steve Azzolin, Antonio Longa, Pietro Barbiero, Pietro Lio, and Andrea Passerini. Global explainability of GNNs via logic combination of learned concepts. InThe Eleventh International Conference on Learning Representations, 2023

  38. [38]

    Xgnn: Towards model-level explanations of graph neural networks

    Hao Yuan, Jiliang Tang, Xia Hu, and Shuiwang Ji. Xgnn: Towards model-level explanations of graph neural networks. In Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining, pages 430–438, 2020. 3

  39. [39]

    Gnnexplainer: Generating explanations for graph neural networks

    Zhitao Ying, Dylan Bourgeois, Jiaxuan You, Marinka Zitnik, and Jure Leskovec. Gnnexplainer: Generating explanations for graph neural networks. Advances in neural information processing systems, 32, 2019. 3, 15

  40. [40]

    Pgm-explainer: Probabilistic graphical model explanations for graph neural networks

    Minh Vu and My T Thai. Pgm-explainer: Probabilistic graphical model explanations for graph neural networks. Advances in neural information processing systems, 33:12225–12235, 2020. 3, 15

  41. [41]

    Interpretable and generalizable graph learning via stochastic attention mechanism

    Siqi Miao, Mia Liu, and Pan Li. Interpretable and generalizable graph learning via stochastic attention mechanism. In International Conference on Machine Learning, pages 15524–15543. PMLR, 2022

  42. [42]

    On explainability of graph neural networks via subgraph explorations

    Hao Yuan, Haiyang Yu, Jie Wang, Kang Li, and Shuiwang Ji. On explainability of graph neural networks via subgraph explorations. In International conference on machine learning, pages 12241–12252. PMLR, 2021. 3

  43. [43]

    A survey on explain- ability of graph neural networks

    Jaykumar Kakkad, Jaspal Jannu, Kartik Sharma, Charu Aggarwal, and Sourav Medya. A survey on explainability of graph neural networks. arXiv preprint arXiv:2306.01958, 2023. 3

  44. [44]

    Explainability methods for graph convolutional neural networks

    Phillip E Pope, Soheil Kolouri, Mohammad Rostami, Charles E Martin, and Heiko Hoff- mann. Explainability methods for graph convolutional neural networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10772–10781, 2019. 3

  45. [45]

    Graphlime: Local interpretable model explanations for graph neural networks

    Qiang Huang, Makoto Yamada, Yuan Tian, Dinesh Singh, and Yi Chang. Graphlime: Local interpretable model explanations for graph neural networks. IEEE Transactions on Knowledge and Data Engineering, 2022. 3

  46. [46]

    Axiomatic attribution for deep networks

    Mukund Sundararajan, Ankur Taly, and Qiqi Yan. Axiomatic attribution for deep networks. In International conference on machine learning, pages 3319–3328. PMLR, 2017. 3, 15

  47. [47]

    Parameterized explainer for graph neural network

    Dongsheng Luo, Wei Cheng, Dongkuan Xu, Wenchao Yu, Bo Zong, Haifeng Chen, and Xiang Zhang. Parameterized explainer for graph neural network. Advances in neural information processing systems, 33:19620–19631, 2020. 3, 15

  48. [48]

    Dag matters! gflownets enhanced explainer for graph neural networks

    Wenqian Li, Yinchuan Li, Zhigang Li, HAO Jianye, and Yan Pang. Dag matters! gflownets enhanced explainer for graph neural networks. In The Eleventh International Conference on Learning Representations, 2022. 3

  49. [49]

    Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps

    Karen Simonyan, Andrea Vedaldi, and Andrew Zisserman. Deep inside convolutional networks: Visualising image classification models and saliency maps. arXiv preprint arXiv:1312.6034,

  50. [50]

    Beyond explaining: Opportunities and challenges of xai-based model improvement

    Leander Weber, Sebastian Lapuschkin, Alexander Binder, and Wojciech Samek. Beyond explaining: Opportunities and challenges of xai-based model improvement. Information Fusion, 92:154–176, 2023. 3

  51. [51]

    Making deep neural networks right for the right scientific reasons by interacting with their explanations

    Patrick Schramowski, Wolfgang Stammer, Stefano Teso, Anna Brugger, Franziska Herbert, Xiaoting Shao, Hans-Georg Luigs, Anne-Katrin Mahlein, and Kristian Kersting. Making deep neural networks right for the right scientific reasons by interacting with their explanations. Nature Machine Intelligence, 2(8):476–486, 2020. 3 12 xAI-Drop: Don’t Use What You Cann...

  52. [52]

    Pruning by explaining: A novel criterion for deep neural network pruning

    Seul-Ki Yeom, Philipp Seegerer, Sebastian Lapuschkin, Alexander Binder, Simon Wiedemann, Klaus-Robert Müller, and Wojciech Samek. Pruning by explaining: A novel criterion for deep neural network pruning. Pattern Recognition, 115:107899, 2021. 3

  53. [53]

    A meta-learning approach for training explainable graph neural networks

    Indro Spinelli, Simone Scardapane, and Aurelio Uncini. A meta-learning approach for training explainable graph neural networks. IEEE Transactions on Neural Networks and Learning Systems, 2022. 3, 7

  54. [54]

    To- wards training gnns using explanation directed message passing

    Valentina Giunchiglia, Chirag Varun Shukla, Guadalupe Gonzalez, and Chirag Agarwal. To- wards training gnns using explanation directed message passing. In Learning on Graphs Conference, pages 28–1. PMLR, 2022. 3, 7, 14

  55. [55]

    Engage: Explanation guided data augmentation for graph representation learning

    Yucheng Shi, Kaixiong Zhou, and Ninghao Liu. Engage: Explanation guided data augmentation for graph representation learning. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pages 104–121. Springer, 2023. 3, 7, 14

  56. [56]

    The graph neural network model

    Franco Scarselli, Marco Gori, Ah Chung Tsoi, Markus Hagenbuchner, and Gabriele Monfardini. The graph neural network model. IEEE Transactions on Neural Networks, 20(1):61–80, 2009. doi: 10.1109/TNN.2008.2005605. 3

  57. [57]

    Neural network for graphs: A contextual constructive approach

    Alessio Micheli. Neural network for graphs: A contextual constructive approach. IEEE Transactions on Neural Networks, 20(3):498–511, 2009

  58. [58]

    Kipf and Max Welling

    Thomas N. Kipf and Max Welling. Semi-supervised classification with graph convolutional networks. In International Conference on Learning Representations, 2017. 3

  59. [59]

    Bagel: A benchmark for assessing graph neural network explanations

    Mandeep Rathee, Thorben Funke, Avishek Anand, and Megha Khosla. Bagel: A benchmark for assessing graph neural network explanations. arXiv preprint arXiv:2206.13983, 2022. 4

  60. [60]

    A new family of power transformations to improve normality or symmetry

    In-Kwon Yeo and Richard A Johnson. A new family of power transformations to improve normality or symmetry. Biometrika, 87(4):954–959, 2000. 5

  61. [61]

    Automating the construction of internet portals with machine learning

    Andrew Kachites McCallum, Kamal Nigam, Jason Rennie, and Kristie Seymore. Automating the construction of internet portals with machine learning. Information Retrieval, 3:127–163,

  62. [62]

    Citeseer: An automatic citation indexing system

    C Lee Giles, Kurt D Bollacker, and Steve Lawrence. Citeseer: An automatic citation indexing system. In Proceedings of the third ACM conference on Digital libraries, pages 89–98, 1998. 7

  63. [63]

    Revisiting semi-supervised learning with graph embeddings

    Zhilin Yang, William Cohen, and Ruslan Salakhudinov. Revisiting semi-supervised learning with graph embeddings. In International conference on machine learning, pages 40–48. PMLR,

  64. [64]

    Graph attention networks

    Petar Veliˇckovi´c, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Liò, and Yoshua Bengio. Graph attention networks. In International Conference on Learning Representations,

  65. [65]

    How Powerful are Graph Neural Networks?

    Keyulu Xu, Weihua Hu, Jure Leskovec, and Stefanie Jegelka. How powerful are graph neural networks? arXiv preprint arXiv:1810.00826, 2018. 7

  66. [66]

    Learning and Reasoning with Graph-Structured Representations

    Federico Baldassarre and Hossein Azizpour. Explainability techniques for graph convolu- tional networks. In ICML 2019 Workshop" Learning and Reasoning with Graph-Structured Representations", 2019. 15

  67. [67]

    Microsoft academic graph: When experts are not enough

    Kuansan Wang, Zhihong Shen, Chiyuan Huang, Chieh-Han Wu, Yuxiao Dong, and Anshul Kanakia. Microsoft academic graph: When experts are not enough. Quantitative Science Studies, 1(1):396–413, 2020. 21 A Dataset Statistics Table 5 outlines the key characteristics of the dataset, including the number of classes, nodes, and unidirectional edges in the networks....