pith. sign in

arxiv: 2407.11089 · v2 · submitted 2024-07-14 · 💻 cs.LG

Explainable bank failure prediction models: Counterfactual explanations to reduce the failure risk

Pith reviewed 2026-05-23 22:51 UTC · model grok-4.3

classification 💻 cs.LG
keywords bank failure predictioncounterfactual explanationsexplainable AImachine learningimbalanced datacost sensitive learningvalidity proximity sparsityblack box models
0
0 comments X

The pith

Nearest Instance Counterfactual Explanation, especially with cost-sensitive balancing, produces the highest quality suggestions for reducing bank failure risk.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests several methods for generating counterfactual explanations on black-box machine learning models that predict US bank failures. It compares WhatIf, Multi Objective, and Nearest Instance Counterfactual Explanation approaches, combined with resampling techniques including cost-sensitive learning to handle imbalanced data. Results show that Nearest Instance and Multi Objective methods score highest on validity, proximity, and sparsity, with the cost-sensitive approach delivering the most desirable outcomes overall. This turns opaque high-accuracy predictions into concrete advice on which bank features to change to flip a failure prediction. Readers would care because it makes complex models usable for actual risk reduction by bank managers.

Core claim

The Nearest Instance Counterfactual Explanation method yields higher quality counterfactual explanations, mainly using the cost sensitive approach. The Multi Objective Counterfactual and Nearest Instance Counterfactual Explanation methods outperform others regarding validity, proximity, and sparsity metrics, with the cost sensitive approach providing the most desirable counterfactual explanations.

What carries the argument

Counterfactual generation methods (WhatIf, Multi Objective Counterfactual Explanation, Nearest Instance Counterfactual Explanation) evaluated on ML models for bank failure prediction, scored on validity, proximity, sparsity, and plausibility across balancing strategies.

If this is right

  • Bank managers can receive minimal, actionable changes to input features to reverse a failure prediction from black-box models.
  • Cost-sensitive resampling should be preferred when generating counterfactuals for imbalanced failure datasets.
  • Performance of counterfactual methods varies by balancing strategy, so the choice of method must be validated per dataset.
  • Combining these explanations with logistic regression or other interpretable models could further improve regulatory acceptance.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the counterfactuals prove plausible in practice, they could be embedded in regulatory early-warning systems to trigger targeted interventions.
  • Extending the evaluation to non-US bank datasets would test whether the superiority of Nearest Instance and cost-sensitive methods holds across different regulatory environments.
  • Integrating these methods into live monitoring dashboards could allow banks to simulate 'what-if' scenarios before making operational changes.

Load-bearing premise

The suggested changes in the counterfactuals are feasible for real banks to implement and the models capture genuine drivers of failure rather than dataset artifacts.

What would settle it

A study that tracks actual banks following the suggested feature changes and measures whether their real failure rates drop would falsify the claim if no reduction occurs.

Figures

Figures reproduced from arXiv: 2407.11089 by Gizem Altun, Mustafa Cavus, Seyma Gunonu.

Figure 1
Figure 1. Figure 1: Mean and standard deviations plot of the counterfactual properties for the methods and resampling strategies [PITH_FULL_IMAGE:figures/full_fig_p014_1.png] view at source ↗
read the original abstract

The accuracy and understandability of bank failure prediction models are crucial. While interpretable models like logistic regression are favored for their explainability, complex models such as random forest, support vector machines, and deep learning offer higher predictive performance but lower explainability. These models, known as black boxes, make it difficult to derive actionable insights. To address this challenge, using counterfactual explanations is suggested. These explanations demonstrate how changes in input variables can alter the model output and suggest ways to mitigate bank failure risk. The key challenge lies in selecting the most effective method for generating useful counterfactuals, which should demonstrate validity, proximity, sparsity, and plausibility. The paper evaluates several counterfactual generation methods: WhatIf, Multi Objective, and Nearest Instance Counterfactual Explanation, and also explores resampling methods like undersampling, oversampling, SMOTE, and the cost sensitive approach to address data imbalance in bank failure prediction in the US. The results indicate that the Nearest Instance Counterfactual Explanation method yields higher quality counterfactual explanations, mainly using the cost sensitive approach. Overall, the Multi Objective Counterfactual and Nearest Instance Counterfactual Explanation methods outperform others regarding validity, proximity, and sparsity metrics, with the cost sensitive approach providing the most desirable counterfactual explanations. These findings highlight the variability in the performance of counterfactual generation methods across different balancing strategies and machine learning models, offering valuable strategies to enhance the utility of black box bank failure prediction models.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper evaluates several counterfactual explanation methods (WhatIf, Multi Objective Counterfactual, Nearest Instance Counterfactual Explanation) paired with imbalance-handling strategies (undersampling, oversampling, SMOTE, cost-sensitive) for post-hoc explanation of black-box bank failure prediction models (random forest, SVM, deep learning) trained on US bank data. It claims that the Nearest Instance and Multi Objective methods, especially under the cost-sensitive approach, produce superior counterfactuals according to validity, proximity, and sparsity metrics and thereby offer actionable strategies to mitigate bank failure risk.

Significance. If the comparative results hold after addressing the gaps below, the work supplies concrete, method-level guidance on generating explanations from high-accuracy but opaque models in a high-stakes financial domain where regulatory interpretability requirements are strict. The explicit inclusion of multiple balancing techniques and the focus on an applied prediction task distinguish it from purely methodological counterfactual papers.

major comments (2)
  1. [Abstract] Abstract: The abstract asserts that useful counterfactuals must satisfy four properties—validity, proximity, sparsity, and plausibility—yet only the first three are quantified in the reported experiments. No quantitative or qualitative evaluation of plausibility (feasibility of suggested changes to capital ratios, asset composition, etc., under regulatory or economic constraints) is provided, directly weakening the central claim that the generated explanations can reduce failure risk.
  2. [Results / Experimental Setup] §4 (or equivalent results section): The evaluation omits any comparison against inherently interpretable baselines such as logistic regression or decision trees, even though the introduction notes that these models are already favored for explainability. Without this baseline, it is unclear whether the added complexity of black-box models plus counterfactual generation is necessary or superior to simpler alternatives that require no post-hoc explanation.
minor comments (2)
  1. [Experimental Setup] The manuscript does not report dataset size, number of features, class imbalance ratio, or cross-validation scheme, making it difficult to assess the stability of the validity/proximity/sparsity numbers.
  2. [Results] No statistical significance tests or error bars are shown for the metric comparisons across methods and balancing strategies.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. We address each major comment below, indicating planned revisions where appropriate to strengthen the work.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The abstract asserts that useful counterfactuals must satisfy four properties—validity, proximity, sparsity, and plausibility—yet only the first three are quantified in the reported experiments. No quantitative or qualitative evaluation of plausibility (feasibility of suggested changes to capital ratios, asset composition, etc., under regulatory or economic constraints) is provided, directly weakening the central claim that the generated explanations can reduce failure risk.

    Authors: We agree that plausibility is a key property for actionable counterfactuals in the bank failure domain and that its omission from the quantitative evaluation represents a limitation. The abstract introduces four properties as desiderata for useful counterfactuals, but the experiments focus on the three that can be directly measured via standard metrics. We will revise the abstract to explicitly note the three evaluated metrics and add a new subsection discussing plausibility qualitatively, drawing on domain knowledge of US bank regulatory constraints (e.g., capital ratio thresholds and asset composition feasibility). This will better support the risk-mitigation claims without overclaiming quantitative results. revision: yes

  2. Referee: [Results / Experimental Setup] §4 (or equivalent results section): The evaluation omits any comparison against inherently interpretable baselines such as logistic regression or decision trees, even though the introduction notes that these models are already favored for explainability. Without this baseline, it is unclear whether the added complexity of black-box models plus counterfactual generation is necessary or superior to simpler alternatives that require no post-hoc explanation.

    Authors: The manuscript centers on post-hoc counterfactual explanations for black-box models that achieve higher predictive performance on imbalanced bank failure data, as noted in the introduction. We acknowledge that including interpretable baselines would help contextualize the value of the black-box + counterfactual approach. We will add a comparison in the results section against logistic regression and decision trees, reporting both predictive metrics (to show performance trade-offs) and inherent interpretability (via feature importance or rules). This will clarify when the added complexity is justified. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical benchmark of off-the-shelf methods

full rationale

The paper conducts a standard empirical comparison of existing counterfactual generation algorithms (WhatIf, Multi Objective, Nearest Instance Counterfactual Explanation) applied to bank-failure classifiers, using public US bank data and standard balancing techniques. Reported results are direct measurements of validity, proximity, and sparsity on held-out instances; no equations, fitted parameters, or self-citations are invoked to derive the performance numbers. The central claims rest on observable differences across methods rather than any reduction of outputs to author-defined inputs or prior self-work. Plausibility is listed as a desideratum but its absence from the evaluation is a completeness issue, not a circularity in the reported metrics.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

Central claim rests on standard ML assumptions plus domain-specific premises about counterfactual quality metrics; no invented entities or heavy free-parameter fitting are visible in the abstract.

free parameters (2)
  • Hyperparameters of counterfactual generators and base ML models
    Choice and tuning of random forest, SVM, deep learning, and the three explanation algorithms are required but not specified.
  • Parameters of resampling methods (SMOTE ratio, cost matrix weights)
    Balancing strategies involve tunable parameters whose values affect the reported quality ordering.
axioms (2)
  • domain assumption Counterfactual explanations can be produced that simultaneously satisfy validity, proximity, sparsity, and plausibility on bank financial data.
    Invoked when the abstract identifies these four properties as the criteria for useful counterfactuals.
  • domain assumption The chosen ML models (RF, SVM, DL) achieve higher predictive performance than logistic regression on the US bank failure task.
    Stated as background motivation for using black-box models.

pith-pipeline@v0.9.0 · 5786 in / 1543 out tokens · 28241 ms · 2026-05-23T22:51:36.733581+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

  • IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear
    ?
    unclear

    Relation between the paper passage and the cited Recognition theorem.

    The results indicate that the Nearest Instance Counterfactual Explanation method yields higher quality counterfactual explanations, mainly using the cost sensitive approach... validity, proximity, and sparsity metrics, with the cost sensitive approach providing the most desirable counterfactual explanations.

  • IndisputableMonolith/Foundation/BranchSelection.lean branch_selection unclear
    ?
    unclear

    Relation between the paper passage and the cited Recognition theorem.

    Plausibility refers to the extent to which a CE is logical and realistic... Plausibility assesses the logical coherence and realism of a CE x′.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

54 extracted references · 54 canonical work pages · 2 internal anchors

  1. [1]

    & Vir´ ag, M

    Krist´ of, T. & Vir´ ag, M. EU-27 bank failure prediction with C5. 0 decision trees and deep learning neural networks. Research In International Business And Finance . 61 pp. 101644 (2022)

  2. [2]

    & Viviani, J

    Le, H. & Viviani, J. Predicting bank failure: An improvement by implementing a machine-learning approach to classical financial ratios. Research In International Business And Finance. 44 pp. 16-25 (2018)

  3. [3]

    & Agrapetidou, A

    Gogas, P., Papadimitriou, T. & Agrapetidou, A. Forecasting bank failures and stress testing: A machine learning approach. International Journal Of Forecasting. 34, 440-455 (2018)

  4. [4]

    & Momparler, A

    Carmona, P., Climent, F. & Momparler, A. Predicting failure in the US bank- ing sector: An extreme gradient boosting approach. International Review Of Eco- nomics And Finance. 61 pp. 304-323 (2019)

  5. [5]

    & Vlachogiannakis, N

    Petropoulos, A., Siakoulis, V., Stavroulakis, E. & Vlachogiannakis, N. Predict- ing bank insolvencies using machine learning techniques. International Journal Of Forecasting. 36, 1092-1113 (2020)

  6. [6]

    & Baourakis, G

    Manthoulis, G., Doumpos, M., Zopounidis, C., Galariotis, E. & Baourakis, G. Bank failure prediction: a comparison of machine learning approaches. Financial Risk Management And Modeling . pp. 349-366 (2021)

  7. [7]

    Others Explainable Artifi- cial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI

    Arrieta, A., Diaz-Rodriguez, N., Del Ser, J., Bennetot, A., Tabik, S., Barbado, A., Garcia, S., Gil-Lopez, S., Molina, D., Benjamins, R. Others Explainable Artifi- cial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Information Fusion. 58 pp. 82-115 (2020) 18 Gunonu et al

  8. [8]

    & M¨ uller, K

    Samek, W. & M¨ uller, K. Towards explainable artificial intelligence. Explainable AI: Interpreting, Explaining And Visualizing Deep Learning . pp. 5-22 (2019)

  9. [9]

    & Climent, F

    Momparler, A., Carmona, P. & Climent, F. Banking failure prediction: a boosting classification tree approach. Spanish Journal Of Finance And Accounting/Revista Espa˜ nola De Financiaci´ on Y Contabilidad. 45, 63-91 (2016)

  10. [10]

    & Hwang, H

    Park, M., Son, H., Hyun, C. & Hwang, H. Explainability of machine learning models for bankruptcy prediction. Ieee Access. 9 pp. 124887-124899 (2021)

  11. [11]

    & Chen, X

    Zhang, Z., Wu, C., Qu, S. & Chen, X. An explainable artificial intelligence ap- proach for financial distress prediction. Information Processing And Management. 59, 102988 (2022)

  12. [12]

    Liu, J., Li, C., Ouyang, P., Liu, J. & Wu, C. Interpreting the prediction results of the tree-based gradient boosting models for financial distress prediction with an explainable machine learning approach. Journal Of Forecasting. 42, 1112-1137 (2023)

  13. [13]

    & Valera, I

    Karimi, A., Barthe, G., Balle, B. & Valera, I. Model-agnostic counterfactual ex- planations for consequential decisions. International Conference On Artificial In- telligence And Statistics . pp. 895-905 (2020)

  14. [14]

    & Tan, C

    Mothilal, R., Sharma, A. & Tan, C. Explaining machine learning classifiers through diverse counterfactual explanations. Proceedings Of The 2020 Conference On Fair- ness, Accountability, And Transparency. pp. 607-617 (2020)

  15. [15]

    & Keserci, B

    Tanyel, T., Ayvaz, S. & Keserci, B. Beyond known reality: Exploiting counterfac- tual explanations for medical research. ArXiv Preprint arXiv:2307.02131 . (2023)

  16. [16]

    & Ragos, O

    Tsiakmaki, M. & Ragos, O. A Case Study of Interpretable Counterfactual Ex- planations for the Task of Predicting Student Academic Performance. 2021 25th International Conference On Circuits, Systems, Communications And Computers (CSCC). pp. 120-125 (2021)

  17. [17]

    & Bai, J

    Zhang, H., Dong, J., Lv, C., Lin, Y. & Bai, J. Visual analytics of potential dropout behavior patterns in online learning based on the counterfactual explanation.Jour- nal Of Visualization . 26, 723-741 (2023)

  18. [18]

    Interpretable Credit Application Predictions With Counterfactual Explanations

    Grath, R., Costabello, L., Van, C., Sweeney, P., Kamiab, F., Shen, Z. & Lecue, F. Interpretable credit application predictions with counterfactual explanations. ArXiv Preprint arXiv:1811.05245 . (2018)

  19. [19]

    & Vandierendonck, H

    Dastile, X., Celik, T. & Vandierendonck, H. Model-agnostic counterfactual expla- nations in credit scoring. IEEE Access. 10 pp. 69543-69554 (2022)

  20. [20]

    & Wen, B

    Wang, D., Chen, Z., Florescu, I. & Wen, B. A sparsity algorithm for finding optimal counterfactual explanations: Application to corporate credit rating. Research In International Business And Finance . 64 pp. 101869 (2023)

  21. [21]

    & Shin, K

    Cho, S. & Shin, K. Feature-weighted counterfactual-based explanation for bankruptcy prediction. Expert Systems With Applications . 216 pp. 119390 (2023)

  22. [22]

    An evaluation study on counterfactual expla- nations of bank failure prediction models

    Gunonu, S., Altun, G., & Cavus, M. An evaluation study on counterfactual expla- nations of bank failure prediction models. Proceedings of the V. International Ap- plied Statistics Congress, Istanbul, Turkiye, May 21-23, 2024. pp. 383–388 (2024)

  23. [23]

    & Wilson, J

    Wexler, J., Pushkarna, M., Bolukbasi, T., Wattenberg, M., Vi´ egas, F. & Wilson, J. The what-if tool: Interactive probing of machine learning models. IEEE Trans- actions On Visualization And Computer Graphics . 26, 56-65 (2019)

  24. [24]

    & Martens, D

    Brughmans, D., Leyman, P. & Martens, D. Nice: an algorithm for nearest in- stance counterfactual explanations. Data Mining And Knowledge Discovery . pp. 1-39 (2023)

  25. [25]

    & Bischl, B

    Dandl, S., Molnar, C., Binder, M. & Bischl, B. Multi-objective counterfactual explanations. International Conference On Parallel Problem Solving From Nature . pp. 448-469 (2020) Explainable bank failure prediction models 19

  26. [26]

    Counterfactuals in explainable artificial intelligence (XAI): Evidence from human reasoning

    Byrne, R. Counterfactuals in explainable artificial intelligence (XAI): Evidence from human reasoning.. IJCAI. pp. 6276-6282 (2019)

  27. [27]

    & Hammer, B

    Artelt, A. & Hammer, B. On the computation of counterfactual explanations–A survey. ArXiv Preprint arXiv:1911.07749 . (2019)

  28. [28]

    & Smyth, B

    Keane, M. & Smyth, B. Good counterfactuals and where to find them: A case-based technique for generating counterfactuals for explainable AI (XAI).Case-Based Rea- soning Research And Development: 28th International Conference, ICCBR 2020, Salamanca, Spain, June 8–12, 2020, Proceedings 28 . pp. 163-178 (2020)

  29. [29]

    & Keane, M

    Smyth, B. & Keane, M. A few good counterfactuals: generating interpretable, plausible and diverse counterfactual explanations. International Conference On Case-Based Reasoning. pp. 18-32 (2022)

  30. [30]

    & Ham- mer, B

    Artelt, A., Vaquet, V., Velioglu, R., Hinder, F., Brinkrolf, J., Schilling, M. & Ham- mer, B. Evaluating the robustness of counterfactual explanations.2021 IEEE Sym- posium Series On Computational Intelligence (SSCI) . pp. 01-09 (2021)

  31. [31]

    & Casalicchio, G

    Dandl, S., Hofheinz, A., Binder, M., Bischl, B. & Casalicchio, G. Counterfac- tuals: an R package for counterfactual explanation methods. ArXiv Preprint arXiv:2304.06569. (2023)

  32. [32]

    & Assent, I

    Hvilshøj, F., Iosifidis, A. & Assent, I. On quantitative evaluations of counterfactu- als. ArXiv Preprint arXiv:2111.00177 . (2021)

  33. [33]

    & Zhang, Y

    Ge, Y., Liu, S., Li, Z., Xu, S., Geng, S., Li, Y., Tan, J., Sun, F. & Zhang, Y. Counterfactual evaluation for explainable AI. ArXiv Preprint arXiv:2109.01962 . (2021)

  34. [34]

    & S´ everin, E

    Du Jardin, P. & S´ everin, E. Predicting corporate bankruptcy using a self-organizing map: An empirical study to improve the forecasting horizon of a financial failure model. Decision Support Systems. 51, 701-711 (2011)

  35. [35]

    & Galariotis, E

    Manthoulis, G., Doumpos, M., Zopounidis, C. & Galariotis, E. An ordinal classifi- cation framework for bank failure prediction: Methodology and empirical evidence for US banks. European Journal Of Operational Research. 282, 786-801 (2020)

  36. [36]

    & Ortega, J

    Audrino, F., Kostrov, A. & Ortega, J. Predicting US bank failures with MI- DAS logit models. Journal Of Financial And Quantitative Analysis . 54, 2575-2603 (2019)

  37. [37]

    Bank failure prediction models addressing imbalanced data and out-of-time performance

    Gunonu, S., Altun, G., & Cavus, M. Bank failure prediction models addressing imbalanced data and out-of-time performance. Proceedings of the 7th International Researchers, Statisticians, and Young Statisticians Congress, Istanbul, Turkiye, November 2-5, 2023. pp. 185–199 (2023)

  38. [38]

    & Kegelmeyer, W

    Chawla, N., Bowyer, K., Hall, L. & Kegelmeyer, W. SMOTE: synthetic minority over-sampling technique. Journal Of Artificial Intelligence Research . 16 pp. 321- 357 (2002)

  39. [39]

    & Van Calster, B

    Goorbergh, R., Smeden, M., Timmerman, D. & Van Calster, B. The harm of class imbalance corrections for risk prediction models: illustration and simulation using logistic regression. Journal Of The American Medical Informatics Association . 29, 1525-1534 (2022)

  40. [40]

    & Pisani, P

    Junior, J. & Pisani, P. Performance and model complexity on imbalanced datasets using resampling and cost-sensitive algorithms. Fourth International Workshop On Learning With Imbalanced Domains: Theory And Applications . pp. 83-97 (2022)

  41. [41]

    & Biecek, P

    Stando, A., Cavus, M. & Biecek, P. The effect of balancing methods on model behavior in imbalanced classification problems. Fifth International Workshop On Learning With Imbalanced Domains: Theory And Applications . pp. 16-30 (2024)

  42. [42]

    An Experimental Study on the Rashomon Effect of Balancing Methods in Imbalanced Classification

    Cavus, M. & Biecek, P. An Experimental Study on the Rashomon Effect of Bal- ancing Methods in Imbalanced Classification. ArXiv Preprint arXiv:2405.01557 . (2024) 20 Gunonu et al

  43. [43]

    G., Calster, B., van Smeden, M.: Harms of class imbalance corrections for ML prediction models: A simulation study

    Carriero, A., Luijken, K., Hond, A., Moons, K., Calster, B. & Smeden, M. The harms of class imbalance corrections for machine learning based prediction models: a simulation study. ArXiv Preprint arXiv:2404.19494 . (2024)

  44. [44]

    & Averbuch-Elor, H

    Elor, Y. & Averbuch-Elor, H. To SMOTE, or not to SMOTE? ArXiv Preprint arXiv:2201.08528. (2022)

  45. [45]

    & Varoquaux, G

    Grinsztajn, L., Oyallon, E. & Varoquaux, G. Why do tree-based models still out- perform deep learning on typical tabular data? Advances In Neural Information Processing Systems. 35 pp. 507-520 (2022)

  46. [46]

    & Stone, C

    Breiman, L., Friedman, J., Olshen, R. & Stone, C. Classification and Regression Trees. (Wadsworth,1984)

  47. [47]

    Random forests

    Breiman, L. Random forests. Machine Learning. 45 pp. 5-32 (2001)

  48. [48]

    & Wehenkel, L

    Geurts, P., Ernst, D. & Wehenkel, L. Extremely randomized trees. Machine Learn- ing. 63 pp. 3-42 (2006)

  49. [49]

    Counterfactual explanations and how to find them: literature review and benchmarking

    Guidotti, R. Counterfactual explanations and how to find them: literature review and benchmarking. Data Mining And Knowledge Discovery . pp. 1-55 (2022)

  50. [50]

    & Hines, K

    Verma, S., Dickerson, J. & Hines, K. Counterfactual explanations for machine learning: A review. ArXiv Preprint arXiv:2010.10596 . 2 pp. 1 (2020)

  51. [51]

    Warren, M

    Warren, G., Keane, M., Gueret, C. & Delaney, E. Explaining groups of in- stances counterfactually for XAI: a use case, algorithm and user study for group- counterfactuals. ArXiv Preprint arXiv:2303.09297 . (2023)

  52. [52]

    & Russell, C

    Wachter, S., Mittelstadt, B. & Russell, C. Counterfactual explanations without opening the black box: Automated decisions and the GDPR. Harv. JL And Tech.. 31 pp. 841 (2017)

  53. [53]

    & Pillmore, B

    Dar, U. & Pillmore, B. fdicdata: Accessing FDIC Bank Data. (2023), https://CRAN.R-project.org/package=fdicdata, R package version 0.1.0

  54. [54]

    & Naryanaswamy, B

    Lipton, Z., Elkan, C. & Naryanaswamy, B. Optimal thresholding of classifiers to maximize F1 measure. Machine Learning And Knowledge Discovery In Databases: European Conference, ECML PKDD 2014, Nancy, France, September 15-19, 2014. Proceedings, Part II 14. pp. 225-239 (2014)