pith. sign in

hub Mixed citations

On the Robustness of Interpretability Methods

Mixed citation behavior. Most common role is background (60%).

16 Pith papers citing it
Background 60% of classified citations
abstract

We argue that robustness of explanations---i.e., that similar inputs should give rise to similar explanations---is a key desideratum for interpretability. We introduce metrics to quantify robustness and demonstrate that current methods do not perform well according to these metrics. Finally, we propose ways that robustness can be enforced on existing interpretability approaches.

hub tools

citation-role summary

background 3 method 2

citation-polarity summary

clear filters

representative citing papers

Architecture-Aware Explanation Auditing for Industrial Visual Inspection

cs.LG · 2026-05-14 · unverdicted · novelty 6.0 · 2 refs

The paper proposes an architecture-aware explanation audit protocol demonstrating that perturbation-based faithfulness is bounded by structural compatibility between explainer and model readout rather than architecture family.

Explaining Predictions from Tree-based Boosting Ensembles

cs.LG · 2019-07-04 · unverdicted · novelty 6.0

Develops a method to find minimal input perturbations that flip GBDT predictions by extending random-forest counterfactuals to account for sequential tree dependencies and negative-gradient training.

Interpretability Can Be Actionable

cs.LG · 2026-05-11 · conditional · novelty 6.0

Interpretability research should be judged by actionability—the degree to which its insights support concrete decisions and interventions—rather than explanatory power alone.

GESD: Beyond Outcome-Oriented Fairness

cs.LG · 2026-05-14 · unverdicted · novelty 5.0

The paper proposes GESD, a procedural fairness metric for group disparities in explanation stability and robustness, and integrates it into the FEU multi-objective optimization framework.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • Explaining Predictions from Tree-based Boosting Ensembles cs.LG · 2019-07-04 · unverdicted · none · ref 2 · internal anchor

    Develops a method to find minimal input perturbations that flip GBDT predictions by extending random-forest counterfactuals to account for sequential tree dependencies and negative-gradient training.