pith. sign in

arxiv: 1806.07421 · v3 · pith:OTIDWWPGnew · submitted 2018-06-19 · 💻 cs.CV

RISE: Randomized Input Sampling for Explanation of Black-box Models

classification 💻 cs.CV
keywords importanceriseapproachinputapproachesblack-boxdeepmethods
0
0 comments X
read the original abstract

Deep neural networks are being used increasingly to automate data analysis and decision making, yet their decision-making process is largely unclear and is difficult to explain to the end users. In this paper, we address the problem of Explainable AI for deep neural networks that take images as input and output a class probability. We propose an approach called RISE that generates an importance map indicating how salient each pixel is for the model's prediction. In contrast to white-box approaches that estimate pixel importance using gradients or other internal network state, RISE works on black-box models. It estimates importance empirically by probing the model with randomly masked versions of the input image and obtaining the corresponding outputs. We compare our approach to state-of-the-art importance extraction methods using both an automatic deletion/insertion metric and a pointing metric based on human-annotated object segments. Extensive experiments on several benchmark datasets show that our approach matches or exceeds the performance of other methods, including white-box approaches. Project page: http://cs-people.bu.edu/vpetsiuk/rise/

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 22 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Measuring Cross-Modal Synergy: A Benchmark for VLM Explainability

    cs.AI 2026-05 unverdicted novelty 7.0

    Introduces Synergistic Faithfulness metric based on Shapley Interaction Index to evaluate cross-modal synergy in VLM explainers, revealing over-reliance on visual salience in existing methods.

  2. How to Evaluate and Refine your CAM

    cs.CV 2026-05 unverdicted novelty 7.0

    Introduces synthetic ground-truth dataset for CAM evaluation, proposes ARCC composite metric, and RefineCAM method that aggregates layers for higher-resolution maps outperforming baselines.

  3. Architecture-Aware Explanation Auditing for Industrial Visual Inspection

    cs.LG 2026-05 conditional novelty 7.0

    Explanation faithfulness for deep classifiers on wafer maps is highest when the explainer matches the model's native readout structure, with ViT-Tiny plus Attention Rollout achieving lower Deletion AUC than mismatched...

  4. Architecture-Aware Explanation Auditing for Industrial Visual Inspection

    cs.LG 2026-05 unverdicted novelty 7.0

    An audit protocol on wafer maps finds that ViT-Tiny with Attention Rollout achieves better deletion faithfulness than other models and explainers, with readout structure as the key factor and RISE outperforming native...

  5. Adjoint Inversion Reveals Holographic Superposition and Destructive Interference in CNN Classifiers

    cs.CV 2026-04 unverdicted novelty 7.0

    CNN classifiers work by holographic superposition and destructive interference in pixel space rather than selecting cleaned features, as proven by a new adjoint inversion framework that also yields a covariance-volume...

  6. From Baselines to Transport Geodesics: Axiomatic Attribution via Optimal Generative Flows

    cs.LG 2026-03 unverdicted novelty 7.0

    Transport-geodesic attribution via optimal generative flows selects principled paths for feature attributions by minimizing kinetic action.

  7. TreeGrad-Ranker: Feature Ranking via $O(L)$-Time Gradients for Decision Trees

    cs.LG 2026-02 accept novelty 7.0

    TreeGrad-Ranker produces feature rankings for decision trees by optimizing a joint insertion-deletion objective with O(L)-time gradients derived from the multilinear extension, outperforming probabilistic values like ...

  8. Bridging the Disciplinary Gap in Explainable AI: From Abstract Desiderata to Concrete Tasks

    cs.CY 2026-05 unverdicted novelty 6.0

    The authors introduce a taxonomy with target, functional role, and mode of justification axes plus a framework that decomposes abstract XAI desiderata into concrete benchmarkable tasks via identified dependency structures.

  9. OCCAM: Open-set Causal Concept explAnation and Ontology induction for black-box vision Models

    cs.AI 2026-05 unverdicted novelty 6.0

    OCCAM discovers open-set visual concepts, estimates causal contributions via object-level interventions on black-box vision models, and induces a global concept ontology from aggregated dataset evidence.

  10. From Weight Perturbation to Feature Attribution for Explaining Fully Connected Neural Networks

    cs.LG 2026-05 unverdicted novelty 6.0

    XWP and XWP_c are novel attribution methods for FCNNs that estimate feature importance by perturbing attached weights to avoid added bias and out-of-distribution issues in occlusion approaches.

  11. Evaluation Cards for XAI Metrics

    cs.CV 2026-05 unverdicted novelty 6.0

    The authors introduce the XAI Evaluation Card template to standardize how XAI evaluation metrics are defined, validated, and reported.

  12. Embodied Interpretability: Linking Causal Understanding to Generalization in Vision-Language-Action Models

    cs.RO 2026-05 unverdicted novelty 6.0

    Interventional attribution via ISS and NMR diagnoses causal misalignment in VLA policies and predicts their generalization performance across manipulation tasks.

  13. DRAGON: A Benchmark for Evidence-Grounded Visual Reasoning over Diagrams

    cs.CV 2026-04 unverdicted novelty 6.0

    DRAGON is a new benchmark with 11,664 annotated instances from six diagram QA datasets that requires models to localize visual evidence regions supporting their answers.

  14. Explainable AI in Speaker Recognition -- Making Latent Representations Understandable

    eess.AS 2026-04 unverdicted novelty 6.0

    Speaker recognition networks form hierarchical clusters in latent space that can be matched to semantic classes using new HCCM algorithm and quantified by Liebig's score.

  15. H-Sets: Hessian-Guided Discovery of Set-Level Feature Interactions in Image Classifiers

    cs.CV 2026-04 unverdicted novelty 6.0

    H-Sets detects higher-order feature interactions in image classifiers via Hessian-guided pair merging and attributes them with IDG-Vis to generate more interpretable saliency maps than existing marginal or coarse methods.

  16. PhiNet: Speaker Verification with Phonetic Interpretability

    eess.AS 2026-04 unverdicted novelty 6.0

    PhiNet adds phonetic interpretability to speaker verification while matching the accuracy of standard black-box models on VoxCeleb, SITW, and LibriSpeech.

  17. AttnTrace: Contextual Attribution of Prompt Injection and Knowledge Corruption

    cs.CL 2025-08 unverdicted novelty 6.0

    AttnTrace is an attention-weight-based context traceback method for LLMs that claims higher accuracy and efficiency than prior art like TracLLM while aiding prompt injection detection.

  18. Grad-ECLIP: Gradient-based Visual and Textual Explanations for CLIP

    cs.CV 2025-02 conditional novelty 6.0

    Grad-ECLIP produces gradient-based visual and textual explanation heatmaps for CLIP by applying channel and spatial weights to token features instead of relying on sparse self-attention maps.

  19. SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation

    cs.LG 2023-10 conditional novelty 6.0

    SalUn uses gradient-based weight saliency to achieve effective machine unlearning of data, classes, or concepts in image classification and generation, narrowing the gap to exact retraining.

  20. Learning Quantifiable Visual Explanations Without Ground-Truth

    cs.AI 2026-05 unverdicted novelty 5.0

    A perturbation-based metric for XAI quality that formalizes sufficiency and necessity, paired with an adapter trained via differentiable supervision to generate causal explanations on black-box models.

  21. Efficient KernelSHAP Explanations for Patch-based 3D Medical Image Segmentation

    cs.CV 2026-04 unverdicted novelty 5.0

    An optimized KernelSHAP method for 3D medical image segmentation restricts computation to ROI and receptive fields, uses patch logit caching for 15-30% savings, and compares organ units versus supervoxels for clinical...

  22. FM-G-CAM: A Holistic Approach for Explainable AI in Computer Vision

    cs.CV 2023-12 unverdicted novelty 4.0

    FM-G-CAM extends Grad-CAM by fusing explanations across multiple top classes for holistic CNN prediction understanding and ships an open-source library.