RISE: Randomized Input Sampling for Explanation of Black-box Models

Abir Das; Kate Saenko; Vitali Petsiuk

Not yet reviewed by Pith; the record is open.

Re-run · record.json Download PDF Read on arXiv ↗

This paper has not been read by Pith yet. Machine review is queued; the pith claim, tier, and objections will appear here once it completes.

SPECIMEN: schema-true, not a live event

T0 review · schema-true

One-sentence machine reading of the paper's core claim.

pith:XXXXXXXX · record.json · timestamp

arxiv 1806.07421 v3 pith:OTIDWWPG submitted 2018-06-19 cs.CV

RISE: Randomized Input Sampling for Explanation of Black-box Models

Vitali Petsiuk , Abir Das , Kate Saenko This is my paper

classification cs.CV

keywords importanceriseapproachinputapproachesblack-boxdeepmethods

verification ladder T0 review T1 audit T2 compute T3 formal T4 reserved

0 comments

read the original abstract

Deep neural networks are being used increasingly to automate data analysis and decision making, yet their decision-making process is largely unclear and is difficult to explain to the end users. In this paper, we address the problem of Explainable AI for deep neural networks that take images as input and output a class probability. We propose an approach called RISE that generates an importance map indicating how salient each pixel is for the model's prediction. In contrast to white-box approaches that estimate pixel importance using gradients or other internal network state, RISE works on black-box models. It estimates importance empirically by probing the model with randomly masked versions of the input image and obtaining the corresponding outputs. We compare our approach to state-of-the-art importance extraction methods using both an automatic deletion/insertion metric and a pointing metric based on human-annotated object segments. Extensive experiments on several benchmark datasets show that our approach matches or exceeds the performance of other methods, including white-box approaches. Project page: http://cs-people.bu.edu/vpetsiuk/rise/

discussion (0)

Forward citations

Cited by 39 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Measuring Cross-Modal Synergy: A Benchmark for VLM Explainability
cs.AI 2026-05 unverdicted novelty 7.0

Introduces Synergistic Faithfulness metric based on Shapley Interaction Index to evaluate cross-modal synergy in VLM explainers, revealing over-reliance on visual salience in existing methods.
How Do Document Parsers Break? Auditing Structural Vulnerability in Document Intelligence
cs.CL 2026-05 unverdicted novelty 7.0

ProSA auditing framework shows that structural loss metrics track OCR instability and downstream QA degradation far better than area-based footprint measures across two document parsers on 1,000 pages.
How Do Document Parsers Break? Auditing Structural Vulnerability in Document Intelligence
cs.CL 2026-05 conditional novelty 7.0

A new output-level auditing framework with B-SLR and exposure descriptors shows that structure-targeted perturbations better predict OCR instability and downstream degradation than footprint size in document parsers.
How to Evaluate and Refine your CAM
cs.CV 2026-05 unverdicted novelty 7.0

Introduces synthetic ground-truth dataset for CAM evaluation, proposes ARCC composite metric, and RefineCAM method that aggregates layers for higher-resolution maps outperforming baselines.
Architecture-Aware Explanation Auditing for Industrial Visual Inspection
cs.LG 2026-05 unverdicted novelty 7.0

An audit protocol on wafer maps finds that ViT-Tiny with Attention Rollout achieves better deletion faithfulness than other models and explainers, with readout structure as the key factor and RISE outperforming native...
Architecture-Aware Explanation Auditing for Industrial Visual Inspection
cs.LG 2026-05 conditional novelty 7.0

Explanation faithfulness for deep classifiers on wafer maps is highest when the explainer matches the model's native readout structure, with ViT-Tiny plus Attention Rollout achieving lower Deletion AUC than mismatched...
Embodied Interpretability: Linking Causal Understanding to Generalization in Vision-Language-Action Models
cs.RO 2026-05 unverdicted novelty 7.0

Introduces ISS and NMR as interventional metrics to diagnose causal misalignment in VLA policies and link it to generalization performance.
Adjoint Inversion Reveals Holographic Superposition and Destructive Interference in CNN Classifiers
cs.CV 2026-04 unverdicted novelty 7.0

CNN classifiers work by holographic superposition and destructive interference in pixel space rather than selecting cleaned features, as proven by a new adjoint inversion framework that also yields a covariance-volume...
From Baselines to Transport Geodesics: Axiomatic Attribution via Optimal Generative Flows
cs.LG 2026-03 unverdicted novelty 7.0

Transport-geodesic attribution via optimal generative flows selects principled paths for feature attributions by minimizing kinetic action.
TreeGrad-Ranker: Feature Ranking via $O(L)$-Time Gradients for Decision Trees
cs.LG 2026-02 accept novelty 7.0

TreeGrad-Ranker produces feature rankings for decision trees by optimizing a joint insertion-deletion objective with O(L)-time gradients derived from the multilinear extension, outperforming probabilistic values like ...
CPG-PAD: Concept-Informed Prompts Guided Presentation Attack Detection
cs.CV 2026-07 unverdicted novelty 6.0

CPG-PAD uses XAI-derived visual concepts to guide prompt learning in VLMs, enabling better cross-domain generalization for presentation attack detection on nine benchmarks.
Validating Causal Abstraction Metrics on Simulated Complex Systems
cs.LG 2026-06 unverdicted novelty 6.0

Authors create a benchmark across discrete/continuous and static/dynamical systems and introduce the Causal Abstraction Error (CAE) metric that reliably distinguishes valid from invalid causal abstractions when it inc...
FedLAB: Traceable Semantic Codebooks for Federated Multimodal Graph Foundation Learning
cs.LG 2026-06 unverdicted novelty 6.0

FedLAB organizes multimodal graph knowledge into typed hierarchical codebooks for modality evidence, node semantics, and topology context via federated semantic barycenter pre-training, improving performance by up to ...
GRAPE: Graph-Augmented Prototype Explanations for Interactive Medical Image Diagnosis
cs.CV 2026-06 unverdicted novelty 6.0

GRAPE augments prototype medical image classifiers with graph attention for co-occurrence, a mismatch safety check, and open-vocabulary anchoring to support incremental findings without retraining.
GRAPE: Graph-Augmented Prototype Explanations for Interactive Medical Image Diagnosis
cs.CV 2026-06 unverdicted novelty 6.0

GRAPE augments prototype medical image classifiers with graph attention for co-occurrence, a mismatch safety check, and open-vocabulary anchoring to support incremental addition of findings from single examples.
Partition-Guided Distance Saliency: Bridging Decision and Objective Spaces in Many-Objective Optimization
cs.LG 2026-06 unverdicted novelty 6.0

PGDS is a new explainable AI method for many-objective optimization that automates target selection via partitioning and identifies influential decision variables through distance-based sensitivity analysis.
XtrAIn: Training-Guided Occlusion for Feature Attribution
cs.LG 2026-06 unverdicted novelty 6.0

XtrAIn shifts occlusion from input space to parameter space along the training trajectory to produce cleaner feature attributions than standard methods.
Landseer: Exploring the Machine Learning Defense Landscape
cs.CR 2026-05 unverdicted novelty 6.0

Landseer offers a containerized modular system to integrate and evaluate combinations of machine learning defenses, with an initial analysis of 35 defenses highlighting replicability challenges.
Bridging the Disciplinary Gap in Explainable AI: From Abstract Desiderata to Concrete Tasks
cs.CY 2026-05 unverdicted novelty 6.0

The authors introduce a taxonomy with target, functional role, and mode of justification axes plus a framework that decomposes abstract XAI desiderata into concrete benchmarkable tasks via identified dependency structures.
OCCAM: Open-set Causal Concept explAnation and Ontology induction for black-box vision Models
cs.AI 2026-05 unverdicted novelty 6.0

OCCAM discovers open-set visual concepts, estimates causal contributions via object-level interventions on black-box vision models, and induces a global concept ontology from aggregated dataset evidence.
From Weight Perturbation to Feature Attribution for Explaining Fully Connected Neural Networks
cs.LG 2026-05 unverdicted novelty 6.0

XWP and XWP_c are novel attribution methods for FCNNs that estimate feature importance by perturbing attached weights to avoid added bias and out-of-distribution issues in occlusion approaches.
Architecture-Aware Explanation Auditing for Industrial Visual Inspection
cs.LG 2026-05 unverdicted novelty 6.0

The paper proposes an architecture-aware explanation audit protocol demonstrating that perturbation-based faithfulness is bounded by structural compatibility between explainer and model readout rather than architectur...
Evaluation Cards for XAI Metrics
cs.CV 2026-05 unverdicted novelty 6.0

The authors introduce the XAI Evaluation Card template to standardize how XAI evaluation metrics are defined, validated, and reported.
Embodied Interpretability: Linking Causal Understanding to Generalization in Vision-Language-Action Models
cs.RO 2026-05 unverdicted novelty 6.0

Interventional attribution via ISS and NMR diagnoses causal misalignment in VLA policies and predicts their generalization performance across manipulation tasks.
DRAGON: A Benchmark for Evidence-Grounded Visual Reasoning over Diagrams
cs.CV 2026-04 unverdicted novelty 6.0

DRAGON is a new benchmark with 11,664 annotated instances from six diagram QA datasets that requires models to localize visual evidence regions supporting their answers.
Explainable AI in Speaker Recognition -- Making Latent Representations Understandable
eess.AS 2026-04 unverdicted novelty 6.0

Speaker recognition networks form hierarchical clusters in latent space that can be matched to semantic classes using new HCCM algorithm and quantified by Liebig's score.
H-Sets: Hessian-Guided Discovery of Set-Level Feature Interactions in Image Classifiers
cs.CV 2026-04 unverdicted novelty 6.0

H-Sets detects higher-order feature interactions in image classifiers via Hessian-guided pair merging and attributes them with IDG-Vis to generate more interpretable saliency maps than existing marginal or coarse methods.
PhiNet: Speaker Verification with Phonetic Interpretability
eess.AS 2026-04 unverdicted novelty 6.0

PhiNet adds phonetic interpretability to speaker verification while matching the accuracy of standard black-box models on VoxCeleb, SITW, and LibriSpeech.
AttnTrace: Contextual Attribution of Prompt Injection and Knowledge Corruption
cs.CL 2025-08 unverdicted novelty 6.0

AttnTrace is an attention-weight-based context traceback method for LLMs that claims higher accuracy and efficiency than prior art like TracLLM while aiding prompt injection detection.
Grad-ECLIP: Gradient-based Visual and Textual Explanations for CLIP
cs.CV 2025-02 conditional novelty 6.0

Grad-ECLIP produces gradient-based visual and textual explanation heatmaps for CLIP by applying channel and spatial weights to token features instead of relying on sparse self-attention maps.
SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation
cs.LG 2023-10 conditional novelty 6.0

SalUn uses gradient-based weight saliency to achieve effective machine unlearning of data, classes, or concepts in image classification and generation, narrowing the gap to exact retraining.
Few-class Fidelity: Evaluating Explanations of Real-conditions CNN classifiers with Optimized Perturbations
cs.CV 2026-06 unverdicted novelty 5.0

Introduces a perturbation-based fidelity metric tailored to few-class CNN classifiers for real-conditions XAI evaluation, tested on medical and natural imaging against human-centric metrics.
Explainable AI in Speaker Recognition -- Attention Map Visualisation and Evaluation
eess.AS 2026-06 unverdicted novelty 5.0

The paper introduces Modified RISE-eval to evaluate GradCAM and LayerCAM attention maps on speaker recognition networks and reports distinct advantages for each method under different conditions.
EIVE: End-to-End Instance-Specific Visual Explanations for Detection Transformers
cs.CV 2026-06 unverdicted novelty 5.0

EIVE reformulates decoder cross-attention in Detection Transformers to produce instance-specific saliency maps via cross-layer fusion and attention-aware training, matching post-hoc methods in quality while improving speed.
Learning Quantifiable Visual Explanations Without Ground-Truth
cs.AI 2026-05 unverdicted novelty 5.0

A perturbation-based metric for XAI quality that formalizes sufficiency and necessity, paired with an adapter trained via differentiable supervision to generate causal explanations on black-box models.
Efficient KernelSHAP Explanations for Patch-based 3D Medical Image Segmentation
cs.CV 2026-04 unverdicted novelty 5.0

An optimized KernelSHAP method for 3D medical image segmentation restricts computation to ROI and receptive fields, uses patch logit caching for 15-30% savings, and compares organ units versus supervoxels for clinical...
Explainable AI in Speaker Recognition -- Making Latent Representations Understandable
eess.AS 2026-04 conditional novelty 4.0

Hierarchical clustering of speaker recognition embeddings reveals that the network organizes representations by gender at upper levels and by gender-nationality conjunctions at lower levels, interpreted via a proposed...
FM-G-CAM: A Holistic Approach for Explainable AI in Computer Vision
cs.CV 2023-12 unverdicted novelty 4.0

FM-G-CAM extends Grad-CAM by fusing explanations across multiple top classes for holistic CNN prediction understanding and ships an open-source library.
Position: Genomic Model Research Must Move Beyond Anecdotal Evaluation of Interpretability Methods
cs.LG 2026-05 unverdicted novelty 3.0

Benchmarking on transcription factor binding shows that different interpretability methods often contradict each other, miss known motifs, and fail to match model behavior, so the field needs a systematic tiered valid...