pith. sign in

arxiv: 1711.06104 · v4 · pith:YDKCANLJnew · submitted 2017-11-16 · 💻 cs.LG · stat.ML

Towards better understanding of gradient-based attribution methods for Deep Neural Networks

classification 💻 cs.LG stat.ML
keywords methodsattributionbeengradient-basedcomparisondeepnetworknetworks
0
0 comments X
read the original abstract

Understanding the flow of information in Deep Neural Networks (DNNs) is a challenging problem that has gain increasing attention over the last few years. While several methods have been proposed to explain network predictions, there have been only a few attempts to compare them from a theoretical perspective. What is more, no exhaustive empirical comparison has been performed in the past. In this work, we analyze four gradient-based attribution methods and formally prove conditions of equivalence and approximation between them. By reformulating two of these methods, we construct a unified framework which enables a direct comparison, as well as an easier implementation. Finally, we propose a novel evaluation metric, called Sensitivity-n and test the gradient-based attribution methods alongside with a simple perturbation-based attribution method on several datasets in the domains of image and text classification, using various network architectures.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. AIM: Adversarial Information Masking for Faithfulness Evaluation of Saliency Maps

    cs.LG 2026-05 unverdicted novelty 7.0

    AIM is a new saliency-guided adversarial feature replacement method to evaluate faithfulness of saliency maps and reliability of masking operators on image, audio, and EEG tasks.

  2. WSVD: Weighted Low-Rank Approximation for Fast and Efficient Execution of Low-Precision Vision-Language Models

    cs.CV 2026-04 unverdicted novelty 5.0

    WSVD delivers over 1.8x faster VLM decoding via weighted low-rank approximation at fine granularity plus quantization, without accuracy loss.

  3. Data-driven Sensor Placement for Predictive Applications: A Correlation-Assisted Attribution Framework (CAAF)

    cs.CE 2025-10 unverdicted novelty 5.0

    CAAF clusters candidate sensor locations before applying feature attribution to reduce redundancy and improve optimal sensor placement for predictions in dynamical systems.