Visual-TCAV: Concept-based Attribution and Saliency Maps for Post-hoc Explainability in Image Classification

Antonio De Santis; Marco Brambilla; Matteo Bianchi; Riccardo Campi

arxiv: 2411.05698 · v3 · pith:A3R52Q4Pnew · submitted 2024-11-08 · 💻 cs.CV · cs.AI· cs.LG

Visual-TCAV: Concept-based Attribution and Saliency Maps for Post-hoc Explainability in Image Classification

Antonio De Santis , Riccardo Campi , Matteo Bianchi , Marco Brambilla This is my paper

classification 💻 cs.CV cs.AIcs.LG

keywords conceptimagevisual-tcavattributionexplanationsmethodssaliencycannot

0 comments

read the original abstract

Convolutional Neural Networks (CNNs) have shown remarkable performance in image classification. However, interpreting their predictions is challenging due to the size and complexity of these models. State-of-the-art saliency methods generate local explanations highlighting the area in the input image where a class is identified but cannot explain how a concept of interest contributes to the prediction. On the other hand, concept-based methods, such as TCAV, provide insights into how sensitive the network is to a human-defined concept but cannot compute its attribution in a specific prediction nor show its location within the input image. We introduce Visual-TCAV, a novel explainability framework aiming to bridge the gap between these methods by providing both local and global explanations. Visual-TCAV uses Concept Activation Vectors (CAVs) to generate class-agnostic saliency maps that show where the network recognizes a certain concept. Moreover, it can estimate the attribution of these concepts to the output of any class using a generalization of Integrated Gradients. We evaluate the method's faithfulness via a controlled experiment where the ground truth for explanations is known, showing better ground truth alignment than TCAV. Our code is available at https://github.com/DataSciencePolimi/Visual-TCAV.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

$\alpha$-TCAV: A Unified Framework for Testing with Concept Activation Vectors
stat.ML 2026-05 unverdicted novelty 7.0

α-TCAV replaces TCAV's hard indicator with a tunable smooth function to create a unified probabilistic framework with lower variance and guidance for parameter choice or Bayes-optimal scoring.
A Framework for Evaluating Zero-Shot Image Generation in Concept-based Explainability
cs.CV 2026-05 unverdicted novelty 6.0

The paper introduces a framework of four complementary analyses to evaluate the faithfulness of synthetic concept images from zero-shot T2I models versus real images for concept-based XAI.