Causal Explanations for Image Classifiers

Daniel Kroening; David A. Kelly; Hana Chockler; Youcheng Sun

arxiv: 2411.08875 · v4 · pith:FTUK3ET4new · submitted 2024-11-13 · 💻 cs.AI

Causal Explanations for Image Classifiers

Hana Chockler , David A. Kelly , Daniel Kroening , Youcheng Sun This is my paper

classification 💻 cs.AI

keywords explanationsblack-boxdefinitionstoolsalgorithmapproachclassifierscomputing

0 comments

read the original abstract

Existing algorithms for explaining the output of image classifiers use different definitions of explanations and a variety of techniques to find them. However, none of the existing tools use a principled approach based on formal definitions of cause and explanation. In this paper we present a novel black-box approach to computing explanations grounded in the theory of actual causality. We prove relevant theoretical results and present an algorithm for computing approximate explanations based on these definitions. We prove termination of our algorithm and discuss its complexity and the amount of approximation compared to the precise definition. We implemented the framework in a tool ReX and we present experimental results and a comparison with state-of-the-art tools. We demonstrate that ReX is the most efficient black-box tool and produces the smallest explanations, in addition to outperforming other black-box tools on standard quality measures.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Explaining Failures of Cyber-Physical Systems with Actual Causality
cs.RO 2026-06 unverdicted novelty 6.0

Adapts actual causality to CPS failure explanation with two new algorithms, demonstrated on a neural autonomous car avoiding collisions.
Out-of-the-box: Black-box Causal Attacks on Object Detectors
cs.CV 2025-12 unverdicted novelty 6.0

BlackCAtt creates smaller, explainable black-box attacks on object detectors by targeting minimal causal pixel sets, outperforming or matching standard methods and acting as a meta-algorithm when combined with them.