Visualizing and Understanding Convolutional Networks

Matthew D Zeiler , Rob Fergus

Authors on Pith no claims yet

classification 💻 cs.CV

keywords imagenetmodelbenchmarkclassificationclassifierconvolutionaldatasetslayers

read the original abstract

Large Convolutional Network models have recently demonstrated impressive classification performance on the ImageNet benchmark. However there is no clear understanding of why they perform so well, or how they might be improved. In this paper we address both issues. We introduce a novel visualization technique that gives insight into the function of intermediate feature layers and the operation of the classifier. We also perform an ablation study to discover the performance contribution from different model layers. This enables us to find model architectures that outperform Krizhevsky \etal on the ImageNet classification benchmark. We show our ImageNet model generalizes well to other datasets: when the softmax classifier is retrained, it convincingly beats the current state-of-the-art results on Caltech-101 and Caltech-256 datasets.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 5 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Intriguing properties of neural networks
cs.CV 2013-12 accept novelty 8.0

Deep neural networks exhibit distributed high-level semantic representations and discontinuous input-output mappings vulnerable to transferable adversarial perturbations.
Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps
cs.CV 2013-12 unverdicted novelty 7.0

Gradient ascent on class scores and input-image gradients produce visualizations of ConvNet class notions and saliency maps usable for weakly supervised segmentation.
Inside-Out: Measuring Generalization in Vision Transformers Through Inner Workings
cs.LG 2026-04 unverdicted novelty 6.0

Circuit-based metrics from Vision Transformer internals provide better label-free proxies for generalization under distribution shift than existing methods like model confidence.
Biological Plausibility and Representational Alignment of Feedback Alignment in Convolutional Networks
cs.AI 2026-05 unverdicted novelty 5.0

Modified feedback alignment in convolutional networks produces representations geometrically aligned with backpropagation on CIFAR-10.
Memory Efficient Full-gradient Attacks (MEFA) Framework for Adversarial Defense Evaluations
cs.LG 2026-05 unverdicted novelty 5.0

MEFA enables exact full-gradient white-box attacks on iterative stochastic purification defenses like diffusion and Langevin EBMs by trading recomputation for lower memory, revealing vulnerabilities missed by approxim...