Visualizing and Understanding Convolutional Networks

Matthew D Zeiler; Rob Fergus

arxiv: 1311.2901 · v3 · pith:CLB25YGGnew · submitted 2013-11-12 · 💻 cs.CV

Visualizing and Understanding Convolutional Networks

Matthew D Zeiler , Rob Fergus This is my paper

classification 💻 cs.CV

keywords imagenetmodelbenchmarkclassificationclassifierconvolutionaldatasetslayers

0 comments

read the original abstract

Large Convolutional Network models have recently demonstrated impressive classification performance on the ImageNet benchmark. However there is no clear understanding of why they perform so well, or how they might be improved. In this paper we address both issues. We introduce a novel visualization technique that gives insight into the function of intermediate feature layers and the operation of the classifier. We also perform an ablation study to discover the performance contribution from different model layers. This enables us to find model architectures that outperform Krizhevsky \etal on the ImageNet classification benchmark. We show our ImageNet model generalizes well to other datasets: when the softmax classifier is retrained, it convincingly beats the current state-of-the-art results on Caltech-101 and Caltech-256 datasets.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 13 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Intriguing properties of neural networks
cs.CV 2013-12 accept novelty 8.0

Deep neural networks exhibit distributed high-level semantic representations and discontinuous input-output mappings vulnerable to transferable adversarial perturbations.
Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps
cs.CV 2013-12 unverdicted novelty 7.0

Gradient ascent on class scores and input-image gradients produce visualizations of ConvNet class notions and saliency maps usable for weakly supervised segmentation.
Inside-Out: Measuring Generalization in Vision Transformers Through Inner Workings
cs.LG 2026-04 unverdicted novelty 6.0

Circuit-based metrics from Vision Transformer internals provide better label-free proxies for generalization under distribution shift than existing methods like model confidence.
Prophecy: Inferring Formal Properties from Neuron Activations
cs.LG 2025-09 unverdicted novelty 6.0

Prophecy infers formal properties of feed-forward neural networks by extracting rules from neuron activation patterns that imply desirable output behaviors.
Explaining Object Detectors via Collective Contribution of Pixels
cs.CV 2024-12 unverdicted novelty 6.0

A Shapley-value method with interaction terms that explains object detector decisions by capturing collective pixel contributions for localization and classification.
Biological Plausibility and Representational Alignment of Feedback Alignment in Convolutional Networks
cs.AI 2026-05 unverdicted novelty 5.0

Modified feedback alignment in convolutional networks produces representations geometrically aligned with backpropagation on CIFAR-10.
Memory Efficient Full-gradient Attacks (MEFA) Framework for Adversarial Defense Evaluations
cs.LG 2026-05 unverdicted novelty 5.0

MEFA enables exact full-gradient white-box attacks on iterative stochastic purification defenses like diffusion and Langevin EBMs by trading recomputation for lower memory, revealing vulnerabilities missed by approxim...
Out-of-Distribution Detection Using Neural Rendering Generative Models
cs.LG 2019-07 unverdicted novelty 5.0

NRM enables OoD detection by joint latent likelihood, assigning lower values to SVHN than CIFAR-10 (unlike VAEs/flows) and consistent across other OoD sets.
Neuron ranking -- an informed way to condense convolutional neural networks architecture
cs.LG 2019-07 unverdicted novelty 5.0

Shapley value and variational importance switch methods produce consistent rankings of filter importance in CNNs, enabling compression and interpretability.
Convolutional neural network based decoders for surface codes
quant-ph 2023-12 unverdicted novelty 4.0

Convolutional neural network decoders achieve good performance on surface code error correction and adapt across noise models, with explainable AI used to inspect their decisions.
Autoencoding sensory substitution
q-bio.NC 2019-07 unverdicted novelty 4.0

Deep recurrent autoencoders convert images to shortened audio signals that incorporate hearing models, enabling above-chance hand posture discrimination and object reaching after a few hours of training instead of months.
On the notion of number in humans and machines
cs.CV 2019-06 unverdicted novelty 2.0

Experiments indicate deep learning models achieve higher accuracy on numerosity tasks for counts below human subitizing capacity.
Water Preservation in Soan River Basin using Deep Learning Techniques
cs.NE 2019-06 unverdicted novelty 2.0

RNN and LSTM models outperform other algorithms in predicting stream flow from precipitation, land use, and temperature, with a public dataset released.