arxiv: 1608.00507 · v1 · submitted 2016-08-01 · 💻 cs.CV

Recognition: unknown

Top-down Neural Attention by Excitation Backprop

classification 💻 cs.CV
keywords: attention, top-down, model, backprop, excitation, localization, maps, method

We aim to model the top-down attention of a Convolutional Neural Network (CNN) classifier for generating task-specific attention maps. Inspired by a top-down human visual attention model, we propose a new backpropagation scheme, called Excitation Backprop, to pass along top-down signals downwards in the network hierarchy via a probabilistic Winner-Take-All process. Furthermore, we introduce the concept of contrastive attention to make the top-down attention maps more discriminative. In experiments, we demonstrate the accuracy and generalizability of our method in weakly supervised localization tasks on the MS COCO, PASCAL VOC07 and ImageNet datasets. The usefulness of our method is further validated in the text-to-region association task. On the Flickr30k Entities dataset, we achieve promising performance in phrase localization by leveraging the top-down attention of a CNN model that has been trained on weakly labeled web images.
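The abstract does not spell out the propagation rule, so the following is only a minimal sketch of what one step of a probabilistic Winner-Take-All pass could look like for a single fully connected layer. It assumes non-negative child activations and that each parent neuron distributes its winning probability to its children in proportion to activation times excitatory (positive) weight. The function name `excitation_backprop_layer` and its arguments are hypothetical and chosen for illustration only.

```python
def excitation_backprop_layer(weights, activations, parent_probs):
    """Propagate winning probabilities from parent neurons to child neurons.

    weights[i][j]   : weight from child j to parent i (assumed layout)
    activations[j]  : non-negative activation of child j
    parent_probs[i] : winning probability assigned to parent i
    """
    n_children = len(activations)
    child_probs = [0.0] * n_children
    for i, p_parent in enumerate(parent_probs):
        # Keep only excitatory connections; negative weights contribute nothing.
        contrib = [activations[j] * max(weights[i][j], 0.0)
                   for j in range(n_children)]
        total = sum(contrib)
        if total == 0.0:
            # Assumption: a parent with no excitatory input passes nothing down.
            continue
        for j in range(n_children):
            # Normalizing by the total makes each parent's shares sum to one.
            child_probs[j] += p_parent * contrib[j] / total
    return child_probs


# Toy example: two parent neurons, three child neurons.
weights = [[1.0, -2.0, 3.0],
           [0.5, 0.5, 0.0]]
activations = [1.0, 1.0, 1.0]
parent_probs = [1.0, 0.0]  # all top-down signal starts at parent 0

probs = excitation_backprop_layer(weights, activations, parent_probs)
print(probs)  # [0.25, 0.0, 0.75]
```

In this toy case the child connected through the negative weight receives no probability, and the remaining mass is split 1:3 according to the excitatory weights. A contrastive variant, as the abstract suggests, would presumably compare maps obtained from a class and from its complement, but that step is not sketched here.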

This paper has not been read by Pith yet.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. CheXTemporal: A Dataset for Temporally-Grounded Reasoning in Chest Radiography

    cs.CV · 2026-05 · accept · novelty 8.0

    CheXTemporal supplies paired chest X-rays with explicit temporal progression taxonomy and spatial grounding to benchmark and improve models on longitudinal reasoning tasks.