CBAM: Convolutional Block Attention Module

In So Kweon; Jongchan Park; Joon-Young Lee; Sanghyun Woo

arxiv: 1807.06521 · v2 · pith:M6GY76FXnew · submitted 2018-07-17 · 💻 cs.CV


Sanghyun Woo, Jongchan Park, Joon-Young Lee, In So Kweon


keywords attention, cbam, module, convolutional, detection, feature, along, block


We propose the Convolutional Block Attention Module (CBAM), a simple yet effective attention module for feed-forward convolutional neural networks. Given an intermediate feature map, our module sequentially infers attention maps along two separate dimensions, channel and spatial; the attention maps are then multiplied with the input feature map for adaptive feature refinement. Because CBAM is a lightweight and general module, it can be integrated seamlessly into any CNN architecture with negligible overhead and is end-to-end trainable along with the base CNN. We validate CBAM through extensive experiments on the ImageNet-1K, MS COCO detection, and VOC 2007 detection datasets. Our experiments show consistent improvements in classification and detection performance with various models, demonstrating the wide applicability of CBAM. The code and models will be publicly available.
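The sequential channel-then-spatial refinement described in the abstract can be sketched in NumPy as follows. This is a minimal illustration, not the authors' released code: the abstract does not spell out how each attention map is computed, so the pooling choices (average and max), the shared two-layer MLP, and the 7x7 convolution are assumptions taken from the usual description of CBAM. All function names, shapes, and the random weights are hypothetical and chosen only for the example.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(feat, w1, w2):
    """feat: (C, H, W); w1: (C//r, C); w2: (C, C//r). Returns (C, 1, 1) weights."""
    avg = feat.mean(axis=(1, 2))  # average-pool over spatial positions -> (C,)
    mx = feat.max(axis=(1, 2))    # max-pool over spatial positions -> (C,)

    def mlp(v):
        # Shared two-layer MLP with a ReLU bottleneck (assumed design).
        return w2 @ np.maximum(w1 @ v, 0.0)

    return sigmoid(mlp(avg) + mlp(mx))[:, None, None]

def spatial_attention(feat, kernel):
    """feat: (C, H, W); kernel: (2, k, k) with k odd. Returns (1, H, W) weights."""
    # Pool across channels, then stack the two maps -> (2, H, W).
    pooled = np.stack([feat.mean(axis=0), feat.max(axis=0)])
    k = kernel.shape[-1]
    pad = k // 2
    padded = np.pad(pooled, ((0, 0), (pad, pad), (pad, pad)))  # zero padding
    h, w = feat.shape[1:]
    out = np.empty((h, w))
    for i in range(h):
        for j in range(w):
            # Naive sliding-window correlation, kept simple for readability.
            out[i, j] = np.sum(padded[:, i:i + k, j:j + k] * kernel)
    return sigmoid(out)[None]

def cbam(feat, w1, w2, kernel):
    """Apply channel attention first, then spatial attention, to one feature map."""
    feat = feat * channel_attention(feat, w1, w2)
    return feat * spatial_attention(feat, kernel)

# Toy example with random (untrained) weights; sizes are arbitrary.
rng = np.random.default_rng(0)
C, H, W, r = 8, 5, 5, 4
x = rng.standard_normal((C, H, W))
w1 = rng.standard_normal((C // r, C)) * 0.1
w2 = rng.standard_normal((C, C // r)) * 0.1
kernel = rng.standard_normal((2, 7, 7)) * 0.1
y = cbam(x, w1, w2, kernel)
print(y.shape)
```

Because both attention maps pass through a sigmoid, every output value is the input value scaled by a factor between 0 and 1, and the output keeps the input's shape. That shape preservation is what lets such a module be dropped between existing convolutional blocks.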


Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Architecture-Aware Explanation Auditing for Industrial Visual Inspection
cs.LG 2026-05 conditional novelty 7.0

Explanation faithfulness for deep classifiers on wafer maps is highest when the explainer matches the model's native readout structure, with ViT-Tiny plus Attention Rollout achieving lower Deletion AUC than mismatched...
Architecture-Aware Explanation Auditing for Industrial Visual Inspection
cs.LG 2026-05 unverdicted novelty 7.0

An audit protocol on wafer maps finds that ViT-Tiny with Attention Rollout achieves better deletion faithfulness than other models and explainers, with readout structure as the key factor and RISE outperforming native...
A Wasserstein GAN-based climate scenario generator for risk management and insurance: the case of soil subsidence
cs.LG 2026-04 unverdicted novelty 4.0

A conditional Wasserstein GAN generates plausible future SWI drought trajectories for French insurance risk management under climate change.
Improved Reinforcement Learning through Imitation Learning Pretraining Towards Image-based Autonomous Driving
cs.LG 2019-07 unverdicted novelty 3.0

Imitation learning pretraining of a ResNet-34 DDPG agent improves performance on image-based autonomous driving in simulation over pure IL or pure RL.