See Better Before Looking Closer: Weakly Supervised Data Augmentation Network for Fine-Grained Visual Classification

Tao Hu, Honggang Qi, Qingming Huang, Yan Lu · 2019 · cs.CV · arXiv 1901.09891

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open full Pith review browse 3 citing papers arXiv PDF

abstract

Data augmentation is usually adopted to increase the amount of training data, prevent overfitting and improve the performance of deep models. However, in practice, random data augmentation, such as random image cropping, is low-efficiency and might introduce many uncontrolled background noises. In this paper, we propose Weakly Supervised Data Augmentation Network (WS-DAN) to explore the potential of data augmentation. Specifically, for each training image, we first generate attention maps to represent the object's discriminative parts by weakly supervised learning. Next, we augment the image guided by these attention maps, including attention cropping and attention dropping. The proposed WS-DAN improves the classification accuracy in two folds. In the first stage, images can be seen better since more discriminative parts' features will be extracted. In the second stage, attention regions provide accurate location of object, which ensures our model to look at the object closer and further improve the performance. Comprehensive experiments in common fine-grained visual classification datasets show that our WS-DAN surpasses the state-of-the-art methods, which demonstrates its effectiveness.

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

The DeepFake Detection Challenge (DFDC) Dataset

cs.CV · 2020-06-12 · accept · novelty 7.0

The DFDC dataset is the largest public collection of face-swapped videos and supports detectors that generalize to in-the-wild deepfakes.

How to Choose Your Teacher for Fine Grained Image Recognition

cs.CV · 2026-05-15 · conditional · novelty 6.0

Proposes Ratio 1-2 metric for teacher selection in knowledge distillation for fine-grained image recognition, validated across 1000+ experiments showing 18% better selection and up to 17% student accuracy gains.

A Large-Scale Study on the Accuracy vs Cost Trade-offs of Training and Evaluation Settings in Fine-Grained Image Recognition

cs.CV · 2026-05-18 · unverdicted · novelty 5.0

Large-scale experiments demonstrate that data-aware augmentations applied only during training allow fine-grained image models to reach high accuracy without using discriminative crops at inference, lowering costs.

citing papers explorer

Showing 3 of 3 citing papers.

The DeepFake Detection Challenge (DFDC) Dataset cs.CV · 2020-06-12 · accept · none · ref 12
The DFDC dataset is the largest public collection of face-swapped videos and supports detectors that generalize to in-the-wild deepfakes.
How to Choose Your Teacher for Fine Grained Image Recognition cs.CV · 2026-05-15 · conditional · none · ref 13 · internal anchor
Proposes Ratio 1-2 metric for teacher selection in knowledge distillation for fine-grained image recognition, validated across 1000+ experiments showing 18% better selection and up to 17% student accuracy gains.
A Large-Scale Study on the Accuracy vs Cost Trade-offs of Training and Evaluation Settings in Fine-Grained Image Recognition cs.CV · 2026-05-18 · unverdicted · none · ref 10 · internal anchor
Large-scale experiments demonstrate that data-aware augmentations applied only during training allow fine-grained image models to reach high accuracy without using discriminative crops at inference, lowering costs.

See Better Before Looking Closer: Weakly Supervised Data Augmentation Network for Fine-Grained Visual Classification

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer