Part-based R-CNNs for Fine-grained Category Detection

Jeff Donahue; Ning Zhang; Ross Girshick; Trevor Darrell

arxiv: 1407.3867 · v1 · pith:YLICT6G5new · submitted 2014-07-15 · 💻 cs.CV

Part-based R-CNNs for Fine-grained Category Detection

Ning Zhang , Jeff Donahue , Ross Girshick , Trevor Darrell This is my paper

classification 💻 cs.CV

keywords fine-grainedcategorizationboundingcategorydetectionmethodmethodsobject

0 comments

read the original abstract

Semantic part localization can facilitate fine-grained categorization by explicitly isolating subtle appearance differences associated with specific object parts. Methods for pose-normalized representations have been proposed, but generally presume bounding box annotations at test time due to the difficulty of object detection. We propose a model for fine-grained categorization that overcomes these limitations by leveraging deep convolutional features computed on bottom-up region proposals. Our method learns whole-object and part detectors, enforces learned geometric constraints between them, and predicts a fine-grained category from a pose-normalized representation. Experiments on the Caltech-UCSD bird dataset confirm that our method outperforms state-of-the-art fine-grained categorization methods in an end-to-end evaluation without requiring a bounding box at test time.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Large-Scale Study on the Accuracy vs Cost Trade-offs of Training and Evaluation Settings in Fine-Grained Image Recognition
cs.CV 2026-05 unverdicted novelty 5.0

Large-scale experiments demonstrate that data-aware augmentations applied only during training allow fine-grained image models to reach high accuracy without using discriminative crops at inference, lowering costs.