Grid R-CNN

Buyu Li; Junjie Yan; Quanquan Li; Xin Lu; Yuxin Yue

Grid R-CNN

Not yet reviewed by Pith; the record is open.

Re-run · record.json Download PDF Read on arXiv ↗

This paper has not been read by Pith yet. Machine review is queued; the pith claim, tier, and objections will appear here once it completes.

SPECIMEN: schema-true, not a live event

T0 review · schema-true

One-sentence machine reading of the paper's core claim.

pith:XXXXXXXX · record.json · timestamp

arxiv 1811.12030 v1 pith:HU2K3NZC submitted 2018-11-29 cs.CV

Grid R-CNN

Xin Lu , Buyu Li , Yuxin Yue , Quanquan Li , Junjie Yan This is my paper

classification cs.CV

keywords gridr-cnnpointsdetectionlocalizationobjectarchitecturedifferent

verification ladder T0 review T1 audit T2 compute T3 formal T4 reserved

0 comments

read the original abstract

This paper proposes a novel object detection framework named Grid R-CNN, which adopts a grid guided localization mechanism for accurate object detection. Different from the traditional regression based methods, the Grid R-CNN captures the spatial information explicitly and enjoys the position sensitive property of fully convolutional architecture. Instead of using only two independent points, we design a multi-point supervision formulation to encode more clues in order to reduce the impact of inaccurate prediction of specific points. To take the full advantage of the correlation of points in a grid, we propose a two-stage information fusion strategy to fuse feature maps of neighbor grid points. The grid guided localization approach is easy to be extended to different state-of-the-art detection frameworks. Grid R-CNN leads to high quality object localization, and experiments demonstrate that it achieves a 4.1% AP gain at IoU=0.8 and a 10.0% AP gain at IoU=0.9 on COCO benchmark compared to Faster R-CNN with Res50 backbone and FPN architecture.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Rethinking Classification and Localization for Cascade R-CNN
cs.CV 2019-07 unverdicted novelty 4.0

Feature sharing embedded in every stage of Cascade R-CNN narrows the low-IoU gap, improves all thresholds, and reaches 43.2 AP on COCO with negligible added parameters.