Reprojection R-CNN is a two-stage detector for 360° images combining a distortion-aware spherical RPN on ERP with a reprojection network on perspective projections, reporting higher mAP than prior methods on two new synthetic datasets at 178 ms per image.
Object Detection in Equirectangular Panorama
1 Pith paper cite this work. Polarity classification is still indexing.
abstract
We introduced a high-resolution equirectangular panorama (360-degree, virtual reality) dataset for object detection and propose a multi-projection variant of YOLO detector. The main challenge with equirectangular panorama image are i) the lack of annotated training data, ii) high-resolution imagery and iii) severe geometric distortions of objects near the panorama projection poles. In this work, we solve the challenges by i) using training examples available in the "conventional datasets" (ImageNet and COCO), ii) employing only low-resolution images that require only moderate GPU computing power and memory, and iii) our multi-projection YOLO handles projection distortions by making multiple stereographic sub-projections. In our experiments, YOLO outperforms the other state-of-art detector, Faster RCNN and our multi-projection YOLO achieves the best accuracy with low-resolution input.
fields
cs.CV 1years
2019 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Reprojection R-CNN: A Fast and Accurate Object Detector for 360{\deg} Images
Reprojection R-CNN is a two-stage detector for 360° images combining a distortion-aware spherical RPN on ERP with a reprojection network on perspective projections, reporting higher mAP than prior methods on two new synthetic datasets at 178 ms per image.