pith. sign in

arxiv: 1811.03818 · v1 · pith:MZY3E3VInew · submitted 2018-11-09 · 💻 cs.CV

RoarNet: A Robust 3D Object Detection based on RegiOn Approximation Refinement

classification 💻 cs.CV
keywords roarnetdetectionobjectcloudspointavailablefeasibleimage
0
0 comments X
read the original abstract

We present RoarNet, a new approach for 3D object detection from a 2D image and 3D Lidar point clouds. Based on two-stage object detection framework with PointNet as our backbone network, we suggest several novel ideas to improve 3D object detection performance. The first part of our method, RoarNet_2D, estimates the 3D poses of objects from a monocular image, which approximates where to examine further, and derives multiple candidates that are geometrically feasible. This step significantly narrows down feasible 3D regions, which otherwise requires demanding processing of 3D point clouds in a huge search space. Then the second part, RoarNet_3D, takes the candidate regions and conducts in-depth inferences to conclude final poses in a recursive manner. Inspired by PointNet, RoarNet_3D processes 3D point clouds directly without any loss of data, leading to precise detection. We evaluate our method in KITTI, a 3D object detection benchmark. Our result shows that RoarNet has superior performance to state-of-the-art methods that are publicly available. Remarkably, RoarNet also outperforms state-of-the-art methods even in settings where Lidar and camera are not time synchronized, which is practically important for actual driving environments. RoarNet is implemented in Tensorflow and publicly available with pre-trained models.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. STD: Sparse-to-Dense 3D Object Detector for Point Cloud

    cs.CV 2019-07 unverdicted novelty 5.0

    STD is a two-stage point-cloud 3D detector using point-seeded spherical anchors for proposals, PointsPool for feature aggregation, and an IoU branch, reporting large gains over prior methods on KITTI especially on har...