CAPNet: Continuous Approximation Projection For 3D Point Cloud Reconstruction Using 2D Supervision

Mayank Agarwal; Navaneet K L; Priyanka Mandikal; R. Venkatesh Babu

arxiv: 1811.11731 · v1 · pith:MBONU7P3new · submitted 2018-11-28 · 💻 cs.CV

classification 💻 cs.CV

keywords cloud, point, projection, reconstructions, approaches, approximation, capnet, continuous

Knowledge of 3D properties of objects is a necessity in order to build effective computer vision systems. However, lack of large-scale 3D datasets can be a major constraint for data-driven approaches in learning such properties. We consider the task of single-image 3D point cloud reconstruction, and aim to utilize multiple foreground masks as our supervisory data to alleviate the need for large-scale 3D datasets. A novel differentiable projection module, called 'CAPNet', is introduced to obtain such 2D masks from a predicted 3D point cloud. The key idea is to model the projections as a continuous approximation of the points in the point cloud. To overcome the challenges of sparse projection maps, we propose a loss formulation termed 'affinity loss' to generate outlier-free reconstructions. We significantly outperform the existing projection-based approaches on a large-scale synthetic dataset. We show the utility and generalizability of such a 2D-supervised approach through experiments on a real-world dataset, where lack of 3D data can be a serious concern. To further enhance the reconstructions, we also propose a test-stage optimization procedure to obtain reconstructions that display high correspondence with the observed input image.
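
The abstract does not give the projection formula, so the following is only a minimal sketch of what "a continuous approximation of the points" could look like. It assumes an orthographic view (the z coordinate is dropped), a separable Gaussian kernel around each point, and a tanh to saturate the mask below 1. The function name `soft_project` and the parameters `size` and `sigma` are hypothetical and are not taken from the paper.

```python
import numpy as np

def soft_project(points, size=16, sigma=0.5):
    """Render an (N, 3) point cloud as a (size, size) soft foreground mask.

    Assumptions (not from the abstract): points are already in the camera
    frame, x and y are given in pixel units, and z is ignored (orthographic).
    """
    points = np.asarray(points, dtype=np.float64)
    grid = np.arange(size, dtype=np.float64)
    # 1D Gaussian response of every point to every pixel column / row: (N, size)
    kx = np.exp(-((points[:, 0:1] - grid[None, :]) ** 2) / (2.0 * sigma ** 2))
    ky = np.exp(-((points[:, 1:2] - grid[None, :]) ** 2) / (2.0 * sigma ** 2))
    # Sum the separable kernels over all points: acc[row, col] = sum_n ky[n, row] * kx[n, col]
    acc = ky.T @ kx
    # tanh keeps values in [0, 1) so overlapping points do not exceed a mask value of 1
    return np.tanh(acc)

# Usage: a single point at column 4, row 8 yields a smooth blob centred there.
mask = soft_project([[4.0, 8.0, 0.0]])
peak_row, peak_col = np.unravel_index(np.argmax(mask), mask.shape)
```

Because every pixel value is a smooth function of the point coordinates, a mask-based loss can pass gradients back to the predicted points, which a hard "mark the nearest pixel" rasterisation would not allow.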

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Improving 3D Object Detection for Pedestrians with Virtual Multi-View Synthesis Orientation Estimation
cs.CV · 2019-07 · unverdicted · novelty 6.0

A new Virtual Multi-View Synthesis module improves pedestrian orientation estimation when integrated into the AVOD-FPN 3D detector, outperforming prior methods on KITTI Orientation, 3D, and Bird's Eye View benchmarks.