pith. sign in

arxiv: 1811.11731 · v1 · pith:MBONU7P3new · submitted 2018-11-28 · 💻 cs.CV

CAPNet: Continuous Approximation Projection For 3D Point Cloud Reconstruction Using 2D Supervision

classification 💻 cs.CV
keywords cloudpointprojectionreconstructionsapproachesapproximationcapnetcontinuous
0
0 comments X
read the original abstract

Knowledge of 3D properties of objects is a necessity in order to build effective computer vision systems. However, lack of large scale 3D datasets can be a major constraint for data-driven approaches in learning such properties. We consider the task of single image 3D point cloud reconstruction, and aim to utilize multiple foreground masks as our supervisory data to alleviate the need for large scale 3D datasets. A novel differentiable projection module, called 'CAPNet', is introduced to obtain such 2D masks from a predicted 3D point cloud. The key idea is to model the projections as a continuous approximation of the points in the point cloud. To overcome the challenges of sparse projection maps, we propose a loss formulation termed 'affinity loss' to generate outlier-free reconstructions. We significantly outperform the existing projection based approaches on a large-scale synthetic dataset. We show the utility and generalizability of such a 2D supervised approach through experiments on a real-world dataset, where lack of 3D data can be a serious concern. To further enhance the reconstructions, we also propose a test stage optimization procedure to obtain reconstructions that display high correspondence with the observed input image.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Improving 3D Object Detection for Pedestrians with Virtual Multi-View Synthesis Orientation Estimation

    cs.CV 2019-07 unverdicted novelty 6.0

    A new Virtual Multi-View Synthesis module improves pedestrian orientation estimation when integrated into the AVOD-FPN 3D detector, outperforming prior methods on KITTI Orientation, 3D, and Bird's Eye View benchmarks.