Volumetric and Multi-View CNNs for Object Classification on 3D Data

Angela Dai; Charles R. Qi; Hao Su; Leonidas J. Guibas; Matthias Niessner; Mengyuan Yan

arxiv: 1604.03265 · v2 · pith:NNRUWEDZnew · submitted 2016-04-12 · 💻 cs.CV · cs.AI

Charles R. Qi, Hao Su, Matthias Niessner, Angela Dai, Mengyuan Yan, Leonidas J. Guibas

keywords: cnns · volumetric · multi-view · available · classification · methods · object · representations

3D shape models are becoming widely available and easier to capture, making available 3D information crucial for progress in object classification. Current state-of-the-art methods rely on CNNs to address this problem. Recently, we witness two types of CNNs being developed: CNNs based upon volumetric representations versus CNNs based upon multi-view representations. Empirical results from these two types of CNNs exhibit a large gap, indicating that existing volumetric CNN architectures and approaches are unable to fully exploit the power of 3D representations. In this paper, we aim to improve both volumetric CNNs and multi-view CNNs according to extensive analysis of existing approaches. To this end, we introduce two distinct network architectures of volumetric CNNs. In addition, we examine multi-view CNNs, where we introduce multi-resolution filtering in 3D. Overall, we are able to outperform current state-of-the-art methods for both volumetric CNNs and multi-view CNNs. We provide extensive experiments designed to evaluate underlying design choices, thus providing a better understanding of the space of methods available for object classification on 3D data.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

RGB-D image-based Object Detection: from Traditional Methods to Deep Learning Techniques
cs.CV · 2019-07 · unverdicted · novelty 2.0

A survey of RGB-D object detection from traditional hand-crafted features with machine learning to deep learning techniques.

A review on deep learning techniques for 3D sensed data classification
cs.CV · 2019-07 · unverdicted · novelty 1.0

A survey of deep learning architectures for 3D sensed data classification covering RGB-D, multi-view, volumetric and end-to-end methods along with datasets and future directions.