pith. machine review for the scientific record. sign in

arxiv: 1801.07791 · v5 · submitted 2018-01-23 · 💻 cs.CV · cs.AI· cs.GR

Recognition: unknown

PointCNN: Convolution On mathcal{X}-Transformed Points

Authors on Pith no claims yet
classification 💻 cs.CV cs.AIcs.GR
keywords pointspointcloudsconvolutionfeaturesmathcalpointcnnassociated
0
0 comments X
read the original abstract

We present a simple and general framework for feature learning from point clouds. The key to the success of CNNs is the convolution operator that is capable of leveraging spatially-local correlation in data represented densely in grids (e.g. images). However, point clouds are irregular and unordered, thus directly convolving kernels against features associated with the points, will result in desertion of shape information and variance to point ordering. To address these problems, we propose to learn an $\mathcal{X}$-transformation from the input points, to simultaneously promote two causes. The first is the weighting of the input features associated with the points, and the second is the permutation of the points into a latent and potentially canonical order. Element-wise product and sum operations of the typical convolution operator are subsequently applied on the $\mathcal{X}$-transformed features. The proposed method is a generalization of typical CNNs to feature learning from point clouds, thus we call it PointCNN. Experiments show that PointCNN achieves on par or better performance than state-of-the-art methods on multiple challenging benchmark datasets and tasks.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. LOD-Net: Locality-Aware 3D Object Detection Using Multi-Scale Transformer Network

    cs.CV 2026-04 unverdicted novelty 4.0

    Adding multi-scale attention and upsampling to 3DETR improves mAP@50 by 4.78% on ScanNetv2 for 3D object detection.