3D Semantic Segmentation with Submanifold Sparse Convolutional Networks

Benjamin Graham; Martin Engelcke; Laurens van der Maaten

arxiv: 1711.10275 · v1 · pith:B2JDGHOLnew · submitted 2017-11-28 · 💻 cs.CV


classification 💻 cs.CV

keywords: convolutional, data, networks, sparse, segmentation, semantic, clouds, dense


Convolutional networks are the de-facto standard for analyzing spatio-temporal data such as images, videos, and 3D shapes. Whilst some of this data is naturally dense (e.g., photos), many other data sources are inherently sparse. Examples include 3D point clouds that were obtained using a LiDAR scanner or RGB-D camera. Standard "dense" implementations of convolutional networks are very inefficient when applied on such sparse data. We introduce new sparse convolutional operations that are designed to process spatially-sparse data more efficiently, and use them to develop spatially-sparse convolutional networks. We demonstrate the strong performance of the resulting models, called submanifold sparse convolutional networks (SSCNs), on two tasks involving semantic segmentation of 3D point clouds. In particular, our models outperform all prior state-of-the-art on the test set of a recent semantic segmentation competition.
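The abstract does not spell out how the sparse operations work, so the following is only a minimal sketch of the general idea, assuming the defining property of a submanifold sparse convolution is that an output site is active only where the corresponding input site is active. The function names, the dictionary-based storage, and the scalar per-site features are all hypothetical simplifications chosen for illustration; they are not the authors' implementation.

```python
from typing import Dict, Set, Tuple

Site = Tuple[int, int]  # (row, col) coordinate of an active site


def submanifold_conv2d(active: Dict[Site, float],
                       kernel: Dict[Site, float]) -> Dict[Site, float]:
    """Convolve only at sites that are already active in the input.

    `active` maps coordinates to a scalar feature; absent sites count as zero.
    `kernel` maps offsets (dy, dx) to weights.
    """
    out: Dict[Site, float] = {}
    for (y, x) in active:
        total = 0.0
        for (dy, dx), w in kernel.items():
            neighbour = (y + dy, x + dx)
            if neighbour in active:  # inactive neighbours contribute nothing
                total += w * active[neighbour]
        out[(y, x)] = total
    return out


def dilated_sites(active: Dict[Site, float],
                  kernel: Dict[Site, float]) -> Set[Site]:
    """Sites a regular convolution could make non-zero (for comparison)."""
    return {(y - dy, x - dx) for (y, x) in active for (dy, dx) in kernel}


# Hypothetical toy input: three active sites on an otherwise empty grid.
active = {(0, 0): 1.0, (0, 1): 2.0, (5, 5): 3.0}
# 3x3 kernel with all weights equal to 1.
kernel = {(dy, dx): 1.0 for dy in (-1, 0, 1) for dx in (-1, 0, 1)}

out = submanifold_conv2d(active, kernel)
grown = dilated_sites(active, kernel)
```

In this toy case the output keeps exactly the three input sites, whereas a regular convolution over the same input would touch every site in `grown`. Keeping the active set fixed is what lets the cost scale with the number of active sites rather than with the size of the full grid.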


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Enhanced Ionization Charge Identification in the Short-Baseline Neutrino Program Neutrino Detectors with Deep Neural Networks
physics.ins-det 2026-05 conditional novelty 6.0

A DNN-based region of interest detection method for SBN neutrino detectors outperforms traditional wire-by-wire thresholding in identification accuracy and reconstruction quality while being more robust to performance...