Analyzing the Cross-Sensor Portability of Neural Network Architectures for LiDAR-based Semantic Labeling

Florian Piewak; Marius Z\"ollner; Peter Pinggera

arxiv: 1907.02149 · v1 · pith:2OEJIQDLnew · submitted 2019-07-03 · 💻 cs.CV · cs.LG

Analyzing the Cross-Sensor Portability of Neural Network Architectures for LiDAR-based Semantic Labeling

Florian Piewak , Peter Pinggera , Marius Z\"ollner This is my paper

Pith reviewed 2026-05-25 09:56 UTC · model grok-4.3

classification 💻 cs.CV cs.LG

keywords LiDARsemantic labelingpoint cloudsCNNcross-sensor portabilitysemantic segmentationIoU evaluationtraining data generation

0 comments

The pith

A new CNN architecture for LiDAR point cloud semantic labeling achieves state-of-the-art results while transferring more effectively across different sensor types.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper aims to establish that a convolutional neural network can label LiDAR point clouds point-wise at high accuracy while remaining effective when moved to different LiDAR sensor types, unlike earlier networks built around sensor-specific choices. This matters because LiDAR hardware develops rapidly, so an architecture that avoids repeated redesigns would lower the cost of keeping labeling systems current. The authors run a quantitative comparison against a reference method on multiple sensors and report a consistent 10 percentage point gain in Intersection-over-Union score. They also note that the same design supports automatic creation of large training sets for new sensors without manual labeling or cross-modal transfer.

Core claim

The proposed CNN architecture for the point-wise semantic labeling of LiDAR data achieves state-of-the-art results while increasing portability across sensor types. This is shown through a quantitative cross-sensor analysis where it yields a 10 percentage point improvement in the Intersection-over-Union score compared to a state-of-the-art reference method. The results suggest it provides an efficient way for the automated generation of large-scale training data for novel LiDAR sensor types without extensive manual annotation or multi-modal label transfer.

What carries the argument

A convolutional neural network architecture built with sensor-agnostic choices in network structure and data representation to support transfer between LiDAR sensor types.

If this is right

The architecture maintains performance when applied to LiDAR sensors different from those used in training.
New LiDAR hardware can be integrated with less redesign of the labeling network.
Large-scale training data for novel sensors can be generated automatically.
Reliance on manual annotation or multi-modal label transfer decreases for new sensor deployments.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same portability principles could support models that handle data from mixed fleets of different LiDAR sensors at once.
Design choices that reduce sensor dependence might shorten the time needed to update autonomous systems when hardware changes.
Cross-sensor testing could become a standard evaluation step for future semantic labeling methods.

Load-bearing premise

The observed 10 percentage point IoU gain stems from the architecture's cross-sensor design choices rather than from differences in data representation, preprocessing steps, or training procedures.

What would settle it

Train and evaluate the reference method using exactly the same data representation, preprocessing pipeline, and training schedule as the proposed architecture on the same cross-sensor datasets and check whether the IoU gap closes.

Figures

Figures reproduced from arXiv: 1907.02149 by Florian Piewak, Marius Z\"ollner, Peter Pinggera.

**Figure 2.** Figure 2: Exemplary semantic labeling result obtained via the proposed [PITH_FULL_IMAGE:figures/full_fig_p002_2.png] view at source ↗

**Figure 3.** Figure 3: 3D voxel CNN architecture for point-wise semantic segmentation. The voxel space as well the processing chain per voxel are represented in [PITH_FULL_IMAGE:figures/full_fig_p003_3.png] view at source ↗

**Figure 4.** Figure 4: Example of a voxel feature encoder based on two consecutive VFE-Layers. The original point features serve as input to the first VFE-Layer, [PITH_FULL_IMAGE:figures/full_fig_p003_4.png] view at source ↗

**Figure 5.** Figure 5: Compression of the 3D space to 2.5D by reducing the number of [PITH_FULL_IMAGE:figures/full_fig_p004_5.png] view at source ↗

**Figure 6.** Figure 6: Illustration of the Autolabeling method of [16] used to transfer semantic reference data from a camera image to a point cloud (VLP-32C). point-wise semantic segmentation result this approach is not sufficient, especially for large voxels such as pillars where all points within a voxel would be assigned the same semantic class. Therefore, we introduce a point-wise semantic extraction head as shown in [PITH… view at source ↗

read the original abstract

State-of-the-art approaches for the semantic labeling of LiDAR point clouds heavily rely on the use of deep Convolutional Neural Networks (CNNs). However, transferring network architectures across different LiDAR sensor types represents a significant challenge, especially due to sensor specific design choices with regard to network architecture as well as data representation. In this paper we propose a new CNN architecture for the point-wise semantic labeling of LiDAR data which achieves state-of-the-art results while increasing portability across sensor types. This represents a significant advantage given the fast-paced development of LiDAR hardware technology. We perform a thorough quantitative cross-sensor analysis of semantic labeling performance in comparison to a state-of-the-art reference method. Our evaluation shows that the proposed architecture is indeed highly portable, yielding an improvement of 10 percentage points in the Intersection-over-Union (IoU) score when compared to the reference approach. Further, the results indicate that the proposed network architecture can provide an efficient way for the automated generation of large-scale training data for novel LiDAR sensor types without the need for extensive manual annotation or multi-modal label transfer.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper offers a new CNN design aimed at cross-sensor portability for LiDAR semantic labeling and reports a 10pp IoU gain, but the gain is not clearly isolated from possible differences in training or preprocessing.

read the letter

The main thing to know is that the authors built a CNN for point-wise LiDAR labeling that they say transfers better across sensor types than prior work, with a reported 10-point IoU lift in cross-sensor tests. They also suggest it could help create training data for new sensors without heavy manual labeling. That addresses a real pain point, since LiDAR hardware moves quickly and retraining is expensive. The experiments compare the new net to a reference method on multiple sensors, which is the right direction for the claim. The portability focus and the automated data generation angle are the clearest practical contributions. The evaluation is empirical and straightforward, with no heavy math or derivations to check. The soft spot is the comparison itself. The abstract gives no information on whether the reference method used the same voxelization, point representation, augmentation, optimizer, or loss settings. If those differed, the 10pp delta could come from any of them rather than the architecture choices meant to improve portability. Without those controls or error bars, the central result is hard to interpret cleanly. The paper is aimed at people doing LiDAR perception for robotics or autonomous driving who need models that survive sensor changes. A reader working on segmentation baselines would find the cross-sensor numbers worth checking, but would have to dig into the methods to see how fair the comparison really is. It deserves peer review because the problem is timely and the experiments are a reasonable first step, even if the controls need tightening before the portability claim can be taken as settled.

Referee Report

1 major / 0 minor

Summary. The paper proposes a new CNN architecture for point-wise semantic labeling of LiDAR point clouds, designed to improve portability across different sensor types compared to prior work that relies on sensor-specific design choices. It reports state-of-the-art quantitative results on cross-sensor evaluation, claiming a 10 percentage point IoU improvement over a reference method, and suggests the architecture enables automated generation of training data for novel sensors without extensive manual annotation.

Significance. If the reported IoU gain can be shown to arise specifically from the architecture's cross-sensor design choices under controlled conditions, the result would be significant for LiDAR semantic segmentation, as it addresses the practical problem of rapid hardware evolution by reducing the need for per-sensor retraining or annotation.

major comments (1)

[Abstract] Abstract: the central claim of a 10pp IoU improvement attributable to the proposed architecture's cross-sensor portability is presented without any description of the reference method, sensor characteristics, data splits, preprocessing steps, voxelization parameters, augmentation, optimizer, or loss weighting; without these controls the delta cannot be isolated from confounding factors in data representation or training.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback. We address the major comment on the abstract below and agree that a revision is warranted to strengthen the presentation of our results.

read point-by-point responses

Referee: [Abstract] Abstract: the central claim of a 10pp IoU improvement attributable to the proposed architecture's cross-sensor portability is presented without any description of the reference method, sensor characteristics, data splits, preprocessing steps, voxelization parameters, augmentation, optimizer, or loss weighting; without these controls the delta cannot be isolated from confounding factors in data representation or training.

Authors: We agree that the abstract, due to length constraints, omits the specific experimental controls. These details (reference method, sensor models, data splits, preprocessing, voxelization, augmentation, optimizer, and loss) are provided in Sections 3 and 4 of the manuscript, where the cross-sensor evaluation is described. To address the concern and better support the claim in the abstract itself, we will revise the abstract to briefly reference the key controls and reference method used. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical comparison with no derivations

full rationale

The paper is an empirical study proposing a CNN architecture for LiDAR point cloud semantic labeling and reporting a 10pp IoU gain versus a reference method. No equations, derivations, fitted parameters, or mathematical predictions are present in the provided text. The central claim rests on experimental results rather than any self-definitional reduction, fitted-input-as-prediction, or self-citation chain. External benchmarks (IoU scores) are independent of the paper's own inputs, satisfying the self-contained criterion for a score of 0.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Based solely on the abstract, the central claim rests on standard assumptions of supervised CNN training for point-cloud segmentation (e.g., availability of labeled data, choice of loss function, and sensor-specific data formatting). No explicit free parameters, axioms, or invented entities are stated.

pith-pipeline@v0.9.0 · 5728 in / 1054 out tokens · 33153 ms · 2026-05-25T09:56:28.195223+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

30 extracted references · 30 canonical work pages · 6 internal anchors

[1]

Experience, results and lessons learned from automated driving on Germany’s highways,

M. Aeberhard, S. Rauch, M. Bahram, G. Tanzmeister, J. Thomas, Y . Pilat, F. Homm, W. Huber, and N. Kaempchen, “Experience, results and lessons learned from automated driving on Germany’s highways,” Intelligent Transportation Systems Magazine , vol. 7, no. 1, pp. 42–57, 2015

work page 2015
[2]

Making Bertha Drive - An Autonomous Journey on a Historic Route,

J. Ziegler, P. Bender, M. Schreiber, and Others, “Making Bertha Drive - An Autonomous Journey on a Historic Route,” Intelligent Transportation Systems Magazine , vol. 6, no. 2, pp. 8–20, 2014

work page 2014
[3]

Autonomous Driving in Trafﬁc: Boss and the Urban Challenge,

C. Urmson, C. Baker, J. Dolan, and Others, “Autonomous Driving in Trafﬁc: Boss and the Urban Challenge,” AI Magazine , vol. 30, no. 2, pp. 17–28, 2009

work page 2009
[4]

Stanley, the Robot that Won the DARPA Grand Challenge,

S. Thrun, M. Montemerlo, H. Dahlkamp, D. Stavens, A. Aron, J. Diebel, P. Fong, J. Gale, M. Halpenny, and G. Hoffmann, “Stanley, the Robot that Won the DARPA Grand Challenge,” Journal of Field Robotics, vol. 23, no. 9, pp. 661–692, 2006

work page 2006
[5]

A random ﬁnite set approach for dynamic occu- pancy grid maps with real-time application,

D. Nuss, S. Reuter, M. Thom, T. Yuan, G. Krehl, M. Maile, A. Gern, and K. Dietmayer, “A random ﬁnite set approach for dynamic occu- pancy grid maps with real-time application,” International Journal of Robotics Research, vol. 37, no. 8, pp. 841–866, 2018

work page 2018
[6]

Grid-based localization and local mapping with moving object detection and tracking Grid- based Localization and Local Mapping with Moving Object Detection and Tracking,

T.-d. Vu, J. Burlet, O. Aycard, and Others, “Grid-based localization and local mapping with moving object detection and tracking Grid- based Localization and Local Mapping with Moving Object Detection and Tracking,” Journal Information Fusion , vol. 12, no. 1, pp. 58–69, 2011

work page 2011
[7]

Probabilistic Analysis of Dynamic Scenes and Collision Risks Assessment to Improve Driving Safety,

C. Laugier, I. E. Paromtchik, M. Perrollaz, M. Y . Yong, J.-D. Yoder, C. Tay, K. Mekhnacha, and A. Negre, “Probabilistic Analysis of Dynamic Scenes and Collision Risks Assessment to Improve Driving Safety,” Intelligent Transportation Systems Magazine (ITSM) , vol. 3, no. 4, pp. 4–19, 2011

work page 2011
[8]

Intention-aware online POMDP planning for autonomous driving in a crowd,

H. Bai, S. Cai, N. Ye, and Others, “Intention-aware online POMDP planning for autonomous driving in a crowd,” in International Con- ference on Robotics and Automation (ICRA) , 2015

work page 2015
[9]

Vehicle Detection and Localiza- tion using 3D LIDAR Point Cloud and Image Semantic Segmentation,

R. Barea, C. Perez, L. M. Bergasa, E. Lopez-Guillen, E. Romera, E. Molinos, M. Ocana, and J. Lopez, “Vehicle Detection and Localiza- tion using 3D LIDAR Point Cloud and Image Semantic Segmentation,” Intelligent Transportation Systems Conference (ITSC) , 2018

work page 2018
[10]

A Review on Deep Learning Techniques Applied to Semantic Segmentation

A. Garcia-Garcia, S. Orts-Escolano, S. Oprea, and Others, “A Review on Deep Learning Techniques Applied to Semantic Segmentation,” in arXiv preprint: 1704.06857 , 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[11]

Deep Multi-modal Object Detection and Semantic Segmentation for Autonomous Driv- ing: Datasets, Methods, and Challenges,

D. Feng, C. Haase-Schuetz, L. Rosenbaum, H. Hertlein, F. Duffhauss, C. Glaeser, W. Wiesbeck, and K. Dietmayer, “Deep Multi-modal Object Detection and Semantic Segmentation for Autonomous Driv- ing: Datasets, Methods, and Challenges,” arXiv preprint: 1902.07830 , 2019

work page arXiv 1902
[12]

Are we ready for autonomous driving? the KITTI vision benchmark suite,

A. Geiger, P. Lenz, and R. Urtasun, “Are we ready for autonomous driving? the KITTI vision benchmark suite,” in Conference on Com- puter Vision and Pattern Recognition (CVPR) , 2012

work page 2012
[13]

SqueezeSegV2: Improved Model Structure and Unsupervised Domain Adaptation for Road-Object Segmentation from a LiDAR Point Cloud

B. Wu, X. Zhou, S. Zhao, X. Yue, and K. Keutzer, “SqueezeSegV2: Improved Model Structure and Unsupervised Domain Adaptation for Road-Object Segmentation from a LiDAR Point Cloud,” arXiv preprint: 1809.08495, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[14]

Deep Semantic Classiﬁ- cation for 3D LiDAR Data,

A. Dewan, G. L. Oliveira, and W. Burgard, “Deep Semantic Classiﬁ- cation for 3D LiDAR Data,” in International Conference on Intelligent Robots and Systems (IROS) , 2017

work page 2017
[15]

Super-sensor for 360- degree Environment Perception: Point Cloud Segmentation Using Image Features,

R. Varga, A. Costea, H. Florea, and Others, “Super-sensor for 360- degree Environment Perception: Point Cloud Segmentation Using Image Features,” in International Conference on Intelligent Trans- portation Systems (ITSC) , 2017

work page 2017
[16]

Boosting LiDAR-Based Semantic Labeling by Cross-modal Training Data Generation,

F. Piewak, P. Pinggera, M. Sch ¨afer, D. Peter, B. Schwarz, N. Schneider, M. Enzweiler, D. Pfeiffer, and M. Z ¨ollner, “Boosting LiDAR-Based Semantic Labeling by Cross-modal Training Data Generation,” in European Conference on Computer Vision Workshops (ECCV) , 2018

work page 2018
[17]

PIXOR: Real-time 3D Object Detection from Point Clouds,

B. Yang, W. Luo, and R. Urtasun, “PIXOR: Real-time 3D Object Detection from Point Clouds,” in Computer Vision and Pattern Recog- nition (CVPR) , 2018

work page 2018
[18]

BirdNet: A 3D Object Detection Framework from LiDAR Information,

J. Beltr ´an, C. Guindel, F. M. Moreno, D. Cruzado, F. Garc ´ıa, and A. De La Escalera, “BirdNet: A 3D Object Detection Framework from LiDAR Information,” Intelligent Transportation Systems Conference (ITSC), 2018

work page 2018
[19]

PointSeg: Real-Time Semantic Segmentation Based on 3D LiDAR Point Cloud

Y . Wang, T. Shi, P. Yun, L. Tai, and M. Liu, “PointSeg: Real-Time Semantic Segmentation Based on 3D LiDAR Point Cloud,” arXiv preprint: 1807.06288, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[20]

Semantic Segmentation of 3D LiDAR Data in Dynamic Scene Using Semi-supervised Learning

J. Mei, B. Gao, D. Xu, W. Yao, X. Zhao, and H. Zhao, “Semantic Segmentation of 3D LiDAR Data in Dynamic Scene Using Semi- supervised Learning,” arXiv preprint: 1809.00426 , 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[21]

PointNet: Deep Learning on Point Sets for 3D Classiﬁcation and Segmentation,

C. R. Qi, H. Su, K. Mo, and Others, “PointNet: Deep Learning on Point Sets for 3D Classiﬁcation and Segmentation,” in Computer Vision and Pattern Recognition (CVPR) , 2017

work page 2017
[22]

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space,

C. R. Qi, L. Yi, H. Su, and Others, “PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space,” in Advances in Neural Information Processing Systems (NIPS) , 2017

work page 2017
[23]

PointCNN: Convolution On X- Transformed Points,

Y . Li, R. Bu, M. Sun, and Others, “PointCNN: Convolution On X- Transformed Points,” in Advances in Neural Information Processing Systems (NIPS) , 2018

work page 2018
[24]

V oxNet: A 3D Convolutional Neural Network for Real-Time Object Recognition,

D. Maturana and S. Scherer, “V oxNet: A 3D Convolutional Neural Network for Real-Time Object Recognition,” in International Confer- ence on Intelligent Robots and Systems (IROS) , 2015

work page 2015
[25]

OctNet: Learning Deep 3D Representations at High Resolutions,

G. Riegler, A. O. Ulusoy, and A. Geiger, “OctNet: Learning Deep 3D Representations at High Resolutions,” in Computer Vision and Pattern Recognition (CVPR) , 2017

work page 2017
[26]

V oxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection,

Y . Zhou and O. Tuzel, “V oxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection,” in Conference on Computer Vision and Pattern Recognition (CVPR) , 2018

work page 2018
[27]

PointPillars: Fast Encoders for Object Detection from Point Clouds

A. H. Lang, S. V ora, H. Caesar, L. Zhou, J. Yang, and O. Beijbom, “PointPillars: Fast Encoders for Object Detection from Point Clouds,” arXiv preprint: 1812.05784 , 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[28]

The Cityscapes Dataset for Semantic Urban Scene Understanding,

M. Cordts, M. Omran, S. Ramos, and Others, “The Cityscapes Dataset for Semantic Urban Scene Understanding,” in Conference on Computer Vision and Pattern Recognition (CVPR) , 2016

work page 2016
[29]

Adam: A Method for Stochastic Optimization

D. P. Kingma and J. Ba, “Adam: A Method for Stochastic Optimiza- tion,” in arXiv preprint: 1412.6980 , 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014
[30]

Delving Deep into Rectiﬁers: Surpassing Human-Level Performance on ImageNet Classiﬁcation,

K. He, X. Zhang, S. Ren, and Others, “Delving Deep into Rectiﬁers: Surpassing Human-Level Performance on ImageNet Classiﬁcation,” in International Conference on Computer Vision (ICCV) , 2015

work page 2015

[1] [1]

Experience, results and lessons learned from automated driving on Germany’s highways,

M. Aeberhard, S. Rauch, M. Bahram, G. Tanzmeister, J. Thomas, Y . Pilat, F. Homm, W. Huber, and N. Kaempchen, “Experience, results and lessons learned from automated driving on Germany’s highways,” Intelligent Transportation Systems Magazine , vol. 7, no. 1, pp. 42–57, 2015

work page 2015

[2] [2]

Making Bertha Drive - An Autonomous Journey on a Historic Route,

J. Ziegler, P. Bender, M. Schreiber, and Others, “Making Bertha Drive - An Autonomous Journey on a Historic Route,” Intelligent Transportation Systems Magazine , vol. 6, no. 2, pp. 8–20, 2014

work page 2014

[3] [3]

Autonomous Driving in Trafﬁc: Boss and the Urban Challenge,

C. Urmson, C. Baker, J. Dolan, and Others, “Autonomous Driving in Trafﬁc: Boss and the Urban Challenge,” AI Magazine , vol. 30, no. 2, pp. 17–28, 2009

work page 2009

[4] [4]

Stanley, the Robot that Won the DARPA Grand Challenge,

S. Thrun, M. Montemerlo, H. Dahlkamp, D. Stavens, A. Aron, J. Diebel, P. Fong, J. Gale, M. Halpenny, and G. Hoffmann, “Stanley, the Robot that Won the DARPA Grand Challenge,” Journal of Field Robotics, vol. 23, no. 9, pp. 661–692, 2006

work page 2006

[5] [5]

A random ﬁnite set approach for dynamic occu- pancy grid maps with real-time application,

D. Nuss, S. Reuter, M. Thom, T. Yuan, G. Krehl, M. Maile, A. Gern, and K. Dietmayer, “A random ﬁnite set approach for dynamic occu- pancy grid maps with real-time application,” International Journal of Robotics Research, vol. 37, no. 8, pp. 841–866, 2018

work page 2018

[6] [6]

Grid-based localization and local mapping with moving object detection and tracking Grid- based Localization and Local Mapping with Moving Object Detection and Tracking,

T.-d. Vu, J. Burlet, O. Aycard, and Others, “Grid-based localization and local mapping with moving object detection and tracking Grid- based Localization and Local Mapping with Moving Object Detection and Tracking,” Journal Information Fusion , vol. 12, no. 1, pp. 58–69, 2011

work page 2011

[7] [7]

Probabilistic Analysis of Dynamic Scenes and Collision Risks Assessment to Improve Driving Safety,

C. Laugier, I. E. Paromtchik, M. Perrollaz, M. Y . Yong, J.-D. Yoder, C. Tay, K. Mekhnacha, and A. Negre, “Probabilistic Analysis of Dynamic Scenes and Collision Risks Assessment to Improve Driving Safety,” Intelligent Transportation Systems Magazine (ITSM) , vol. 3, no. 4, pp. 4–19, 2011

work page 2011

[8] [8]

Intention-aware online POMDP planning for autonomous driving in a crowd,

H. Bai, S. Cai, N. Ye, and Others, “Intention-aware online POMDP planning for autonomous driving in a crowd,” in International Con- ference on Robotics and Automation (ICRA) , 2015

work page 2015

[9] [9]

Vehicle Detection and Localiza- tion using 3D LIDAR Point Cloud and Image Semantic Segmentation,

R. Barea, C. Perez, L. M. Bergasa, E. Lopez-Guillen, E. Romera, E. Molinos, M. Ocana, and J. Lopez, “Vehicle Detection and Localiza- tion using 3D LIDAR Point Cloud and Image Semantic Segmentation,” Intelligent Transportation Systems Conference (ITSC) , 2018

work page 2018

[10] [10]

A Review on Deep Learning Techniques Applied to Semantic Segmentation

A. Garcia-Garcia, S. Orts-Escolano, S. Oprea, and Others, “A Review on Deep Learning Techniques Applied to Semantic Segmentation,” in arXiv preprint: 1704.06857 , 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[11] [11]

Deep Multi-modal Object Detection and Semantic Segmentation for Autonomous Driv- ing: Datasets, Methods, and Challenges,

D. Feng, C. Haase-Schuetz, L. Rosenbaum, H. Hertlein, F. Duffhauss, C. Glaeser, W. Wiesbeck, and K. Dietmayer, “Deep Multi-modal Object Detection and Semantic Segmentation for Autonomous Driv- ing: Datasets, Methods, and Challenges,” arXiv preprint: 1902.07830 , 2019

work page arXiv 1902

[12] [12]

Are we ready for autonomous driving? the KITTI vision benchmark suite,

A. Geiger, P. Lenz, and R. Urtasun, “Are we ready for autonomous driving? the KITTI vision benchmark suite,” in Conference on Com- puter Vision and Pattern Recognition (CVPR) , 2012

work page 2012

[13] [13]

SqueezeSegV2: Improved Model Structure and Unsupervised Domain Adaptation for Road-Object Segmentation from a LiDAR Point Cloud

B. Wu, X. Zhou, S. Zhao, X. Yue, and K. Keutzer, “SqueezeSegV2: Improved Model Structure and Unsupervised Domain Adaptation for Road-Object Segmentation from a LiDAR Point Cloud,” arXiv preprint: 1809.08495, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[14] [14]

Deep Semantic Classiﬁ- cation for 3D LiDAR Data,

A. Dewan, G. L. Oliveira, and W. Burgard, “Deep Semantic Classiﬁ- cation for 3D LiDAR Data,” in International Conference on Intelligent Robots and Systems (IROS) , 2017

work page 2017

[15] [15]

Super-sensor for 360- degree Environment Perception: Point Cloud Segmentation Using Image Features,

R. Varga, A. Costea, H. Florea, and Others, “Super-sensor for 360- degree Environment Perception: Point Cloud Segmentation Using Image Features,” in International Conference on Intelligent Trans- portation Systems (ITSC) , 2017

work page 2017

[16] [16]

Boosting LiDAR-Based Semantic Labeling by Cross-modal Training Data Generation,

F. Piewak, P. Pinggera, M. Sch ¨afer, D. Peter, B. Schwarz, N. Schneider, M. Enzweiler, D. Pfeiffer, and M. Z ¨ollner, “Boosting LiDAR-Based Semantic Labeling by Cross-modal Training Data Generation,” in European Conference on Computer Vision Workshops (ECCV) , 2018

work page 2018

[17] [17]

PIXOR: Real-time 3D Object Detection from Point Clouds,

B. Yang, W. Luo, and R. Urtasun, “PIXOR: Real-time 3D Object Detection from Point Clouds,” in Computer Vision and Pattern Recog- nition (CVPR) , 2018

work page 2018

[18] [18]

BirdNet: A 3D Object Detection Framework from LiDAR Information,

J. Beltr ´an, C. Guindel, F. M. Moreno, D. Cruzado, F. Garc ´ıa, and A. De La Escalera, “BirdNet: A 3D Object Detection Framework from LiDAR Information,” Intelligent Transportation Systems Conference (ITSC), 2018

work page 2018

[19] [19]

PointSeg: Real-Time Semantic Segmentation Based on 3D LiDAR Point Cloud

Y . Wang, T. Shi, P. Yun, L. Tai, and M. Liu, “PointSeg: Real-Time Semantic Segmentation Based on 3D LiDAR Point Cloud,” arXiv preprint: 1807.06288, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[20] [20]

Semantic Segmentation of 3D LiDAR Data in Dynamic Scene Using Semi-supervised Learning

J. Mei, B. Gao, D. Xu, W. Yao, X. Zhao, and H. Zhao, “Semantic Segmentation of 3D LiDAR Data in Dynamic Scene Using Semi- supervised Learning,” arXiv preprint: 1809.00426 , 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[21] [21]

PointNet: Deep Learning on Point Sets for 3D Classiﬁcation and Segmentation,

C. R. Qi, H. Su, K. Mo, and Others, “PointNet: Deep Learning on Point Sets for 3D Classiﬁcation and Segmentation,” in Computer Vision and Pattern Recognition (CVPR) , 2017

work page 2017

[22] [22]

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space,

C. R. Qi, L. Yi, H. Su, and Others, “PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space,” in Advances in Neural Information Processing Systems (NIPS) , 2017

work page 2017

[23] [23]

PointCNN: Convolution On X- Transformed Points,

Y . Li, R. Bu, M. Sun, and Others, “PointCNN: Convolution On X- Transformed Points,” in Advances in Neural Information Processing Systems (NIPS) , 2018

work page 2018

[24] [24]

V oxNet: A 3D Convolutional Neural Network for Real-Time Object Recognition,

D. Maturana and S. Scherer, “V oxNet: A 3D Convolutional Neural Network for Real-Time Object Recognition,” in International Confer- ence on Intelligent Robots and Systems (IROS) , 2015

work page 2015

[25] [25]

OctNet: Learning Deep 3D Representations at High Resolutions,

G. Riegler, A. O. Ulusoy, and A. Geiger, “OctNet: Learning Deep 3D Representations at High Resolutions,” in Computer Vision and Pattern Recognition (CVPR) , 2017

work page 2017

[26] [26]

V oxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection,

Y . Zhou and O. Tuzel, “V oxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection,” in Conference on Computer Vision and Pattern Recognition (CVPR) , 2018

work page 2018

[27] [27]

PointPillars: Fast Encoders for Object Detection from Point Clouds

A. H. Lang, S. V ora, H. Caesar, L. Zhou, J. Yang, and O. Beijbom, “PointPillars: Fast Encoders for Object Detection from Point Clouds,” arXiv preprint: 1812.05784 , 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[28] [28]

The Cityscapes Dataset for Semantic Urban Scene Understanding,

M. Cordts, M. Omran, S. Ramos, and Others, “The Cityscapes Dataset for Semantic Urban Scene Understanding,” in Conference on Computer Vision and Pattern Recognition (CVPR) , 2016

work page 2016

[29] [29]

Adam: A Method for Stochastic Optimization

D. P. Kingma and J. Ba, “Adam: A Method for Stochastic Optimiza- tion,” in arXiv preprint: 1412.6980 , 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014

[30] [30]

Delving Deep into Rectiﬁers: Surpassing Human-Level Performance on ImageNet Classiﬁcation,

K. He, X. Zhang, S. Ren, and Others, “Delving Deep into Rectiﬁers: Surpassing Human-Level Performance on ImageNet Classiﬁcation,” in International Conference on Computer Vision (ICCV) , 2015

work page 2015