pith. sign in

arxiv: 2605.15074 · v1 · pith:XGJFJYB3new · submitted 2026-05-14 · 💻 cs.RO

SOCC-ICP: Semantics-Assisted Odometry based on Occupancy Grids and ICP

Pith reviewed 2026-06-30 20:12 UTC · model grok-4.3

classification 💻 cs.RO
keywords LiDAR odometryoccupancy gridICPsemantic mappingdynamic object filteringrobot navigation
0
0 comments X

The pith

SOCC-ICP performs LiDAR odometry and semantic occupancy grid mapping inside one shared voxel structure.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a method that builds a semantic occupancy grid during LiDAR scan alignment rather than maintaining separate structures for each task. Each voxel stores both geometric statistics and semantic labels, which lets the system switch between point-to-point and point-to-plane ICP according to local planarity and removes dynamic objects through raycasting free-space updates. The same grid is then available for motion planning and other downstream uses without extra conversion steps. When semantic labels are present they improve correspondence weighting and downsampling, but the system still works competitively without them. A reader would care because the approach removes the usual duplication of map data between odometry and planning modules.

Core claim

SOCC-ICP jointly executes semantic occupancy grid mapping and LiDAR scan alignment by letting each voxel encode geometric and semantic statistics; this representation supports adaptive selection of point-to-point or point-to-plane ICP, filters dynamic objects via raycasting, and directly supplies a map usable for robotic planning, achieving competitive accuracy that improves further when semantic cues are incorporated.

What carries the argument

The semantic occupancy grid voxel that stores both geometric and semantic statistics to drive adaptive ICP selection and raycasting-based dynamic filtering.

If this is right

  • The resulting occupancy grid can be passed directly to motion planners without additional map conversion.
  • Performance remains competitive in geometrically degenerate environments even when semantic labels are absent.
  • Adding semantic labels improves accuracy through better downsampling and correspondence weighting.
  • A single map representation removes the need to maintain separate point-cloud or surfel structures for odometry.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Robotic software stacks could drop one layer of map synchronization code.
  • The same voxel structure might support incremental semantic segmentation updates across multiple sensors.
  • Long-term operation in crowded scenes would test whether the raycasting filter accumulates enough free-space evidence to keep drift low.

Load-bearing premise

The method assumes raycasting free-space updates will reliably remove dynamic objects and that local planarity statistics will correctly choose between ICP variants without causing alignment errors in mixed scenes.

What would settle it

Odometry error rising above baseline methods on a sequence containing many independently moving objects that raycasting fails to clear from the grid.

Figures

Figures reproduced from arXiv: 2605.15074 by Henri Mee{\ss}, Johannes Scherer, Sebastian Hirt.

Figure 1
Figure 1. Figure 1: Overview of SOCC-ICP scan registration. A semantically segmented LiDAR point cloud is aligned with a semantic [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Local maps in approaches such as KISS-ICP retain persistent ghosted traces, which can degrade odometry accuracy [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Comparison of occupancy grid mapping results. [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗
read the original abstract

Reliable pose estimation in previously unseen environments is a fundamental capability of autonomous systems. Existing LiDAR odometry methods typically employ point-, surfel-, or NDT-based map representations, which are distinct from the semantic occupancy grids commonly used for downstream tasks such as motion planning. We introduce SOCC-ICP, a semantics-assisted odometry framework that jointly performs Semantic OCCupancy grid mapping and LiDAR scan alignment. Each map voxel encodes geometric and semantic statistics, enabling adaptive point-to-point or point-to-plane ICP based on local planarity. Further, the occupancy grid naturally filters dynamic objects through raycasting-based free-space updates. Across diverse evaluation scenarios, SOCC-ICP achieves performance competitive with state-of-the-art LiDAR odometry and remains robust in geometrically degenerate environments, even in the absence of semantic cues. When semantic labels are available, integrating them into map construction, downsampling, and correspondence weighting yields further accuracy gains. By unifying odometry and semantic occupancy grid mapping within a single representation, SOCC-ICP eliminates redundant map structures and directly provides a map suitable for downstream robotic applications.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces SOCC-ICP, a semantics-assisted LiDAR odometry framework that performs joint semantic occupancy grid mapping and scan alignment within a single voxel-based representation. Each voxel encodes geometric and semantic statistics to support adaptive selection between point-to-point and point-to-plane ICP according to local planarity, while raycasting-based free-space updates are used to filter dynamic objects. The paper claims that this unified approach achieves performance competitive with state-of-the-art LiDAR odometry methods across diverse scenarios, remains robust in geometrically degenerate environments even without semantic cues, yields further gains when semantics are available, and directly supplies a map usable for downstream tasks without redundant structures.

Significance. If the empirical claims hold, the work would be significant for robotics by addressing the typical separation between odometry representations (points, surfels, NDT) and planning representations (occupancy grids). A single structure that supports both accurate pose estimation and downstream usability could reduce system complexity. The adaptive ICP mechanism and dynamic filtering via raycasting are conceptually appealing, but their contribution to the unification claim requires concrete validation that is not evident from the provided description.

major comments (2)
  1. [Abstract and §5] Abstract and §5 (Evaluation): the central claim of 'performance competitive with state-of-the-art' and 'robustness in geometrically degenerate environments' is asserted without any reported error metrics (RMSE, ATE, RPE), dataset names, sequence counts, or quantitative baseline comparisons. This absence prevents assessment of whether the unification actually delivers the stated benefits.
  2. [§3] §3 (Method): the unification premise rests on two unverified mechanisms—raycasting reliably marking dynamic points as free space and local planarity statistics correctly selecting ICP modes. No ablation studies, failure-case analysis under partial occlusion or fast motion, or sensitivity tests on planarity thresholds are described; failure of either mechanism would simultaneously degrade both odometry and map quality.
minor comments (2)
  1. [§3.1] The description of voxel encoding for geometric/semantic statistics lacks explicit notation or pseudocode, making it difficult to reproduce the per-voxel update rules.
  2. [§4] No mention of computational overhead (memory per voxel, raycasting cost) relative to separate odometry + mapping pipelines, which would be needed to substantiate the 'eliminates redundant map structures' claim.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. The comments correctly identify areas where the manuscript requires additional quantitative detail and validation to support its claims. We address each point below and will revise accordingly.

read point-by-point responses
  1. Referee: [Abstract and §5] Abstract and §5 (Evaluation): the central claim of 'performance competitive with state-of-the-art' and 'robustness in geometrically degenerate environments' is asserted without any reported error metrics (RMSE, ATE, RPE), dataset names, sequence counts, or quantitative baseline comparisons. This absence prevents assessment of whether the unification actually delivers the stated benefits.

    Authors: We agree that explicit quantitative results are necessary to substantiate the performance claims. The revised manuscript will include RMSE, ATE, and RPE metrics on standard datasets (e.g., KITTI sequences with specified counts), along with direct numerical comparisons to baselines such as LOAM and LeGO-LOAM. This will enable assessment of the unification benefits. revision: yes

  2. Referee: [§3] §3 (Method): the unification premise rests on two unverified mechanisms—raycasting reliably marking dynamic points as free space and local planarity statistics correctly selecting ICP modes. No ablation studies, failure-case analysis under partial occlusion or fast motion, or sensitivity tests on planarity thresholds are described; failure of either mechanism would simultaneously degrade both odometry and map quality.

    Authors: The referee is correct that the mechanisms require explicit validation. We will add ablation studies quantifying the impact of raycasting-based free-space updates for dynamic filtering and the planarity-driven ICP mode selection. The revision will also include failure-case analysis under occlusion/fast motion and sensitivity tests on the planarity threshold. revision: yes

Circularity Check

0 steps flagged

No circularity: method is a high-level framework without load-bearing derivations or self-citation chains.

full rationale

The paper presents SOCC-ICP as a practical unification of odometry and semantic occupancy mapping via voxel statistics, raycasting, and adaptive ICP selection. No equations, fitted parameters renamed as predictions, or uniqueness theorems are described in the provided text. The central claim (single representation eliminating redundant maps) is an engineering integration, not a mathematical reduction to its own inputs. Assumptions about raycasting and planarity are stated as design choices but are not derived from prior self-citations in a load-bearing way. This is the common case of a self-contained algorithmic contribution evaluated empirically.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review provides no explicit free parameters, axioms, or invented entities; the approach implicitly relies on standard ICP assumptions and occupancy grid raycasting without detailing any new fitted quantities.

pith-pipeline@v0.9.1-grok · 5726 in / 1079 out tokens · 28185 ms · 2026-06-30T20:12:04.094415+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

36 extracted references

  1. [1]

    Sa- loam: Semantic-aided lidar slam with loop closure,

    L. Li, X. Kong, X. Zhao, W. Li, F. Wen, H. Zhang, and Y . Liu, “Sa- loam: Semantic-aided lidar slam with loop closure,” in2021 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2021, pp. 7627–7634

  2. [2]

    Sloam: Semantic lidar odometry and mapping for forest inventory,

    S. W. Chen, G. V . Nardari, E. S. Lee, C. Qu, X. Liu, R. A. F. Romero, and V . Kumar, “Sloam: Semantic lidar odometry and mapping for forest inventory,”IEEE Robotics and Automation Letters, vol. 5, no. 2, pp. 612–619, 2020

  3. [3]

    A lidar-inertial navigation system for uavs in gnss-denied environment with spatial grid structures,

    Z. Qiu, J. Lv, D. Lin, Y . Yu, Z. Sun, and Z. Zheng, “A lidar-inertial navigation system for uavs in gnss-denied environment with spatial grid structures,”Applied Sciences, vol. 13, no. 1, p. 414, 2022

  4. [4]

    Lidar odometry survey: recent advancements and remaining challenges,

    D. Lee, M. Jung, W. Yang, and A. Kim, “Lidar odometry survey: recent advancements and remaining challenges,”Intelligent Service Robotics, vol. 17, no. 2, pp. 95–118, 2024

  5. [5]

    Method for registration of 3-d shapes,

    P. J. Besl and N. D. McKay, “Method for registration of 3-d shapes,” inSensor fusion IV: control paradigms and data structures, vol. 1611. Spie, 1992, pp. 586–606

  6. [6]

    Kiss-icp: In defense of point-to-point icp–simple, accu- rate, and robust registration if done the right way,

    I. Vizzo, T. Guadagnino, B. Mersch, L. Wiesmann, J. Behley, and C. Stachniss, “Kiss-icp: In defense of point-to-point icp–simple, accu- rate, and robust registration if done the right way,”IEEE Robotics and Automation Letters, vol. 8, no. 2, pp. 1029–1036, 2023

  7. [7]

    Genz-icp: Generalizable and degeneracy- robust lidar odometry using an adaptive weighting,

    D. Lee, H. Lim, and S. Han, “Genz-icp: Generalizable and degeneracy- robust lidar odometry using an adaptive weighting,”IEEE Robotics and Automation Letters, 2024

  8. [8]

    A review of global path planning methods for occupancy grid maps regardless of obstacle density,

    E. G. Tsardoulias, A. Iliakopoulou, A. Kargakos, and L. Petrou, “A review of global path planning methods for occupancy grid maps regardless of obstacle density,”Journal of intelligent & robotic systems, vol. 84, no. 1, pp. 829–858, 2016

  9. [9]

    Decentralized multi- agent trajectory planning in dynamic environments with spatiotemporal occupancy grid maps,

    S. Wu, G. Chen, M. Shi, and J. Alonso-Mora, “Decentralized multi- agent trajectory planning in dynamic environments with spatiotemporal occupancy grid maps,” in2024 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2024, pp. 7208–7214

  10. [10]

    Semantic 3d occupancy mapping through efficient high order crfs,

    S. Yang, Y . Huang, and S. Scherer, “Semantic 3d occupancy mapping through efficient high order crfs,” in2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2017, pp. 590–597

  11. [11]

    Robust matching of occupancy maps for odometry in autonomous vehicles

    M. D. Dimitrievski, D. Van Hamme, P. Veelaert, and W. Philips, “Robust matching of occupancy maps for odometry in autonomous vehicles.” in VISIGRAPP (3: VISAPP), 2016, pp. 628–635

  12. [12]

    Occupancy-slam: An efficient and robust algorithm for simultaneously optimizing robot poses and occupancy map,

    Y . Wang, L. Zhao, and S. Huang, “Occupancy-slam: An efficient and robust algorithm for simultaneously optimizing robot poses and occupancy map,”IEEE Transactions on Robotics, 2025

  13. [13]

    Efficient surfel-based slam using 3d laser range data in urban environments

    J. Behley and C. Stachniss, “Efficient surfel-based slam using 3d laser range data in urban environments.” inRobotics: science and systems, vol. 2018, 2018, p. 59

  14. [14]

    Efficient and probabilistic adaptive voxel mapping for accurate online lidar odometry,

    C. Yuan, W. Xu, X. Liu, X. Hong, and F. Zhang, “Efficient and probabilistic adaptive voxel mapping for accurate online lidar odometry,” IEEE Robotics and Automation Letters, vol. 7, no. 3, pp. 8518–8525, 2022

  15. [15]

    Ndt- loam: A real-time lidar odometry and mapping with weighted ndt and lfa,

    S. Chen, H. Ma, C. Jiang, B. Zhou, W. Xue, Z. Xiao, and Q. Li, “Ndt- loam: A real-time lidar odometry and mapping with weighted ndt and lfa,”IEEE Sensors Journal, vol. 22, no. 4, pp. 3660–3671, 2021

  16. [16]

    Real-time scan-to-map matching localization system based on lightweight pre-built occupancy high-definition map,

    J. Wen, J. Tang, H. Liu, C. Qian, and X. Fan, “Real-time scan-to-map matching localization system based on lightweight pre-built occupancy high-definition map,”Remote Sensing, vol. 15, no. 3, p. 595, 2023

  17. [17]

    Kiss-slam: A simple, robust, and accurate 3d lidar slam system with enhanced generalization capabilities,

    T. Guadagnino, B. Mersch, S. Gupta, I. Vizzo, G. Grisetti, and C. Stach- niss, “Kiss-slam: A simple, robust, and accurate 3d lidar slam system with enhanced generalization capabilities,” in2025 IEEE/RSJ Interna- tional Conference on Intelligent Robots and Systems (IROS), 2025, pp. 5363–5370

  18. [18]

    Lsk3dnet: Towards effective and efficient 3d perception with large sparse kernels,

    T. Feng, W. Wang, F. Ma, and Y . Yang, “Lsk3dnet: Towards effective and efficient 3d perception with large sparse kernels,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 14 916–14 927

  19. [19]

    The normal distributions transform: A new approach to laser scan matching,

    P. Biber and W. Straßer, “The normal distributions transform: A new approach to laser scan matching,” inProceedings 2003 IEEE/RSJ Inter- national Conference on Intelligent Robots and Systems (IROS 2003)(Cat. No. 03CH37453), vol. 3. IEEE, 2003, pp. 2743–2748

  20. [20]

    Ct-icp: Real-time elastic lidar odometry with loop closure,

    P. Dellenbach, J.-E. Deschaud, B. Jacquet, and F. Goulette, “Ct-icp: Real-time elastic lidar odometry with loop closure,” in2022 Interna- tional Conference on Robotics and Automation (ICRA). IEEE, 2022, pp. 5580–5586

  21. [21]

    Low-drift and real-time lidar odometry and mapping,

    J. Zhang and S. Singh, “Low-drift and real-time lidar odometry and mapping,”Autonomous robots, vol. 41, no. 2, pp. 401–416, 2017

  22. [22]

    Mulls: Versatile lidar slam via multi-metric linear least square,

    Y . Pan, P. Xiao, Y . He, Z. Shao, and Z. Li, “Mulls: Versatile lidar slam via multi-metric linear least square,” in2021 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2021, pp. 11 633–11 640

  23. [23]

    Suma++: Efficient lidar-based semantic slam,

    X. Chen, A. Milioto, E. Palazzolo, P. Giguere, J. Behley, and C. Stach- niss, “Suma++: Efficient lidar-based semantic slam,” in2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2019, pp. 4530–4537

  24. [24]

    Sage-icp: Semantic information-assisted icp,

    J. Cui, J. Chen, and L. Li, “Sage-icp: Semantic information-assisted icp,” in2024 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2024, pp. 8537–8543

  25. [25]

    Bonxai: Fast, hierarchical, sparse voxel grid,

    D. Faconti, “Bonxai: Fast, hierarchical, sparse voxel grid,” GitHub repos- itory, 2024, https://github.com/facontidavide/Bonxai [accessed: 2025-12- 03]

  26. [26]

    Octomap: An efficient probabilistic 3d mapping framework based on octrees,

    A. Hornung, K. M. Wurm, M. Bennewitz, C. Stachniss, and W. Burgard, “Octomap: An efficient probabilistic 3d mapping framework based on octrees,”Autonomous robots, vol. 34, no. 3, pp. 189–206, 2013

  27. [27]

    Vdb: High-resolution sparse volumes with dynamic topol- ogy,

    K. Museth, “Vdb: High-resolution sparse volumes with dynamic topol- ogy,”ACM transactions on graphics (TOG), vol. 32, no. 3, pp. 1–22, 2013

  28. [28]

    Bayesian spatial kernel smoothing for scalable dense semantic map- ping,

    L. Gan, R. Zhang, J. W. Grizzle, R. M. Eustice, and M. Ghaffari, “Bayesian spatial kernel smoothing for scalable dense semantic map- ping,”IEEE Robotics and Automation Letters, vol. 5, no. 2, pp. 790–797, 2020

  29. [29]

    Semantic point cloud interpretation based on optimal neighborhoods, relevant features and efficient classifiers,

    M. Weinmann, B. Jutzi, S. Hinz, and C. Mallet, “Semantic point cloud interpretation based on optimal neighborhoods, relevant features and efficient classifiers,”ISPRS Journal of Photogrammetry and Remote Sensing, vol. 105, pp. 286–304, 2015

  30. [30]

    Vision meets robotics: The kitti dataset,

    A. Geiger, P. Lenz, C. Stiller, and R. Urtasun, “Vision meets robotics: The kitti dataset,”The international journal of robotics research, vol. 32, no. 11, pp. 1231–1237, 2013

  31. [31]

    Semantickitti: A dataset for semantic scene understanding of lidar sequences,

    J. Behley, M. Garbade, A. Milioto, J. Quenzel, S. Behnke, C. Stachniss, and J. Gall, “Semantickitti: A dataset for semantic scene understanding of lidar sequences,” inProceedings of the IEEE/CVF international conference on computer vision, 2019, pp. 9297–9307

  32. [32]

    Mulran: Multimodal range dataset for urban place recognition,

    G. Kim, Y . S. Park, Y . Cho, J. Jeong, and A. Kim, “Mulran: Multimodal range dataset for urban place recognition,” in2020 IEEE international conference on robotics and automation (ICRA). IEEE, 2020, pp. 6246– 6253

  33. [33]

    The newer college dataset: Handheld lidar, inertial and vision with ground truth,

    M. Ramezani, Y . Wang, M. Camurri, D. Wisth, M. Mattamala, and M. Fallon, “The newer college dataset: Handheld lidar, inertial and vision with ground truth,” in2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2020, pp. 4353–4360

  34. [34]

    Ground-challenge: A multi-sensor slam dataset focusing on corner cases for ground robots,

    J. Yin, H. Yin, C. Liang, H. Jiang, and Z. Zhang, “Ground-challenge: A multi-sensor slam dataset focusing on corner cases for ground robots,” in2023 IEEE International Conference on Robotics and Biomimetics (ROBIO). IEEE, 2023, pp. 1–5

  35. [35]

    Subt-mrs dataset: Pushing slam towards all-weather environments,

    S. Zhao, Y . Gao, T. Wu, D. Singh, R. Jiang, H. Sun, M. Sarawata, Y . Qiu, W. Whittaker, I. Higginset al., “Subt-mrs dataset: Pushing slam towards all-weather environments,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 22 647–22 657

  36. [36]

    evo: Python package for the evaluation of odometry and slam,

    M. Grupp, “evo: Python package for the evaluation of odometry and slam,” GitHub repository, 2017, https://github.com/MichaelGrupp/evo [accessed: 2025-12-03]