Incremental Semantic Mapping with Unsupervised On-line Learning

Hansenclever F. Bassani; Ygor C. N. Sousa

arXiv: 1907.04001 · v2 · pith:BFTLNNVVnew · submitted 2019-07-09 · cs.RO · cs.LG · cs.NE

Pith reviewed 2026-05-25 00:40 UTC · model grok-4.3

classification cs.RO · cs.LG · cs.NE

keywords semantic mapping · topological maps · self-organizing maps · unsupervised learning · incremental learning · place categorization · robot navigation · online learning

The pith

A robotic mapping system builds topological maps enriched with semantic object data and uses an unsupervised online SOM to cluster similar places while continuing to learn without degrading prior knowledge.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper describes a two-part system for robots: one module incrementally constructs a topological map of an environment and attaches recognized objects to each node for semantic enrichment. A second module applies an incremental unsupervised self-organizing map trained online to categorize places according to those object features. Real-world experiments demonstrate that the system acquires maps over successive visits, groups semantically similar places together, and incorporates data from new locations without erasing earlier clusters.
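
The mapping module described above can be pictured as a small graph data structure. The following is a minimal sketch, not the authors' implementation: the class names `TopoNode` and `TopoMap`, the `node_spacing` threshold, and the nearest-node rule for deciding when to create a node are all assumptions made for illustration.

```python
from collections import Counter
from dataclasses import dataclass, field
import math


@dataclass
class TopoNode:
    """One topological node: a position plus counts of objects recognized nearby."""
    node_id: int
    x: float
    y: float
    objects: Counter = field(default_factory=Counter)


class TopoMap:
    """Hypothetical incremental topological map (illustrative only)."""

    def __init__(self, node_spacing=1.0):
        self.node_spacing = node_spacing  # assumed distance threshold for a new node
        self.nodes = []                   # TopoNode instances, indexed by node_id
        self.edges = set()                # undirected edges as sorted (id, id) pairs
        self._current = None              # node the robot was last attached to

    def observe(self, x, y, labels):
        """Attach labels to the nearest node, creating a node if none is close."""
        nearest = min(
            self.nodes,
            key=lambda n: math.hypot(n.x - x, n.y - y),
            default=None,
        )
        too_far = (
            nearest is None
            or math.hypot(nearest.x - x, nearest.y - y) > self.node_spacing
        )
        if too_far:
            nearest = TopoNode(len(self.nodes), x, y)
            self.nodes.append(nearest)
        # Link the previous node to the current one when the robot moves on.
        if self._current is not None and self._current is not nearest:
            a, b = sorted((self._current.node_id, nearest.node_id))
            self.edges.add((a, b))
        nearest.objects.update(labels)  # semantic enrichment of the node
        self._current = nearest
        return nearest
```

Calling `observe` once per robot pose grows the graph only when the robot leaves the neighbourhood of every existing node, so repeated visits add object evidence to existing nodes instead of duplicating them.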

Core claim

The proposed approach includes a mapping module that incrementally creates a topological map of the environment enriched with objects recognized around each topological node, and a places categorization module endowed with an incremental unsupervised learning SOM with on-line training. When tested in experiments with real-world data, the system acquires topological maps with semantic information, clusters together similar places based on that information, and continues learning from newly visited environments without degrading the information previously learned.

What carries the argument

An incremental unsupervised Self-Organizing Map (SOM) with on-line training that categorizes places using only the semantic object information attached to topological nodes.
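
To make the idea of incremental, on-line, unsupervised categorization concrete, here is a deliberately simplified sketch. It is a generic growing competitive-learning rule standing in for the SOM variant used in the paper, whose details this page does not give. The class name `OnlineGrowingClusterer`, the cosine-similarity match, and the `match_threshold` and `learning_rate` parameters are assumptions.

```python
import math


class OnlineGrowingClusterer:
    """Simplified stand-in for an incremental SOM; not the paper's model."""

    def __init__(self, match_threshold=0.6, learning_rate=0.1):
        self.match_threshold = match_threshold  # assumed cosine-similarity cutoff
        self.learning_rate = learning_rate      # assumed step size for the winner
        self.prototypes = []                    # one weight vector per category

    @staticmethod
    def _cosine(a, b):
        dot = sum(p * q for p, q in zip(a, b))
        norm = math.sqrt(sum(p * p for p in a)) * math.sqrt(sum(q * q for q in b))
        return dot / norm if norm else 0.0

    def predict(self, x):
        """Return (index, similarity) of the best prototype, or (None, 0.0)."""
        if not self.prototypes:
            return None, 0.0
        sims = [self._cosine(x, w) for w in self.prototypes]
        best = max(range(len(sims)), key=sims.__getitem__)
        return best, sims[best]

    def partial_fit(self, x):
        """Process one sample: update the winner or insert a new prototype."""
        best, sim = self.predict(x)
        if best is None or sim < self.match_threshold:
            # Nothing matches well enough: grow the model with a new category.
            self.prototypes.append(list(x))
            return len(self.prototypes) - 1
        # Move only the winning prototype toward the sample.
        w = self.prototypes[best]
        for i, xi in enumerate(x):
            w[i] += self.learning_rate * (xi - w[i])
        return best
```

Because each sample either moves one prototype or adds a new one, prototypes for previously seen place types are left untouched by dissimilar input. This is one plausible mechanism for learning new environments without degrading earlier clusters; whether the paper's model behaves this way is not established here.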

If this is right

Robots can maintain consistent place categories across repeated visits to the same or similar spaces.
Semantic object labels attached to map nodes suffice for unsupervised place grouping without external supervision.
New environments can be incorporated into the map and categorization system while preserving all prior structure.
The method supports building semantic maps in previously unseen areas without requiring offline retraining.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the authors make directly.

The same SOM could be extended to handle gradual environmental changes such as moved furniture by treating them as additional online updates.
Object recognition errors would propagate directly into place clusters, suggesting a need for confidence-weighted inputs in future versions.
This unsupervised clustering might transfer to other sensor modalities if object features are replaced by equivalent descriptors from vision or lidar.
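
The confidence-weighting idea in the second point can be sketched as a feature-building step. This is a hypothetical illustration, not something the paper describes: the function name `weighted_object_vector`, the `(label, confidence)` detection format, and the `min_confidence` cutoff are all assumptions.

```python
def weighted_object_vector(detections, vocabulary, min_confidence=0.0):
    """Sum detection confidences per object class, in vocabulary order.

    detections: iterable of (label, confidence) pairs from an object recognizer.
    vocabulary: ordered list of object class names that define the vector layout.
    """
    index = {name: i for i, name in enumerate(vocabulary)}
    vector = [0.0] * len(vocabulary)
    for label, confidence in detections:
        # Skip labels outside the vocabulary and low-confidence detections.
        if label in index and confidence >= min_confidence:
            vector[index[label]] += confidence
    return vector
```

Compared with plain object counts, a vector built this way lets an uncertain detection contribute less to the place descriptor, which limits how far a single recognition error can pull a node toward the wrong cluster.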

Load-bearing premise

The unsupervised online SOM can reliably cluster places from semantic object data alone without any supervision, labeled examples, or loss of earlier clusters when new environments are added.

What would settle it

A test in which place clusters formed from an initial environment show reduced accuracy or separation after the SOM is trained on data from a second, distinct environment.
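
One way to run such a test is to record the cluster assignments for the first environment twice, once before and once after training on the second environment, and compare a clustering score. The sketch below uses majority-label accuracy (cluster purity) as the score. That metric, and the function names `cluster_accuracy` and `retention_gap`, are assumptions for illustration and not necessarily the measures used in the paper. Ground-truth place labels are needed only for evaluation, not for training.

```python
from collections import Counter, defaultdict


def cluster_accuracy(assignments, labels):
    """Fraction of samples whose cluster's majority label equals their own label."""
    by_cluster = defaultdict(Counter)
    for cluster, label in zip(assignments, labels):
        by_cluster[cluster][label] += 1
    # Each cluster contributes the size of its largest label group.
    correct = sum(counts.most_common(1)[0][1] for counts in by_cluster.values())
    return correct / len(labels)


def retention_gap(assign_before, assign_after, labels):
    """Accuracy on environment A before training on B, minus accuracy after."""
    return cluster_accuracy(assign_before, labels) - cluster_accuracy(assign_after, labels)


# Toy data: four nodes from environment A with ground-truth place types.
labels_a = ["office", "office", "kitchen", "kitchen"]
before = [0, 0, 1, 1]  # assignments right after training on A
after = [0, 2, 2, 1]   # assignments after additional training on B
gap = retention_gap(before, after, labels_a)
```

A gap near zero is consistent with the no-degradation claim, while a clearly positive gap would count against it. Purity rises with the number of clusters, so a fuller test would also track how many clusters the model creates.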

Figures

Figures reproduced from arXiv: 1907.04001 by Hansenclever F. Bassani, Ygor C. N. Sousa.

**Figure 2.** Diagram built from a data sequence from path 2 of the Freiburg...

**Figure 3.** Semantic map of a data sequence from path 2 of the Freiburg sub-dataset. The colors of the nodes represent the categories found by the model...

**Figure 4.** CE (a) and Accuracy (b) obtained (y axis) for each of the 18 selected data sequences (x axis), evaluated at two moments: right after the data sequence...

Original abstract

This paper introduces an incremental semantic mapping approach, with on-line unsupervised learning, based on Self-Organizing Maps (SOM) for robotic agents. The method includes a mapping module, which incrementally creates a topological map of the environment, enriched with objects recognized around each topological node, and a module of places categorization, endowed with an incremental unsupervised learning SOM with on-line training. The proposed approach was tested in experiments with real-world data, in which it demonstrates promising capabilities of incremental acquisition of topological maps enriched with semantic information, and for clustering together similar places based on this information. The approach was also able to continue learning from newly visited environments without degrading the information previously learned.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

The paper combines incremental topological mapping with an online unsupervised SOM for semantic place clustering and claims stable learning across new environments, but supplies no metrics or baselines to check any of it.

The main point is that this work puts together an incremental topological mapper that tags nodes with recognized objects and feeds them into an online unsupervised SOM for clustering similar places, with the added claim that new environments can be learned without degrading what was already stored. The online training of the SOM is the piece they highlight as different from typical batch-trained maps. The architecture description is clear enough on how object information moves from the mapper into the SOM and how the system is meant to stay stable over time. That stability goal matters for robots that keep exploring.

The soft spot is the evaluation. The experiments are described only as showing promising results on real-world data, with no accuracy figures, no learning curves, no comparison to other incremental or offline clustering methods, and no details on dataset size or how object recognition errors affect the SOM. Without those, the central claim about reliable incremental clustering rests on the authors' assertion rather than visible evidence.

This is the kind of paper that might interest people working on semantic mapping or lifelong robot navigation who want to see one concrete way to wire an online SOM into a topological graph. A reader looking for validated performance numbers or reproducible code would not get much use from it yet. I would not cite the work as it stands. It deserves peer review so the authors can add the missing quantitative results and comparisons, which would let referees judge whether the no-degradation property actually holds.

Referee Report

2 major / 1 minor

Summary. The paper introduces an incremental semantic mapping approach for robotic agents using Self-Organizing Maps (SOM) for on-line unsupervised learning. It consists of a mapping module that incrementally builds a topological map enriched with recognized objects at each node, and a places categorization module using an incremental unsupervised SOM with on-line training. Experiments on real-world data are reported to demonstrate incremental acquisition of topological maps with semantic information, clustering of similar places, and the ability to continue learning from new environments without degrading previously learned information.

Significance. If the experimental results hold with proper validation, the work could advance semantic mapping and lifelong learning in robotics by providing a method for unsupervised, incremental place categorization based on semantic objects that adapts without catastrophic forgetting. However, the lack of quantitative evaluation in the provided description limits the ability to gauge its impact.

major comments (2)

[Abstract / Experimental Results] The abstract claims 'promising capabilities' demonstrated in experiments with real-world data, including incremental map acquisition, place clustering, and continued learning without degradation. However, no quantitative metrics, error analysis, baseline comparisons, or specific method details (such as SOM parameters, object recognition accuracy, or clustering performance measures) are supplied, preventing assessment of whether the data supports the claims.
[Places Categorization Module] The core assumption that an unsupervised on-line SOM can reliably cluster places using only semantic object information attached to topological nodes, without supervision or degradation of prior knowledge when new environments are encountered, is central but lacks supporting details on the SOM architecture, training procedure, or validation experiments.

minor comments (1)

The abstract could benefit from more precise language regarding the experimental setup and results to allow readers to better understand the contributions.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address the major comments point by point below, indicating where revisions will be made to strengthen the manuscript.

Referee: [Abstract / Experimental Results] The abstract claims 'promising capabilities' demonstrated in experiments with real-world data, including incremental map acquisition, place clustering, and continued learning without degradation. However, no quantitative metrics, error analysis, baseline comparisons, or specific method details (such as SOM parameters, object recognition accuracy, or clustering performance measures) are supplied, preventing assessment of whether the data supports the claims.

Authors: We agree that the abstract is high-level and does not include quantitative metrics or specific details. The full manuscript describes the real-world experiments but relies primarily on qualitative demonstrations. To address this, we will revise the abstract to reference key aspects of the results more precisely and expand the experimental section with available details on SOM parameters and performance observations. revision: yes

Referee: [Places Categorization Module] The core assumption that an unsupervised on-line SOM can reliably cluster places using only semantic object information attached to topological nodes, without supervision or degradation of prior knowledge when new environments are encountered, is central but lacks supporting details on the SOM architecture, training procedure, or validation experiments.

Authors: The manuscript outlines the incremental unsupervised SOM for place categorization and reports on-line training results across environments. However, we acknowledge that additional specifics on architecture and training would improve clarity. We will revise the methodology section to provide more explicit details on the SOM structure, training procedure, and how the experiments validate continued learning without degradation. revision: yes

Circularity Check

0 steps flagged

No significant circularity

The paper describes an incremental semantic mapping method using Self-Organizing Maps (SOM) and reports results from experiments on real-world data. No equations, derivations, fitted parameters, or load-bearing self-citations are present in the provided text. The central claims are empirical assertions about the method's performance in incremental map acquisition, place clustering, and continued learning without degradation. These rest on experimental demonstration rather than any self-referential construction or reduction of predictions to inputs by definition. This is the most common honest finding for purely descriptive or experimental papers without mathematical derivations.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; no free parameters, axioms, or invented entities are described in sufficient detail to populate the ledger.

pith-pipeline@v0.9.0 · 5643 in / 984 out tokens · 21058 ms · 2026-05-25T00:40:32.759777+00:00
