BPDA-GMM: Bayesian Probabilistic Data Association via Gaussian Mixture Models for Semantic SLAM

Antonio Sgorbissa; Haolan Zhang; Nak Young Chong; Thanh Nguyen Canh; Xiem HoangVan

arxiv: 2606.04618 · v1 · pith:JUYPKXAGnew · submitted 2026-06-03 · 💻 cs.RO

BPDA-GMM: Bayesian Probabilistic Data Association via Gaussian Mixture Models for Semantic SLAM

Thanh Nguyen Canh , Haolan Zhang , Xiem HoangVan , Antonio Sgorbissa , Nak Young Chong This is my paper

Pith reviewed 2026-06-28 06:37 UTC · model grok-4.3

classification 💻 cs.RO

keywords semantic SLAMprobabilistic data associationGaussian mixture modelsChinese Restaurant ProcessDirichlet processdata associationrobot navigationperceptual aliasing

0 comments

The pith

BPDA-GMM applies a Dirichlet-process prior to manage data association for growing semantic landmark maps in SLAM.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces BPDA-GMM to solve data association problems in semantic SLAM where the map grows over time and environments have repeated similar objects. It employs a Dirichlet process prior that creates a Chinese Restaurant Process model for weighting associations, selecting candidates with a joint semantic and geometric gate, and updating landmarks as Gaussian mixtures. This setup passes dominant components as max-mixture factors to a decoupled back-end that avoids perturbing the trajectory with noisy detections. If the approach works as described, it would produce more accurate trajectories and higher quality semantic maps even when classifiers make errors or scenes look alike from different angles.

Core claim

BPDA-GMM is an online Bayesian PDA framework for semantic SLAM with a growing object-level map. It uses a Dirichlet-process prior to induce a CRP association model where accumulated evidence favors existing landmarks and the concentration parameter assigns probability to new landmarks. For each detection, a joint semantic-geometric gate selects candidates, CRP-weighted probabilities are computed, and landmarks are updated as semantic Gaussians in closed form. The landmark set forms a GMM whose dominant component is passed as a max-mixture semantic factor to the back-end. When weights are inconclusive, alpha-divergence tempering improves discrimination, and a decoupled back-end zeroes the pos

What carries the argument

The Chinese Restaurant Process association model induced by a Dirichlet-process prior, which computes CRP-weighted association probabilities for updating semantic Gaussian landmarks in a growing map.

If this is right

Trajectory accuracy improves in both simulation and real indoor datasets.
Semantic mapping quality increases compared to state-of-the-art baselines.
The system shows greater robustness to perceptual aliasing and classifier errors.
Association remains online without recomputing all weights as the map grows.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The decoupled back-end could allow semantic information to refine maps even when pose estimates carry high uncertainty.
The alpha-divergence tempering step might reduce the rate of incorrect new-landmark creation in long sequences with repeated visits.
The closed-form Gaussian updates could support incremental merging of nearby landmarks when association weights remain ambiguous.

Load-bearing premise

The joint semantic-geometric gate reliably selects plausible candidates and the CRP concentration parameter can be set so that accumulated evidence correctly favors existing landmarks without excessive new-landmark creation.

What would settle it

Running the method on a dataset with known repeated landmarks under perceptual aliasing and checking whether the number of duplicate landmarks created stays low while trajectory error metrics decrease relative to baselines.

Figures

Figures reproduced from arXiv: 2606.04618 by Antonio Sgorbissa, Haolan Zhang, Nak Young Chong, Thanh Nguyen Canh, Xiem HoangVan.

**Figure 1.** Figure 1: BPDA-GMM system overview. Each semantic detection is first filtered by a semantic-geometric gate, then assigned CRP-weighted wtkj association probabilities. These weights update semantic Gaussian landmarks (µˆj , Σˆ j , ˆℓ s j ) in the front-end, while the dominant mixture component is converted into a max-mixture semantic factor for the decoupled back-end. six components, organized to mirror the posterior… view at source ↗

**Figure 2.** Figure 2: Effect of ambiguity-triggered posterior tempering. [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: Estimated trajectories on all benchmark scenarios. [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

**Figure 4.** Figure 4: Robustness to odometry noise and semantic misclas [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗

**Figure 5.** Figure 5: Qualitative object-map comparison on the real indoor sequence. BPDA-GMM produces a compact semantic map with [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗

read the original abstract

Probabilistic data association (PDA) improves semantic SLAM in perceptually aliased scenes, but existing methods often assume a fixed landmark set, recompute association weights as the map grows, or rely on hand-tuned null-hypothesis weights. To address these limitations, we propose \textbf{BPDA-GMM}, an online Bayesian PDA framework for semantic SLAM with a growing object-level map. BPDA-GMM uses a Dirichlet-process prior to induce a Chinese Restaurant Process (CRP) association model, where accumulated evidence favors existing landmarks, and the concentration parameter assigns probability mass to new landmarks. For each semantic detection, plausible candidates are selected by a joint semantic-geometric gate, CRP-weighted association probabilities are computed, and object landmarks are updated as semantic Gaussians in closed form. The resulting landmark set forms a Gaussian mixture model, and its dominant component is passed to the back-end as a max-mixture semantic factor. When association weights are inconclusive, an ambiguity-triggered $\alpha$-divergence tempering step improves discrimination. Finally, a decoupled back-end zeroes the pose Jacobian of semantic factors, allowing noisy detections to refine landmarks without directly perturbing the trajectory. Experiments in simulation and on a real indoor dataset demonstrate improved trajectory accuracy, semantic mapping quality, and robustness to perceptual aliasing and classifier errors over state-of-the-art baselines. Code and video are publicly available at https://github.com/thanhnguyencanh/BPDA-SLAM.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

BPDA-GMM integrates CRP association, closed-form updates, and max-mixture factors for growing semantic maps, but the robustness claims rest on an unexamined concentration parameter.

read the letter

The main addition is an online PDA that uses a Dirichlet-process CRP to handle a growing landmark set without assuming a fixed map or recomputing everything from scratch. It picks candidates with a joint semantic-geometric gate, computes CRP-weighted probabilities, updates landmarks as Gaussians in closed form, feeds the dominant component as a max-mixture factor, and applies alpha-divergence tempering only when weights are inconclusive. The back-end then decouples the semantic factors so noisy detections update landmarks without directly pulling on the trajectory. That pipeline is coherent on paper and directly targets the fixed-landmark limitation in prior PDA work.

Public code and the claim of gains on both simulation and a real indoor dataset are concrete positives. Closed-form updates and the decoupled back-end are practical engineering choices that should reduce computation and avoid trajectory corruption.

The soft spot is the CRP concentration parameter. The abstract states it assigns mass to new landmarks while evidence favors old ones, but gives no sensitivity analysis, ablation, or fixed-value justification across the two datasets. If that single parameter must be retuned to avoid either missed associations or excessive new landmarks, the reported improvements over baselines cannot be cleanly attributed to the method. The joint gate thresholds are also unexamined in the provided text. These are not fatal, but they are load-bearing for the robustness story.

This is for robotics researchers already working on semantic SLAM who need an incremental data-association fix and are willing to check the code themselves. It is coherent enough and grounded enough to deserve peer review rather than a desk reject, though referees will need to see the parameter study and experimental controls.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes BPDA-GMM, an online Bayesian probabilistic data association framework for semantic SLAM with a growing object-level map. It employs a Dirichlet-process prior inducing a Chinese Restaurant Process (CRP) for association probabilities, a joint semantic-geometric gate to select candidates, closed-form Gaussian updates for landmarks, max-mixture semantic factors in the back-end, an ambiguity-triggered α-divergence tempering step, and a decoupled back-end that zeroes the pose Jacobian of semantic factors. Experiments in simulation and on a real indoor dataset are reported to show gains in trajectory accuracy, semantic mapping quality, and robustness to perceptual aliasing and classifier errors over baselines, with public code available.

Significance. If the central claims hold, the work provides a principled online Bayesian treatment of data association for expanding semantic maps that avoids fixed landmark assumptions and hand-tuned null weights, while the public code and video support reproducibility. The decoupled back-end and tempering mechanism address specific practical issues in semantic SLAM pipelines.

major comments (2)

[Abstract] Abstract (paragraph on CRP-weighted association probabilities): the central experimental claim of improved robustness to aliasing and classifier errors rests on the CRP concentration parameter and joint semantic-geometric gate correctly favoring existing landmarks without excessive new-landmark creation, yet no sensitivity analysis, ablation, or derivation is provided showing that a single fixed value generalizes across the reported simulation and real datasets; this is load-bearing because if the parameter induces under- or over-association the PDA step reduces to a standard filter and the reported gains cannot be attributed to BPDA-GMM.
[Abstract] Abstract (final sentence on experiments): the reported improvements in trajectory accuracy and mapping quality are presented without explicit quantification of how the decoupled back-end (zeroing pose Jacobian of semantic factors) or the tempering step contribute versus the CRP association model; an ablation isolating these components is needed to substantiate that the Bayesian PDA is the source of the gains.

minor comments (1)

The manuscript states that code is publicly available at the given GitHub link; confirming that the released implementation matches the described pipeline (including the CRP concentration handling and tempering) would strengthen the reproducibility claim.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address each major comment below and will revise the manuscript to incorporate the suggested analyses.

read point-by-point responses

Referee: [Abstract] Abstract (paragraph on CRP-weighted association probabilities): the central experimental claim of improved robustness to aliasing and classifier errors rests on the CRP concentration parameter and joint semantic-geometric gate correctly favoring existing landmarks without excessive new-landmark creation, yet no sensitivity analysis, ablation, or derivation is provided showing that a single fixed value generalizes across the reported simulation and real datasets; this is load-bearing because if the parameter induces under- or over-association the PDA step reduces to a standard filter and the reported gains cannot be attributed to BPDA-GMM.

Authors: We agree that the concentration parameter is central to the CRP model and that its generalization should be demonstrated. The manuscript currently fixes this value without reporting sensitivity or ablation results. In the revision we will add a dedicated sensitivity analysis (varying the parameter over a range) and an ablation on association performance, evaluated on both the simulation and real indoor datasets, to confirm that the reported gains are attributable to the BPDA-GMM formulation rather than a fortuitous fixed setting. revision: yes
Referee: [Abstract] Abstract (final sentence on experiments): the reported improvements in trajectory accuracy and mapping quality are presented without explicit quantification of how the decoupled back-end (zeroing pose Jacobian of semantic factors) or the tempering step contribute versus the CRP association model; an ablation isolating these components is needed to substantiate that the Bayesian PDA is the source of the gains.

Authors: We acknowledge that the current experiments do not isolate the individual contributions of the decoupled back-end and the ambiguity-triggered tempering step from the CRP-based association. In the revised manuscript we will include additional ablation studies that disable or vary these components independently while keeping the CRP PDA fixed, thereby quantifying their marginal impact on trajectory accuracy and mapping quality relative to the core Bayesian association model. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper proposes BPDA-GMM as an online Bayesian PDA framework using a Dirichlet-process prior to induce CRP association, joint semantic-geometric gating, closed-form Gaussian updates, max-mixture factors, and alpha-divergence tempering. No equations or steps in the abstract or described method reduce any claimed performance gain (trajectory accuracy, mapping quality, robustness) to a fitted parameter or self-citation by construction. The CRP concentration parameter and gate are presented as modeling choices whose behavior is evaluated empirically against baselines on separate simulation and real datasets; these are externally falsifiable and not forced by the paper's own inputs. The derivation relies on standard Bayesian and CRP mechanics without load-bearing self-citations or ansatz smuggling visible in the provided text.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

Only abstract available; concentration parameter of the Dirichlet process is a free parameter whose value affects new-landmark probability; standard Gaussian modeling of semantic detections is assumed.

free parameters (1)

CRP concentration parameter
Controls probability mass assigned to new landmarks; value must be chosen or tuned for the scene.

axioms (1)

domain assumption Semantic object detections can be represented and updated as Gaussians in closed form
Stated in the update step for object landmarks.

pith-pipeline@v0.9.1-grok · 5808 in / 1232 out tokens · 24482 ms · 2026-06-28T06:37:38.671387+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

26 extracted references

[1]

Semantic visual simultaneous localization and mapping: A survey on state of the art, challenges, and future directions,

T. N. Canh, H. Zhang, X. HoangVan, and N. Y . Chong, “Semantic visual simultaneous localization and mapping: A survey on state of the art, challenges, and future directions,”arXiv preprint arXiv:2510.00783, 2025

arXiv 2025
[2]

Maximum likelihood estimation,

T. Brox, “Maximum likelihood estimation,” inComputer Vision: A Reference Guide. Springer, 2021, pp. 799–801

2021
[3]

Probabilis- tic data association for semantic slam,

S. L. Bowman, N. Atanasov, K. Daniilidis, and G. J. Pappas, “Probabilis- tic data association for semantic slam,” inIEEE international conference on robotics and automation (ICRA). IEEE, 2017, pp. 1722–1729

2017
[4]

Inference on networks of mixtures for robust robot mapping,

E. Olson and P. Agarwal, “Inference on networks of mixtures for robust robot mapping,”The International Journal of Robotics Research, vol. 32, no. 7, pp. 826–840, 2013

2013
[5]

Proba- bilistic data association via mixture models for robust semantic SLAM,

K. J. Doherty, D. P. Baxter, E. Schneeweiss, and J. J. Leonard, “Proba- bilistic data association via mixture models for robust semantic SLAM,” inIEEE International Conference on Robotics and Automation (ICRA), 2020, pp. 1098–1104

2020
[6]

Prob- abilistic data association for semantic SLAM at scale,

E. Michael, T. Summers, T. A. Wood, C. Manzie, and I. Shames, “Prob- abilistic data association for semantic SLAM at scale,” inIEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022, pp. 4359–4364

2022
[7]

Robust exploration with multiple hypothesis data association,

J. Wang and B. Englot, “Robust exploration with multiple hypothesis data association,” inIEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain, 2018, pp. 3537–3544

2018
[8]

Mh-isam2: Multi-hypothesis isam using bayes tree and hypo-tree,

M. Hsiao and M. Kaess, “Mh-isam2: Multi-hypothesis isam using bayes tree and hypo-tree,” in2019 International Conference on Robotics and Automation (ICRA). IEEE, 2019, pp. 1274–1280

2019
[9]

Modeling topic hierarchies with the recursive chinese restaurant process,

J. H. Kim, D. Kim, S. Kim, and A. Oh, “Modeling topic hierarchies with the recursive chinese restaurant process,” inProceedings of the 21st ACM international conference on Information and knowledge management, 2012, pp. 783–792

2012
[10]

Tracking in a cluttered environment with probabilistic data association,

Y . Bar-Shalom and E. Tse, “Tracking in a cluttered environment with probabilistic data association,”Automatica, vol. 11, no. 5, pp. 451–460, 1975

1975
[11]

An algorithm for tracking multiple targets,

D. Reid, “An algorithm for tracking multiple targets,”IEEE transactions on Automatic Control, vol. 24, no. 6, pp. 843–854, 2003

2003
[12]

Modeling a dynamic environment using a bayesian multiple hypothesis approach,

I. J. Cox and J. J. Leonard, “Modeling a dynamic environment using a bayesian multiple hypothesis approach,”Artificial intelligence, vol. 66, no. 2, pp. 311–344, 1994

1994
[13]

Multimodal semantic SLAM with probabilistic data association,

K. Doherty, D. Fourie, and J. Leonard, “Multimodal semantic SLAM with probabilistic data association,” inIEEE International Conference on Robotics and Automation (ICRA), 2019, pp. 2419–2425

2019
[14]

An algorithm for ranking all the assignments in order of increasing cost,

K. G. Murty, “An algorithm for ranking all the assignments in order of increasing cost,”Operations research, vol. 16, no. 3, pp. 682–687, 1968

1968
[15]

Data association in stochastic mapping using the joint compatibility test,

J. Neira and J. D. Tard ´os, “Data association in stochastic mapping using the joint compatibility test,”IEEE Transactions on robotics and automation, vol. 17, no. 6, pp. 890–897, 2002

2002
[16]

CubeSLAM: Monocular 3D object SLAM,

S. Yang and S. Scherer, “CubeSLAM: Monocular 3D object SLAM,” IEEE Transactions on Robotics, vol. 35, no. 4, pp. 925–938, 2019

2019
[17]

QuadricSLAM: Dual quadrics from object detections as landmarks in object-oriented SLAM,

L. Nicholson, M. Milford, and N. S ¨underhauf, “QuadricSLAM: Dual quadrics from object detections as landmarks in object-oriented SLAM,” IEEE Robotics and Automation Letters, vol. 4, no. 1, pp. 1–8, 2019

2019
[18]

Semantic SLAM with au- tonomous object-level data association,

Z. Qian, K. Patath, J. Fu, and J. Xiao, “Semantic SLAM with au- tonomous object-level data association,” inIEEE International Confer- ence on Robotics and Automation (ICRA), 2021, pp. 11 203–11 209

2021
[19]

DSP-SLAM: Object oriented SLAM with deep shape priors,

J. Wang, M. R ¨unz, and L. Agapito, “DSP-SLAM: Object oriented SLAM with deep shape priors,” inInternational Conference on 3D Vision (3DV), 2021, pp. 1362–1371

2021
[20]

EAO- SLAM: Monocular semi-dense object SLAM based on ensemble data association,

Y . Wu, Y . Zhang, D. Zhu, Y . Feng, S. Coleman, and D. Kerr, “EAO- SLAM: Monocular semi-dense object SLAM based on ensemble data association,” inIEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2020, pp. 4966–4973

2020
[21]

An object slam framework for association, mapping, and high-level tasks,

Y . Wu, Y . Zhang, D. Zhu, Z. Deng, W. Sun, X. Chen, and J. Zhang, “An object slam framework for association, mapping, and high-level tasks,” IEEE Transactions on Robotics, vol. 39, no. 4, pp. 2912–2932, 2023. 9

2023
[22]

Sgba: Semantic gaussian mixture model-based lidar bundle adjustment,

X. Ji, S. Yuan, J. Li, P. Yin, H. Cao, and L. Xie, “Sgba: Semantic gaussian mixture model-based lidar bundle adjustment,”IEEE Robotics and Automation Letters, vol. 9, no. 12, pp. 10 922–10 929, 2024

2024
[23]

Slideslam: Sparse, lightweight, decentral- ized metric-semantic slam for multirobot navigation,

X. Liu, J. Lei, A. Prabhu, Y . Tao, I. Spasojevic, P. Chaudhari, N. Atanasov, and V . Kumar, “Slideslam: Sparse, lightweight, decentral- ized metric-semantic slam for multirobot navigation,”IEEE Transactions on Robotics, vol. 41, pp. 6529–6548, 2025

2025
[24]

E 2M: Double bounded α-divergence optimization for tensor-based discrete density estimation,

K. Ghalamkari, J. L. Hinrich, and M. Mørup, “E 2M: Double bounded α-divergence optimization for tensor-based discrete density estimation,” Transactions on Machine Learning Research, 2026

2026
[25]

Factor graphs and GTSAM: A hands-on introduction,

F. Dellaert, “Factor graphs and GTSAM: A hands-on introduction,” Georgia Institute of Technology, Tech. Rep. GT-RIM-CP&R-2012-002, 2012

2012
[26]

isam2: Incremental smoothing and mapping using the bayes tree,

M. Kaess, H. Johannsson, R. Roberts, V . Ila, J. J. Leonard, and F. Dellaert, “isam2: Incremental smoothing and mapping using the bayes tree,”The International Journal of Robotics Research, vol. 31, no. 2, pp. 216–235, 2012

2012

[1] [1]

Semantic visual simultaneous localization and mapping: A survey on state of the art, challenges, and future directions,

T. N. Canh, H. Zhang, X. HoangVan, and N. Y . Chong, “Semantic visual simultaneous localization and mapping: A survey on state of the art, challenges, and future directions,”arXiv preprint arXiv:2510.00783, 2025

arXiv 2025

[2] [2]

Maximum likelihood estimation,

T. Brox, “Maximum likelihood estimation,” inComputer Vision: A Reference Guide. Springer, 2021, pp. 799–801

2021

[3] [3]

Probabilis- tic data association for semantic slam,

S. L. Bowman, N. Atanasov, K. Daniilidis, and G. J. Pappas, “Probabilis- tic data association for semantic slam,” inIEEE international conference on robotics and automation (ICRA). IEEE, 2017, pp. 1722–1729

2017

[4] [4]

Inference on networks of mixtures for robust robot mapping,

E. Olson and P. Agarwal, “Inference on networks of mixtures for robust robot mapping,”The International Journal of Robotics Research, vol. 32, no. 7, pp. 826–840, 2013

2013

[5] [5]

Proba- bilistic data association via mixture models for robust semantic SLAM,

K. J. Doherty, D. P. Baxter, E. Schneeweiss, and J. J. Leonard, “Proba- bilistic data association via mixture models for robust semantic SLAM,” inIEEE International Conference on Robotics and Automation (ICRA), 2020, pp. 1098–1104

2020

[6] [6]

Prob- abilistic data association for semantic SLAM at scale,

E. Michael, T. Summers, T. A. Wood, C. Manzie, and I. Shames, “Prob- abilistic data association for semantic SLAM at scale,” inIEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022, pp. 4359–4364

2022

[7] [7]

Robust exploration with multiple hypothesis data association,

J. Wang and B. Englot, “Robust exploration with multiple hypothesis data association,” inIEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain, 2018, pp. 3537–3544

2018

[8] [8]

Mh-isam2: Multi-hypothesis isam using bayes tree and hypo-tree,

M. Hsiao and M. Kaess, “Mh-isam2: Multi-hypothesis isam using bayes tree and hypo-tree,” in2019 International Conference on Robotics and Automation (ICRA). IEEE, 2019, pp. 1274–1280

2019

[9] [9]

Modeling topic hierarchies with the recursive chinese restaurant process,

J. H. Kim, D. Kim, S. Kim, and A. Oh, “Modeling topic hierarchies with the recursive chinese restaurant process,” inProceedings of the 21st ACM international conference on Information and knowledge management, 2012, pp. 783–792

2012

[10] [10]

Tracking in a cluttered environment with probabilistic data association,

Y . Bar-Shalom and E. Tse, “Tracking in a cluttered environment with probabilistic data association,”Automatica, vol. 11, no. 5, pp. 451–460, 1975

1975

[11] [11]

An algorithm for tracking multiple targets,

D. Reid, “An algorithm for tracking multiple targets,”IEEE transactions on Automatic Control, vol. 24, no. 6, pp. 843–854, 2003

2003

[12] [12]

Modeling a dynamic environment using a bayesian multiple hypothesis approach,

I. J. Cox and J. J. Leonard, “Modeling a dynamic environment using a bayesian multiple hypothesis approach,”Artificial intelligence, vol. 66, no. 2, pp. 311–344, 1994

1994

[13] [13]

Multimodal semantic SLAM with probabilistic data association,

K. Doherty, D. Fourie, and J. Leonard, “Multimodal semantic SLAM with probabilistic data association,” inIEEE International Conference on Robotics and Automation (ICRA), 2019, pp. 2419–2425

2019

[14] [14]

An algorithm for ranking all the assignments in order of increasing cost,

K. G. Murty, “An algorithm for ranking all the assignments in order of increasing cost,”Operations research, vol. 16, no. 3, pp. 682–687, 1968

1968

[15] [15]

Data association in stochastic mapping using the joint compatibility test,

J. Neira and J. D. Tard ´os, “Data association in stochastic mapping using the joint compatibility test,”IEEE Transactions on robotics and automation, vol. 17, no. 6, pp. 890–897, 2002

2002

[16] [16]

CubeSLAM: Monocular 3D object SLAM,

S. Yang and S. Scherer, “CubeSLAM: Monocular 3D object SLAM,” IEEE Transactions on Robotics, vol. 35, no. 4, pp. 925–938, 2019

2019

[17] [17]

QuadricSLAM: Dual quadrics from object detections as landmarks in object-oriented SLAM,

L. Nicholson, M. Milford, and N. S ¨underhauf, “QuadricSLAM: Dual quadrics from object detections as landmarks in object-oriented SLAM,” IEEE Robotics and Automation Letters, vol. 4, no. 1, pp. 1–8, 2019

2019

[18] [18]

Semantic SLAM with au- tonomous object-level data association,

Z. Qian, K. Patath, J. Fu, and J. Xiao, “Semantic SLAM with au- tonomous object-level data association,” inIEEE International Confer- ence on Robotics and Automation (ICRA), 2021, pp. 11 203–11 209

2021

[19] [19]

DSP-SLAM: Object oriented SLAM with deep shape priors,

J. Wang, M. R ¨unz, and L. Agapito, “DSP-SLAM: Object oriented SLAM with deep shape priors,” inInternational Conference on 3D Vision (3DV), 2021, pp. 1362–1371

2021

[20] [20]

EAO- SLAM: Monocular semi-dense object SLAM based on ensemble data association,

Y . Wu, Y . Zhang, D. Zhu, Y . Feng, S. Coleman, and D. Kerr, “EAO- SLAM: Monocular semi-dense object SLAM based on ensemble data association,” inIEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2020, pp. 4966–4973

2020

[21] [21]

An object slam framework for association, mapping, and high-level tasks,

Y . Wu, Y . Zhang, D. Zhu, Z. Deng, W. Sun, X. Chen, and J. Zhang, “An object slam framework for association, mapping, and high-level tasks,” IEEE Transactions on Robotics, vol. 39, no. 4, pp. 2912–2932, 2023. 9

2023

[22] [22]

Sgba: Semantic gaussian mixture model-based lidar bundle adjustment,

X. Ji, S. Yuan, J. Li, P. Yin, H. Cao, and L. Xie, “Sgba: Semantic gaussian mixture model-based lidar bundle adjustment,”IEEE Robotics and Automation Letters, vol. 9, no. 12, pp. 10 922–10 929, 2024

2024

[23] [23]

Slideslam: Sparse, lightweight, decentral- ized metric-semantic slam for multirobot navigation,

X. Liu, J. Lei, A. Prabhu, Y . Tao, I. Spasojevic, P. Chaudhari, N. Atanasov, and V . Kumar, “Slideslam: Sparse, lightweight, decentral- ized metric-semantic slam for multirobot navigation,”IEEE Transactions on Robotics, vol. 41, pp. 6529–6548, 2025

2025

[24] [24]

E 2M: Double bounded α-divergence optimization for tensor-based discrete density estimation,

K. Ghalamkari, J. L. Hinrich, and M. Mørup, “E 2M: Double bounded α-divergence optimization for tensor-based discrete density estimation,” Transactions on Machine Learning Research, 2026

2026

[25] [25]

Factor graphs and GTSAM: A hands-on introduction,

F. Dellaert, “Factor graphs and GTSAM: A hands-on introduction,” Georgia Institute of Technology, Tech. Rep. GT-RIM-CP&R-2012-002, 2012

2012

[26] [26]

isam2: Incremental smoothing and mapping using the bayes tree,

M. Kaess, H. Johannsson, R. Roberts, V . Ila, J. J. Leonard, and F. Dellaert, “isam2: Incremental smoothing and mapping using the bayes tree,”The International Journal of Robotics Research, vol. 31, no. 2, pp. 216–235, 2012

2012