SNGR: Selective Non-Gaussian Refinement for Ambiguous SLAM Factor Graphs

Anushka Kulkarni; Sarthak Dubey

arxiv: 2604.22065 · v1 · submitted 2026-04-23 · 💻 cs.RO · cs.NA· math.NA

SNGR: Selective Non-Gaussian Refinement for Ambiguous SLAM Factor Graphs

Anushka Kulkarni , Sarthak Dubey This is my paper

Pith reviewed 2026-05-09 20:58 UTC · model grok-4.3

classification 💻 cs.RO cs.NAmath.NA

keywords SLAMfactor graphsnon-Gaussian inferencedata associationnested samplingambiguous posteriorsselective refinementrange-only SLAM

0 comments

The pith

SNGR augments iSAM2 by detecting Gaussian failure windows via condition numbers and applying targeted nested sampling with gating.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents a SLAM framework that keeps the speed of standard incremental solvers but adds non-Gaussian refinement only where the Gaussian approximation is likely to break. Detection relies on the condition number of joint marginal covariances; refinement uses nested sampling on the full factor-graph likelihood inside those windows. A gating step rejects refinements that would degrade the posterior when it is multimodal. Tests on range-only SLAM with deliberately wrong data associations show accurate failure detection, higher local likelihoods, and lower cost than running non-Gaussian inference everywhere. Readers should care because real-world SLAM routinely encounters ambiguous associations that Gaussian methods cannot handle, and exhaustive non-Gaussian fixes are too slow for online use.

Core claim

SNGR augments iSAM2 with targeted nested sampling on windows where Gaussian approximations are likely to fail. Such regions are identified by the condition number of joint marginal covariances. Refinement is performed with the full nonlinear factor-graph likelihood and protected by a gating mechanism that prevents degradation in multimodal cases. On range-only SLAM instances containing wrong data associations, the method produces high-precision failure detection, consistent local likelihood gains, and reduced computational cost relative to exhaustive non-Gaussian inference.

What carries the argument

Condition number of joint marginal covariances, used to select windows for gated nested sampling on the nonlinear factor graph.

If this is right

SNGR maintains posterior consistency in the presence of wrong data associations without exhaustive non-Gaussian computation.
Local likelihood improvements inside selected windows translate to better global map estimates in ambiguous range-only scenarios.
Computational cost scales with the number of detected failure windows rather than the full graph size.
The gating mechanism allows safe use even when some windows remain multimodal after refinement.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same condition-number heuristic could be tested on other factor-graph problems that suffer from discrete ambiguities, such as robust pose-graph optimization.
If the gating threshold proves stable across sensors, SNGR might serve as a lightweight drop-in layer for any incremental Gaussian SLAM solver.
Extending the detection criterion beyond condition number to include higher-order moments could reduce missed multimodal windows.

Load-bearing premise

The condition number of joint marginal covariances reliably flags the windows where Gaussian approximations fail, and the gating step reliably prevents degradation when the posterior is multimodal.

What would settle it

A controlled experiment in which SNGR is applied to a known multimodal posterior, the condition number does not flag the ambiguous region, and the final likelihood after refinement is lower than the Gaussian baseline.

Figures

Figures reproduced from arXiv: 2604.22065 by Anushka Kulkarni, Sarthak Dubey.

**Figure 3.** Figure 3: Per-window scores sw (seed 0, τ = 3.96). Distributions at p = 0.0 and p = 0.1 are identical, confirming the blind spot is structural. 0.0 0.1 0.2 0.3 0.4 Wrong-association noise fraction 10 0 10 1 10 2 Mean NEES (log scale) Consistent (NEES = 2) mean ± std (a) NEES (log scale) vs noise. At p = 0.1, NEES= 148 with zero triggers. 0.0 0.1 0.2 0.3 0.4 Wrong-association noise fraction 3.25 3.50 3.75 4.00 4.25 4… view at source ↗

**Figure 2.** Figure 2: Bimodal proof-of-concept. Anchors at A = (0, 0), B = (4, 0) observe L(0) at r = 3.0 m. Constraint manifold bimodal at (2, ± √ 5). iSAM2 converges to saddle; nested sampling recovers both modes. TABLE I: Bimodal Experiment: iSAM2 vs Nested Sampling iSAM2 Nested Sampling L(0) estimate (m) (2.000, 0.000) (1.984, −2.232) Log-likelihood (nats) −12.50 −0.01 Error from mode (m) 2.236 0.004 ∆log p — +12.49 Weighte… view at source ↗

**Figure 4.** Figure 4: Trigger-only baseline diagnostics (τ = 3.96, 5 seeds). Overconfidence grows with noise; trigger remains blind at low contamination. TABLE II: Threshold Sensitivity (Seed 0) τ Clean Noisy (p = 0.3) 3.92 0/28 21/28 3.96 0/28 14/28 4.0 0/28 8/28 TABLE III: Baseline Results (5-Seed RMSE; Seed 0 NEES, τ = 3.96) p RMSE (m) NEES Prec. Recall Trig./Fail 0.0 0.15 ± 0.06 0.36 — — 0/0 0.1 1.27 ± 1.20 148.52 — 0.00 0/… view at source ↗

**Figure 5.** Figure 5: SNGR diagnostics at p = 0.2. All improvements genuine; ESS indicates within-mode corrections. At p = 0.1, NEES = 148 with zero triggers: covariance shape and accuracy are independent properties. At 10% contamination, corrupted factors spread uniformly and their MAP displacements partially cancel, leaving isotropic covariance despite trajectory bias. At p ≥ 0.2, enough corruption accumulates to produce elo… view at source ↗

read the original abstract

We present Selective Non-Gaussian Refinement (SNGR), a SLAM framework that augments iSAM2 with targeted nested sampling on windows where Gaussian approximations are likely to fail. We detect such regions using the condition number of joint marginal covariances and selectively refine them using the full nonlinear factor graph likelihood, with a gating mechanism to avoid degradation in multimodal cases. Experiments on range-only SLAM with wrong data association show that SNGR achieves high-precision failure detection and consistent local likelihood improvements while reducing computational cost relative to exhaustive non-Gaussian inference. These results highlight both the promise and the limitations of selective refinement for approximate SLAM posteriors.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

SNGR's selective non-Gaussian refinement on iSAM2 is a reasonable practical idea but the condition-number detector and the thin experimental reporting leave its reliability unproven.

read the letter

The key takeaway from this paper is that SNGR offers a selective way to refine iSAM2 factor graphs with non-Gaussian methods, but the validation leaves too many questions about whether the condition-number trigger works as intended. The approach is new in how it combines covariance condition numbers for window selection with a gating step to prevent degradation during nested sampling. It builds directly on iSAM2 and standard nested sampling without inventing new solvers. What it does well is focus on the practical problem of wrong data associations in range-only SLAM, where Gaussian assumptions break. The framework tries to keep most of the computation cheap while fixing local issues. The soft spots are in the experimental reporting and the core assumption. The abstract mentions positive outcomes but gives no metrics, so we cannot see the actual precision of failure detection or the cost savings. The stress-test concern holds: condition numbers can rise for reasons unrelated to multimodality, such as numerical instability or weak observability. If that correlation is loose, the method might either miss bad regions or waste effort on good ones. The gating helps in theory but relies on the same signal. This paper is for robotics researchers working on robust SLAM and factor graph inference. Someone building systems that need to handle ambiguity would get ideas from it, though they would need to implement and test the detector themselves. I think it deserves peer review. The idea is worth exploring further with stronger experiments, and a referee could push for the missing ablations and quantitative results.

Referee Report

3 major / 2 minor

Summary. The paper presents Selective Non-Gaussian Refinement (SNGR), a SLAM framework that augments iSAM2 with targeted nested sampling applied only to windows where Gaussian approximations are likely to fail. These windows are detected using the condition number of joint marginal covariances, with a gating mechanism to avoid degradation in multimodal cases. Experiments on range-only SLAM with wrong data association claim that SNGR achieves high-precision failure detection, consistent local likelihood improvements, and reduced computational cost relative to exhaustive non-Gaussian inference.

Significance. If the central claims hold, SNGR would provide a practical way to improve robustness of approximate SLAM posteriors in ambiguous settings (e.g., data-association errors) by selectively invoking expensive non-Gaussian inference only where needed, rather than exhaustively. This selective approach could reduce overall cost while preserving accuracy in robotics applications where full nested sampling is prohibitive.

major comments (3)

[Abstract] Abstract: The claim of 'high-precision failure detection' and 'consistent local likelihood improvements' is presented without any quantitative metrics (precision, recall, likelihood ratios, error bars, dataset sizes, or number of trials). This absence makes it impossible to evaluate whether the experimental outcomes actually support the stated advantages over iSAM2 and exhaustive nested sampling.
[Method / Experiments] The central experimental claim depends on the condition number of joint marginal covariances correctly identifying regions where the iSAM2 Gaussian posterior is a poor approximation to the true nonlinear factor-graph likelihood due to multimodality. High condition numbers can arise from poor observability, near-singularities, or numerical artifacts without implying multimodality; conversely, some multimodal posteriors may not elevate the condition number. The manuscript must supply ablation studies or direct correlation analysis between condition-number thresholds and actual likelihood gaps (or mode counts) to substantiate that the detector is load-bearing for the precision and cost claims.
[Method] The gating mechanism is asserted to prevent degradation when the posterior is multimodal, yet its trigger still relies on the same condition-number detector. Evidence is required that the gate correctly withholds refinement in cases where applying nested sampling would lower the likelihood, and that false negatives (missed multimodal windows) do not undermine the overall posterior quality.

minor comments (2)

[Abstract] The abstract refers to 'range-only SLAM with wrong data association' but does not name the specific datasets, simulation parameters, or how the wrong associations were generated.
[Method] Notation for the joint marginal covariance and its condition number should be defined explicitly with an equation reference when first introduced.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their thorough review and constructive feedback on our manuscript. We address each of the major comments below and outline the revisions we plan to make to strengthen the presentation of our results and the validation of our method.

read point-by-point responses

Referee: The claim of 'high-precision failure detection' and 'consistent local likelihood improvements' is presented without any quantitative metrics (precision, recall, likelihood ratios, error bars, dataset sizes, or number of trials). This absence makes it impossible to evaluate whether the experimental outcomes actually support the stated advantages over iSAM2 and exhaustive nested sampling.

Authors: We agree that including quantitative metrics in the abstract would strengthen the summary of our contributions. Although the detailed results, including precision and recall for the failure detector, likelihood improvement ratios, and statistics over multiple trials and datasets, are provided in the Experiments section, we will revise the abstract to incorporate key quantitative highlights from our evaluation on range-only SLAM scenarios. This change will make the abstract more informative while remaining concise. revision: yes
Referee: The central experimental claim depends on the condition number of joint marginal covariances correctly identifying regions where the iSAM2 Gaussian posterior is a poor approximation to the true nonlinear factor-graph likelihood due to multimodality. High condition numbers can arise from poor observability, near-singularities, or numerical artifacts without implying multimodality; conversely, some multimodal posteriors may not elevate the condition number. The manuscript must supply ablation studies or direct correlation analysis between condition-number thresholds and actual likelihood gaps (or mode counts) to substantiate that the detector is load-bearing for the precision and cost claims.

Authors: This is an important point, and we acknowledge that condition numbers are an indirect indicator that may be influenced by factors other than multimodality. Our approach uses the condition number as a practical heuristic for selecting windows likely to benefit from non-Gaussian refinement, motivated by the properties of SLAM factor graphs. To provide stronger evidence, we will include in the revised manuscript an ablation analysis that examines the correlation between condition number values and the likelihood improvements achieved by nested sampling, as well as comparisons to mode counts in selected regions. This will help validate the detector's role in achieving the reported precision and efficiency gains. revision: yes
Referee: The gating mechanism is asserted to prevent degradation when the posterior is multimodal, yet its trigger still relies on the same condition-number detector. Evidence is required that the gate correctly withholds refinement in cases where applying nested sampling would lower the likelihood, and that false negatives (missed multimodal windows) do not undermine the overall posterior quality.

Authors: We recognize the need for explicit validation of the gating mechanism. The gate is intended to avoid unnecessary or potentially harmful refinements by checking for potential degradation, but as noted, it shares the condition-number basis. In the revised version, we will add targeted experiments that evaluate the gate's performance, including cases where refinement is withheld and the resulting likelihoods compared to exhaustive application. We will also report on the impact of potential false negatives on the global posterior quality in our SLAM experiments. These additions will provide the requested evidence. revision: yes

Circularity Check

0 steps flagged

No circularity: SNGR is an empirical augmentation with an independent selection heuristic

full rationale

The paper augments iSAM2 with selective nested sampling on windows flagged by the condition number of joint marginal covariances, plus a gating rule. No equations, derivations, or performance claims reduce by construction to fitted parameters, self-defined quantities, or a self-citation chain. The condition-number detector and gating mechanism are presented as new heuristics whose validity is tested empirically on range-only SLAM data; they are not derived from the target likelihood improvements. Experiments compare against exhaustive nested sampling and report cost/accuracy trade-offs without any step that renames a fit as a prediction or imports uniqueness from the authors' prior work. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review yields no explicit free parameters, axioms, or invented entities. The method description implies standard factor-graph assumptions and existing nested sampling but does not introduce new ones at the visible level.

pith-pipeline@v0.9.0 · 5409 in / 1207 out tokens · 39392 ms · 2026-05-09T20:58:02.486796+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

17 extracted references · 17 canonical work pages

[1]

Factor graphs for robot perception,

F. Dellaert and M. Kaess, “Factor graphs for robot perception,”Found. Trends Robot., vol. 6, no. 1/2, pp. 1–139, 2017

work page 2017
[2]

iSAM2: Incremental smoothing and mapping using the Bayes tree,

M. Kaess, H. Johannsson, R. Roberts, V . Ila, J. J. Leonard, and F. Dellaert, “iSAM2: Incremental smoothing and mapping using the Bayes tree,”Int. J. Robot. Res., vol. 31, no. 2, pp. 216–235, 2012

work page 2012
[3]

Nested sampling for non- Gaussian inference in SLAM factor graphs,

Q. Huang, A. Papalia, and J. J. Leonard, “Nested sampling for non- Gaussian inference in SLAM factor graphs,”IEEE Robot. Autom. Lett., vol. 7, no. 4, pp. 9232–9239, Oct. 2022

work page 2022
[4]

Informed, constrained, aligned: A field analysis on degeneracy- aware point cloud registration in the wild,

T. Tuna, J. Nubert, P. Pfreundschuh, C. Cadena, S. Khattak, and M. Hut- ter, “Informed, constrained, aligned: A field analysis on degeneracy- aware point cloud registration in the wild,” arXiv:2408.11809, 2024

work page arXiv 2024
[5]

dynesty: A dynamic nested sampling package for estimating Bayesian posteriors and evidences,

J. S. Speagle, “dynesty: A dynamic nested sampling package for estimating Bayesian posteriors and evidences,”Mon. Not. R. Astron. Soc., vol. 493, no. 3, pp. 3132–3158, 2020

work page 2020
[6]

iSAM: Incremental smooth- ing and mapping,

M. Kaess, A. Ranganathan, and F. Dellaert, “iSAM: Incremental smooth- ing and mapping,”IEEE Trans. Robot., vol. 24, no. 6, pp. 1365–1378, 2008

work page 2008
[7]

g2o: A general framework for graph optimization,

R. K ¨ummerle, G. Grisetti, H. Strasdat, K. Konolige, and W. Burgard, “g2o: A general framework for graph optimization,” inProc. IEEE ICRA, 2011, pp. 3607–3613

work page 2011
[8]

Past, present, and future of simultaneous localization and mapping,

C. Cadenaet al., “Past, present, and future of simultaneous localization and mapping,”IEEE Trans. Robot., vol. 32, no. 6, pp. 1309–1332, 2016

work page 2016
[9]

Towards a robust back-end for pose graph SLAM,

N. S ¨underhauf and P. Protzel, “Towards a robust back-end for pose graph SLAM,” inProc. IEEE ICRA, 2012, pp. 1254–1261

work page 2012
[10]

Robust map optimization using dynamic covariance scaling,

P. Agarwal, G. D. Tipaldi, L. Spinello, C. Stachniss, and W. Burgard, “Robust map optimization using dynamic covariance scaling,” inProc. IEEE ICRA, 2013, pp. 62–69

work page 2013
[11]

Inference on networks of mixtures for robust robot mapping,

E. Olson and P. Agarwal, “Inference on networks of mixtures for robust robot mapping,”Int. J. Robot. Res., vol. 32, no. 7, pp. 826–840, 2013

work page 2013
[12]

Thrun, W

S. Thrun, W. Burgard, and D. Fox,Probabilistic Robotics. MIT Press, 2005

work page 2005
[13]

FastSLAM: A factored solution to simultaneous localization and mapping,

M. Montemerlo, S. Thrun, D. Koller, and B. Wegbreit, “FastSLAM: A factored solution to simultaneous localization and mapping,” inProc. AAAI, 2002, pp. 593–598

work page 2002
[14]

A benchmark for the evaluation of RGB-D SLAM systems,

J. Sturm, N. Engelhard, F. Endres, W. Burgard, and D. Cremers, “A benchmark for the evaluation of RGB-D SLAM systems,” inProc. IEEE/RSJ IROS, 2012, pp. 573–580

work page 2012
[15]

A tutorial on quantitative trajectory evaluation for visual(-inertial) odometry,

Z. Zhang and D. Scaramuzza, “A tutorial on quantitative trajectory evaluation for visual(-inertial) odometry,” inProc. IEEE/RSJ IROS, 2018, pp. 7244–7251

work page 2018
[16]

Bar-Shalom, X.-R

Y . Bar-Shalom, X.-R. Li, and T. Kirubarajan,Estimation with Applica- tions to Tracking and Navigation. Wiley, 2001

work page 2001
[17]

Assessing bimodality to detect the presence of a dual cognitive process,

J. B. Freeman and R. Dale, “Assessing bimodality to detect the presence of a dual cognitive process,”Behav. Res. Methods, vol. 45, no. 1, pp. 83–97, 2013

work page 2013

[1] [1]

Factor graphs for robot perception,

F. Dellaert and M. Kaess, “Factor graphs for robot perception,”Found. Trends Robot., vol. 6, no. 1/2, pp. 1–139, 2017

work page 2017

[2] [2]

iSAM2: Incremental smoothing and mapping using the Bayes tree,

M. Kaess, H. Johannsson, R. Roberts, V . Ila, J. J. Leonard, and F. Dellaert, “iSAM2: Incremental smoothing and mapping using the Bayes tree,”Int. J. Robot. Res., vol. 31, no. 2, pp. 216–235, 2012

work page 2012

[3] [3]

Nested sampling for non- Gaussian inference in SLAM factor graphs,

Q. Huang, A. Papalia, and J. J. Leonard, “Nested sampling for non- Gaussian inference in SLAM factor graphs,”IEEE Robot. Autom. Lett., vol. 7, no. 4, pp. 9232–9239, Oct. 2022

work page 2022

[4] [4]

Informed, constrained, aligned: A field analysis on degeneracy- aware point cloud registration in the wild,

T. Tuna, J. Nubert, P. Pfreundschuh, C. Cadena, S. Khattak, and M. Hut- ter, “Informed, constrained, aligned: A field analysis on degeneracy- aware point cloud registration in the wild,” arXiv:2408.11809, 2024

work page arXiv 2024

[5] [5]

dynesty: A dynamic nested sampling package for estimating Bayesian posteriors and evidences,

J. S. Speagle, “dynesty: A dynamic nested sampling package for estimating Bayesian posteriors and evidences,”Mon. Not. R. Astron. Soc., vol. 493, no. 3, pp. 3132–3158, 2020

work page 2020

[6] [6]

iSAM: Incremental smooth- ing and mapping,

M. Kaess, A. Ranganathan, and F. Dellaert, “iSAM: Incremental smooth- ing and mapping,”IEEE Trans. Robot., vol. 24, no. 6, pp. 1365–1378, 2008

work page 2008

[7] [7]

g2o: A general framework for graph optimization,

R. K ¨ummerle, G. Grisetti, H. Strasdat, K. Konolige, and W. Burgard, “g2o: A general framework for graph optimization,” inProc. IEEE ICRA, 2011, pp. 3607–3613

work page 2011

[8] [8]

Past, present, and future of simultaneous localization and mapping,

C. Cadenaet al., “Past, present, and future of simultaneous localization and mapping,”IEEE Trans. Robot., vol. 32, no. 6, pp. 1309–1332, 2016

work page 2016

[9] [9]

Towards a robust back-end for pose graph SLAM,

N. S ¨underhauf and P. Protzel, “Towards a robust back-end for pose graph SLAM,” inProc. IEEE ICRA, 2012, pp. 1254–1261

work page 2012

[10] [10]

Robust map optimization using dynamic covariance scaling,

P. Agarwal, G. D. Tipaldi, L. Spinello, C. Stachniss, and W. Burgard, “Robust map optimization using dynamic covariance scaling,” inProc. IEEE ICRA, 2013, pp. 62–69

work page 2013

[11] [11]

Inference on networks of mixtures for robust robot mapping,

E. Olson and P. Agarwal, “Inference on networks of mixtures for robust robot mapping,”Int. J. Robot. Res., vol. 32, no. 7, pp. 826–840, 2013

work page 2013

[12] [12]

Thrun, W

S. Thrun, W. Burgard, and D. Fox,Probabilistic Robotics. MIT Press, 2005

work page 2005

[13] [13]

FastSLAM: A factored solution to simultaneous localization and mapping,

M. Montemerlo, S. Thrun, D. Koller, and B. Wegbreit, “FastSLAM: A factored solution to simultaneous localization and mapping,” inProc. AAAI, 2002, pp. 593–598

work page 2002

[14] [14]

A benchmark for the evaluation of RGB-D SLAM systems,

J. Sturm, N. Engelhard, F. Endres, W. Burgard, and D. Cremers, “A benchmark for the evaluation of RGB-D SLAM systems,” inProc. IEEE/RSJ IROS, 2012, pp. 573–580

work page 2012

[15] [15]

A tutorial on quantitative trajectory evaluation for visual(-inertial) odometry,

Z. Zhang and D. Scaramuzza, “A tutorial on quantitative trajectory evaluation for visual(-inertial) odometry,” inProc. IEEE/RSJ IROS, 2018, pp. 7244–7251

work page 2018

[16] [16]

Bar-Shalom, X.-R

Y . Bar-Shalom, X.-R. Li, and T. Kirubarajan,Estimation with Applica- tions to Tracking and Navigation. Wiley, 2001

work page 2001

[17] [17]

Assessing bimodality to detect the presence of a dual cognitive process,

J. B. Freeman and R. Dale, “Assessing bimodality to detect the presence of a dual cognitive process,”Behav. Res. Methods, vol. 45, no. 1, pp. 83–97, 2013

work page 2013