Natural Gradient Bayesian Filtering: Geometry-Aware Filter for Dynamical Systems

Chang Liu; Jiayu Yuan; Shengbo Eben Li; Stephen Shing-Toung Yau; Tianyi Zhang; Ting Yuan; Wei Wu; Wenhan Cao; Yao Lyu; Yi Zeng

arxiv: 2605.02306 · v1 · submitted 2026-05-04 · 💻 cs.RO · cs.SY· eess.SY

Natural Gradient Bayesian Filtering: Geometry-Aware Filter for Dynamical Systems

Chang Liu , Wenhan Cao , Zeju Sun , Tianyi Zhang , Jiayu Yuan , Yi Zeng , Ting Yuan , Yao Lyu

show 3 more authors

Wei Wu Stephen Shing-Toung Yau Shengbo Eben Li

This is my paper

Pith reviewed 2026-05-08 18:37 UTC · model grok-4.3

classification 💻 cs.RO cs.SYeess.SY

keywords natural gradientBayesian filteringGaussian approximationKalman filterinformation geometrynonlinear state estimationroboticsattitude estimation

0 comments

The pith

Natural gradient descent on the Gaussian manifold recovers the Kalman measurement update exactly in the linear case and extends geometrically to nonlinear filtering.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that viewing prediction and measurement updates as inference on the manifold of Gaussian distributions allows natural gradient descent to refine posterior mean and covariance iteratively. This respects the intrinsic geometry of the Gaussian family, automatically preserving positive definiteness of the covariance. In the linear-Gaussian setting a single natural-gradient step reproduces the classical Kalman update exactly. The approach is positioned as a geometry-aware alternative to extended and unscented Kalman filters for nonlinear dynamical systems.

Core claim

The NANO filter performs Bayesian filtering by iteratively applying natural gradient steps on the statistical manifold of multivariate Gaussians, using the Fisher information metric to update both mean and covariance parameters while guaranteeing that the covariance remains positive definite. In the linear-Gaussian case this procedure coincides exactly with the Kalman measurement update after one step; in nonlinear cases it supplies an iterative refinement of the Gaussian posterior approximation.

What carries the argument

Natural gradient descent on the statistical manifold of Gaussian distributions, where the Fisher-Rao metric defines the direction of steepest ascent for the negative log-likelihood or KL divergence objective.

If this is right

A single natural-gradient step recovers the exact Kalman measurement update when the system and measurement models are linear and the noise is Gaussian.
Covariance matrices remain positive definite by construction throughout the iterative updates.
The filter can be applied directly to nonlinear estimation tasks such as satellite attitude determination, SLAM, and state estimation on quadruped and humanoid robots.
Prediction and measurement updates are treated uniformly as optimization steps on the same Gaussian manifold.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The geometric view may allow systematic incorporation of constraints or additional manifold structure (for example, on rotation groups) into the filter without ad-hoc projections.
Because each step is an optimization move, the method could be combined with line-search or adaptive step-size rules to improve robustness in strongly nonlinear regimes.
The exact recovery of Kalman in the linear case suggests that differences in performance on nonlinear problems arise purely from how the geometry-aware steps handle higher-order effects compared with linearization or sigma-point sampling.

Load-bearing premise

That natural gradient steps on the Gaussian manifold produce a valid approximation to the true posterior in nonlinear systems without introducing new instabilities or biases beyond those already present in standard Gaussian filters.

What would settle it

Apply the NANO filter to a linear-Gaussian problem and check whether the mean and covariance after one step match the closed-form Kalman update to machine precision; or apply it to a nonlinear benchmark such as satellite attitude estimation and verify whether the covariance matrices remain positive definite while the estimation error stays below that of an unscented Kalman filter baseline.

Figures

Figures reproduced from arXiv: 2605.02306 by Chang Liu, Jiayu Yuan, Shengbo Eben Li, Stephen Shing-Toung Yau, Tianyi Zhang, Ting Yuan, Wei Wu, Wenhan Cao, Yao Lyu, Yi Zeng, Zeju Sun.

**Figure 1.** Figure 1: Illustration of Fisher information metric view at source ↗

**Figure 2.** Figure 2: Illustration of NANO filter The transformation matrix between ϱ and θ: ∂ϱ ∂θ = ∂ 2ψ ∂θi∂θj ij = I(θ), is just the Fisher information matrix. In this way, the Fisher information matrix with respect to the expectation parameters ϱ can be written as ˜I(ϱ) = Ep(x,ϱ) view at source ↗

**Figure 3.** Figure 3: Comparison of attitude estimation performance under view at source ↗

**Figure 4.** Figure 4: Vehicle in the 2D coordinate system view at source ↗

**Figure 5.** Figure 5: Victoria Park map and vehicle trajectory obtained from view at source ↗

**Figure 6.** Figure 6: Experimental results of the vehicle localization and view at source ↗

**Figure 7.** Figure 7: Different environments of real-world legged robot view at source ↗

**Figure 8.** Figure 8: Estimated position, velocity, and orientation for InEKF and NANO-L on the unstable terrain. view at source ↗

**Figure 9.** Figure 9: Estimated trajectories from InEKF and NANO-L on the flat terrain with different distances and durations. From left to view at source ↗

**Figure 10.** Figure 10: Estimated velocity from EKF, UKF and NANO. view at source ↗

**Figure 11.** Figure 11: Estimated trajectories from EKF, UKF and NANO. view at source ↗

read the original abstract

Bayesian filtering is a cornerstone of state estimation in complex systems such as aerospace systems, yet exact solutions are available only for linear Gaussian models. In practice,nonlinear systems are handled through tractable approximations,with Gaussian filters such as the extended and unscented Kalman filters being among the most widely used methods. This tutorial revisits Gaussian filtering from an information-geometric perspective, viewing the prediction and measurement update steps as inference procedures over state distributions. Within this framework, we introduce a geometry-aware Gaussian filtering approach that leverages natural gradient descent on the statistical manifold of Gaussian distributions. The resulting Natural Gradient Gaussian Approximation (NANO) filter iteratively refines the posterior mean and covariance while respecting the intrinsic geometry of the Gaussian family and preserving the positive definiteness of the covariance matrix. We further highlight fundamental connections to the classical Kalman filtering, showing that a single natural-gradient step exactly recovers the Kalman measurement update in the linear-Gaussian case. The practical implications of the proposed framework are illustrated through case studies in representative nonlinear estimation problems,including satellite attitude estimation, simultaneous localization and mapping, and state estimation for robotic systems including quadruped and humanoid robots.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The NANO filter recovers the exact Kalman measurement update with one natural-gradient step on the Gaussian manifold and uses iterative steps for nonlinear cases while preserving positive definiteness by construction.

read the letter

The paper's main move is to treat both the prediction and measurement update in Gaussian filtering as natural gradient descent on the statistical manifold of Gaussians. In the linear-Gaussian setting this yields the Kalman update after a single step, which follows directly from the Fisher information metric matching the information-form update. For nonlinear systems the method iterates while staying on the manifold, so the covariance stays positive definite without extra fixes. That connection and the built-in geometric property are the cleanest parts of the work. The authors also lay out how the classical Kalman filter sits inside this framework, which makes the contribution easy to place against EKF and UKF. The case studies on satellite attitude, SLAM, and quadruped/humanoid state estimation show the filter running on representative robotics problems without obvious divergence. The derivations for the linear recovery check out and the overall argument is internally consistent. The empirical side is thinner. The examples are mostly demonstrations rather than systematic comparisons with tables of RMSE or stability metrics across repeated trials, so it is still unclear how much accuracy or robustness the geometry actually buys over standard Gaussian filters in practice. The paper reads more like a tutorial that introduces the geometric lens than a comprehensive benchmark study. This is useful reading for researchers in robotics and aerospace who already work with Gaussian approximations and want to see how information geometry can be applied directly to the filter steps. A reader comfortable with natural gradients will pick up the ideas quickly. I would send it to peer review. The central claims are verifiable from the derivations, the approach is coherent, and referees can push on the experimental depth without the paper falling apart on its own terms.

Referee Report

0 major / 2 minor

Summary. The manuscript presents a tutorial on Gaussian Bayesian filtering from an information-geometric viewpoint. It introduces the Natural Gradient Gaussian Approximation (NANO) filter, which performs iterative natural-gradient descent on the statistical manifold of Gaussian distributions to refine posterior mean and covariance. A central claim is that a single natural-gradient step exactly recovers the classical Kalman measurement update in the linear-Gaussian case; the method is shown to preserve positive definiteness by construction. The framework is illustrated via case studies on nonlinear problems including satellite attitude estimation, SLAM, and state estimation for quadruped and humanoid robots.

Significance. If the derivations are correct, the work supplies a geometrically principled reinterpretation of Gaussian filtering that recovers the Kalman filter as a special case and automatically respects the positive-definiteness constraint. The exact recovery result anchors the approach in classical theory and may help researchers design manifold-aware updates for robotics and aerospace applications. The tutorial format usefully connects optimization on statistical manifolds with filtering practice.

minor comments (2)

Abstract: missing space in 'In practice,nonlinear systems'.
Abstract: the list of case studies repeats 'including' ('including satellite attitude estimation, simultaneous localization and mapping, and state estimation for robotic systems including quadruped...').

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the careful reading and positive assessment of the manuscript. The summary accurately reflects the tutorial's focus on information geometry for Gaussian filtering, the introduction of the NANO filter, and the exact recovery of the Kalman update as a special case. We appreciate the recommendation for minor revision and the recognition of potential utility in robotics and aerospace applications. No specific major comments were listed in the report.

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper's derivation chain is self-contained. The central result—that one natural-gradient step on the Gaussian statistical manifold recovers the Kalman measurement update—follows directly from the known coincidence between the Fisher information metric and the information-form update under linear-Gaussian assumptions; this is a standard geometric identity, not a redefinition or fit of the target quantity. The NANO filter's iterative refinement is defined as natural-gradient descent on the manifold, which by construction stays within the positive-definite cone; this is an intrinsic property of the chosen geometry rather than an input smuggled back as output. No self-citations are used as load-bearing premises for uniqueness or ansatz, and the abstract plus skeptic analysis show no equations that reduce the claimed prediction to a fitted parameter or prior result by construction. The argument therefore does not collapse to its own inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The approach rests on the standard assumption that Gaussian approximations remain useful for the target nonlinear systems and that the natural gradient provides a geometrically meaningful update direction; no new free parameters or invented entities are mentioned in the abstract.

axioms (2)

domain assumption Gaussian distributions form a suitable statistical manifold for approximating posteriors in the considered dynamical systems
Implicit in all Gaussian filtering methods; stated as the setting for the NANO filter
domain assumption Natural gradient descent on this manifold yields a valid refinement of the posterior
Central to the proposed iterative procedure

pith-pipeline@v0.9.0 · 5537 in / 1338 out tokens · 76520 ms · 2026-05-08T18:37:42.729972+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

Cost / FunctionalEquation (J(x)=½(x+x⁻¹)−1 uniqueness) washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

the parameter space of probability distributions [is] a Riemannian manifold, where the Riemannian metric is defined by Fisher information matrix ... I_ij(θ) = ∂²ψ/∂θ_i∂θ_j

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

41 extracted references · 41 canonical work pages · 1 internal anchor

[1]

Bayesian filtering: From Kalman filters to particle filters, and beyond,

Z. Chenet al., “Bayesian filtering: From Kalman filters to particle filters, and beyond,”Statistics, vol. 182, no. 1, pp. 1–69, 2003

work page 2003
[2]

Measurement dissemination- based distributed bayesian filter using the latest-in-and-full-out exchange protocol for networked unmanned vehicles,

C. Liu, S. E. Li, and J. K. Hedrick, “Measurement dissemination- based distributed bayesian filter using the latest-in-and-full-out exchange protocol for networked unmanned vehicles,”IEEE Transactions on Industrial Electronics, vol. 64, no. 11, pp. 8756–8766, 2017

work page 2017
[3]

A new approach to linear filtering and prediction problems,

R. Kalman, “A new approach to linear filtering and prediction problems,” Journal of Basic Engineering, vol. 82, no. 1, pp. 35–45, 1960

work page 1960
[4]

S ¨arkk¨a and L

S. S ¨arkk¨a and L. Svensson,Bayesian filtering and smoothing, vol. 17. Cambridge university press, 2023

work page 2023
[5]

Particle filters: A hands- on tutorial,

J. Elfring, E. Torta, and R. van de Molengraft, “Particle filters: A hands- on tutorial,”Sensors, vol. 21, no. 2, p. 438, 2021

work page 2021
[6]

Probabilistic robotics,

S. Thrun, “Probabilistic robotics,”Communications of the ACM, vol. 45, no. 3, pp. 52–57, 2002

work page 2002
[7]

G. L. Smith, S. F. Schmidt, and L. A. McGee,Application of statistical filter theory to the optimal estimation of position and velocity on board a circumlunar vehicle, vol. 135. National Aeronautics and Space Administration, 1962

work page 1962
[8]

An assessment of the navigation and course corrections for a manned flyby of mars or venus,

B. A. McElhoe, “An assessment of the navigation and course corrections for a manned flyby of mars or venus,”IEEE Transactions on Aerospace and Electronic Systems, no. 4, pp. 613–623, 1966

work page 1966
[9]

Gelbet al.,Applied optimal estimation

A. Gelbet al.,Applied optimal estimation. MIT press, 1974

work page 1974
[10]

The iterated Kalman filter update as a Gauss-newton method,

B. M. Bell and F. W. Cathey, “The iterated Kalman filter update as a Gauss-newton method,”IEEE Transactions on Automatic Control, vol. 38, no. 2, pp. 294–297, 1993

work page 1993
[11]

A new approach for filtering nonlinear systems,

S. J. Julier, J. K. Uhlmann, and H. F. Durrant-Whyte, “A new approach for filtering nonlinear systems,” in1995 American Control Conference (ACC), vol. 3, pp. 1628–1632, 1995

work page 1995
[12]

Discrete-time nonlinear filtering algorithms using Gauss–hermite quadrature,

I. Arasaratnam, S. Haykin, and R. J. Elliott, “Discrete-time nonlinear filtering algorithms using Gauss–hermite quadrature,”Proceedings of the IEEE, vol. 95, no. 5, pp. 953–977, 2007

work page 2007
[13]

Cubature Kalman filters,

I. Arasaratnam and S. Haykin, “Cubature Kalman filters,”IEEE Trans- actions on Automatic Control, vol. 54, no. 6, pp. 1254–1269, 2009

work page 2009
[14]

Nonlinear bayesian filtering with natural gradient gaussian approximation,

W. Cao, T. Zhang, Z. Sun, C. Liu, S. S.-T. Yau, and S. E. Li, “Nonlinear bayesian filtering with natural gradient gaussian approximation,”IEEE Transactions on Pattern Analysis and Machine Intelligence, 2026 (online in arXiv, Oct. 2024)

work page 2026
[15]

Algorithm design and comparative test of natural gradient gaussian approximation filter,

W. Cao, T. Zhang, and S. E. Li, “Algorithm design and comparative test of natural gradient gaussian approximation filter,” inIFAC Modeling Estimation and Control Conference (MECC), pp. 1–7, 2025

work page 2025
[16]

Nonlinear Kalman filtering with divergence minimization,

S. Gultekin and J. Paisley, “Nonlinear Kalman filtering with divergence minimization,”IEEE Transactions on Signal Processing, vol. 65, no. 23, pp. 6319–6331, 2017

work page 2017
[17]

An optimization-centric view on bayes’ rule: Reviewing and generalizing variational inference,

J. Knoblauch, J. Jewson, and T. Damoulas, “An optimization-centric view on bayes’ rule: Reviewing and generalizing variational inference,” Journal of Machine Learning Research, vol. 23, no. 132, pp. 1–109, 2022

work page 2022
[18]

Generalized moving hori- zon estimation for nonlinear systems with robustness to measurement outliers,

W. Cao, C. Liu, Z. Lan, Y . Piao, and S. E. Li, “Generalized moving hori- zon estimation for nonlinear systems with robustness to measurement outliers,” in2023 American Control Conference (ACC), pp. 1614–1621, 2023

work page 2023
[19]

Ro- bust bayesian inference for moving horizon estimation,

W. Cao, C. Liu, Z. Lan, S. E. Li, W. Pan, and A. Alessandri, “Ro- bust bayesian inference for moving horizon estimation,”Automatica, vol. 173, p. 112108, 2025

work page 2025
[20]

Information and accuracy attainable in the estimation of statistical parameters,

C. R. Rao, “Information and accuracy attainable in the estimation of statistical parameters,”Bulletin of the Calcutta Mathematical Society, vol. 37, no. 3, pp. 81–91, 1945

work page 1945
[21]

The utilization of multiple measurements in problems of biological classification,

C. R. Rao, “The utilization of multiple measurements in problems of biological classification,”Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol. 10, no. 2, pp. 159–193, 1948

work page 1948
[22]

Amari and H

S. Amari and H. Nagaoka,Methods of Information Geometry, vol. 191 ofTranslations of Mathematical Monographs. American Mathematical Society (Translations of Mathematical Monographs), 2000

work page 2000
[23]

N. N. Chentsov,Statistical decision rules and optimal inference. Amer- ican Mathematical Society, 1982

work page 1982
[24]

Transformations des signaux al ´eatoires a travers les sys- temes non lin´eaires sans m´emoire,

G. Bonnet, “Transformations des signaux al ´eatoires a travers les sys- temes non lin´eaires sans m´emoire,” inAnnales des T ´el´ecommunications, vol. 19, pp. 203–220, Springer, 1964

work page 1964
[25]

A useful theorem for nonlinear devices having Gaussian inputs,

R. Price, “A useful theorem for nonlinear devices having Gaussian inputs,”IRE Transactions on Information Theory, vol. 4, no. 2, pp. 69– 72, 1958

work page 1958
[26]

Unscented filtering and nonlinear estimation,

S. J. Julier and J. K. Uhlmann, “Unscented filtering and nonlinear estimation,”Proceedings of the IEEE, vol. 92, no. 3, pp. 401–422, 2004

work page 2004
[27]

Natural gradient gaussian approx- imation filter with positive definiteness guarantee,

T. Zhang, W. Cao, and S. E. Li, “Natural gradient gaussian approx- imation filter with positive definiteness guarantee,” in2026 American Control Conference (ACC), 2026

work page 2026
[28]

Tractable structured natural-gradient descent using local parameterizations,

W. Lin, F. Nielsen, K. M. Emtiyaz, and M. Schmidt, “Tractable structured natural-gradient descent using local parameterizations,” in International Conference on Machine Learning, pp. 6680–6691, 2021

work page 2021
[29]

Gaussian variational inference with covariance constraints applied to range-only localization,

A. Goudar, W. Zhao, T. D. Barfoot, and A. P. Schoellig, “Gaussian variational inference with covariance constraints applied to range-only localization,” in2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 2872–2879, IEEE, 2022

work page 2022
[30]

T. D. Barfoot,State estimation for robotics. Cambridge University Press, 2024

work page 2024
[31]

The invariant extended Kalman filter as a stable observer,

A. Barrau and S. Bonnabel, “The invariant extended Kalman filter as a stable observer,”IEEE Transactions on Automatic Control, vol. 62, no. 4, pp. 1797–1812, 2016. 18

work page 2016
[32]

Contact- aided invariant extended Kalman filtering for robot state estimation,

R. Hartley, M. Ghaffari, R. M. Eustice, and J. W. Grizzle, “Contact- aided invariant extended Kalman filtering for robot state estimation,”The International Journal of Robotics Research, vol. 39, no. 4, pp. 402–430, 2020

work page 2020
[33]

Nano- slam: natural gradient gaussian approximation for vehicle slam,

T. Zhang, W. Cao, C. Liu, F. Zhang, W. Wu, and S. E. Li, “Nano- slam: natural gradient gaussian approximation for vehicle slam,” in 2025 IEEE 28th International Conference on Intelligent Transportation Systems (ITSC), pp. 4014–4019, 2025

work page 2025
[34]

Fast-lio: A fast, robust lidar-inertial odometry package by tightly-coupled iterated kalman filter,

W. Xu and F. Zhang, “Fast-lio: A fast, robust lidar-inertial odometry package by tightly-coupled iterated kalman filter,”IEEE Robotics and Automation Letters, vol. 6, no. 2, pp. 3317–3324, 2021

work page 2021
[35]

A code for unscented kalman filtering on manifolds (ukf-m),

M. Brossard, A. Barrau, and S. Bonnabel, “A code for unscented kalman filtering on manifolds (ukf-m),” in2020 IEEE International Conference on Robotics and Automation (ICRA), pp. 5701–5708, 2020

work page 2020
[36]

Convergence and consistency analysis for extended kalman filter based slam,

S. Huang and G. Dissanayake, “Convergence and consistency analysis for extended kalman filter based slam,”IEEE Transactions on robotics, vol. 23, no. 5, pp. 1036–1049, 2007

work page 2007
[37]

Unscented fastslam: a robust and efficient solution to the slam problem,

C. Kim, R. Sakthivel, and W. K. Chung, “Unscented fastslam: a robust and efficient solution to the slam problem,”IEEE Transactions on Robotics, vol. 24, no. 4, pp. 808–820, 2008

work page 2008
[38]

Natural Gradient Gaussian Approximation Filter on Lie Groups for Robot State Estimation

T. Zhang, W. Cao, C. Liu, Y . Lyu, and S. E. Li, “Natural gradient gaussian approximation filter on lie groups for robot state estimation,” arXiv preprint arXiv:2604.10057, 2026

work page internal anchor Pith review Pith/arXiv arXiv 2026
[39]

State estimation for legged robots: Consistent fusion of leg kinematics and IMU,

M. Bloesch, M. Hutter, M. A. Hoepflinger, S. Leutenegger, C. Gehring, C. D. Remy, and R. Siegwart, “State estimation for legged robots: Consistent fusion of leg kinematics and IMU,”Robotics: Science and Systems VIII, p. 17, 2013

work page 2013
[40]

A tutorial on quantitative trajectory eval- uation for visual (-inertial) odometry,

Z. Zhang and D. Scaramuzza, “A tutorial on quantitative trajectory eval- uation for visual (-inertial) odometry,” in2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 7244–7251, IEEE, 2018

work page 2018
[41]

Invariant smoother for legged robot state estimation with dynamic contact event information,

Z. Yoon, J.-H. Kim, and H.-W. Park, “Invariant smoother for legged robot state estimation with dynamic contact event information,”IEEE Transactions on Robotics, vol. 40, pp. 193–212, 2023

work page 2023

[1] [1]

Bayesian filtering: From Kalman filters to particle filters, and beyond,

Z. Chenet al., “Bayesian filtering: From Kalman filters to particle filters, and beyond,”Statistics, vol. 182, no. 1, pp. 1–69, 2003

work page 2003

[2] [2]

Measurement dissemination- based distributed bayesian filter using the latest-in-and-full-out exchange protocol for networked unmanned vehicles,

C. Liu, S. E. Li, and J. K. Hedrick, “Measurement dissemination- based distributed bayesian filter using the latest-in-and-full-out exchange protocol for networked unmanned vehicles,”IEEE Transactions on Industrial Electronics, vol. 64, no. 11, pp. 8756–8766, 2017

work page 2017

[3] [3]

A new approach to linear filtering and prediction problems,

R. Kalman, “A new approach to linear filtering and prediction problems,” Journal of Basic Engineering, vol. 82, no. 1, pp. 35–45, 1960

work page 1960

[4] [4]

S ¨arkk¨a and L

S. S ¨arkk¨a and L. Svensson,Bayesian filtering and smoothing, vol. 17. Cambridge university press, 2023

work page 2023

[5] [5]

Particle filters: A hands- on tutorial,

J. Elfring, E. Torta, and R. van de Molengraft, “Particle filters: A hands- on tutorial,”Sensors, vol. 21, no. 2, p. 438, 2021

work page 2021

[6] [6]

Probabilistic robotics,

S. Thrun, “Probabilistic robotics,”Communications of the ACM, vol. 45, no. 3, pp. 52–57, 2002

work page 2002

[7] [7]

G. L. Smith, S. F. Schmidt, and L. A. McGee,Application of statistical filter theory to the optimal estimation of position and velocity on board a circumlunar vehicle, vol. 135. National Aeronautics and Space Administration, 1962

work page 1962

[8] [8]

An assessment of the navigation and course corrections for a manned flyby of mars or venus,

B. A. McElhoe, “An assessment of the navigation and course corrections for a manned flyby of mars or venus,”IEEE Transactions on Aerospace and Electronic Systems, no. 4, pp. 613–623, 1966

work page 1966

[9] [9]

Gelbet al.,Applied optimal estimation

A. Gelbet al.,Applied optimal estimation. MIT press, 1974

work page 1974

[10] [10]

The iterated Kalman filter update as a Gauss-newton method,

B. M. Bell and F. W. Cathey, “The iterated Kalman filter update as a Gauss-newton method,”IEEE Transactions on Automatic Control, vol. 38, no. 2, pp. 294–297, 1993

work page 1993

[11] [11]

A new approach for filtering nonlinear systems,

S. J. Julier, J. K. Uhlmann, and H. F. Durrant-Whyte, “A new approach for filtering nonlinear systems,” in1995 American Control Conference (ACC), vol. 3, pp. 1628–1632, 1995

work page 1995

[12] [12]

Discrete-time nonlinear filtering algorithms using Gauss–hermite quadrature,

I. Arasaratnam, S. Haykin, and R. J. Elliott, “Discrete-time nonlinear filtering algorithms using Gauss–hermite quadrature,”Proceedings of the IEEE, vol. 95, no. 5, pp. 953–977, 2007

work page 2007

[13] [13]

Cubature Kalman filters,

I. Arasaratnam and S. Haykin, “Cubature Kalman filters,”IEEE Trans- actions on Automatic Control, vol. 54, no. 6, pp. 1254–1269, 2009

work page 2009

[14] [14]

Nonlinear bayesian filtering with natural gradient gaussian approximation,

W. Cao, T. Zhang, Z. Sun, C. Liu, S. S.-T. Yau, and S. E. Li, “Nonlinear bayesian filtering with natural gradient gaussian approximation,”IEEE Transactions on Pattern Analysis and Machine Intelligence, 2026 (online in arXiv, Oct. 2024)

work page 2026

[15] [15]

Algorithm design and comparative test of natural gradient gaussian approximation filter,

W. Cao, T. Zhang, and S. E. Li, “Algorithm design and comparative test of natural gradient gaussian approximation filter,” inIFAC Modeling Estimation and Control Conference (MECC), pp. 1–7, 2025

work page 2025

[16] [16]

Nonlinear Kalman filtering with divergence minimization,

S. Gultekin and J. Paisley, “Nonlinear Kalman filtering with divergence minimization,”IEEE Transactions on Signal Processing, vol. 65, no. 23, pp. 6319–6331, 2017

work page 2017

[17] [17]

An optimization-centric view on bayes’ rule: Reviewing and generalizing variational inference,

J. Knoblauch, J. Jewson, and T. Damoulas, “An optimization-centric view on bayes’ rule: Reviewing and generalizing variational inference,” Journal of Machine Learning Research, vol. 23, no. 132, pp. 1–109, 2022

work page 2022

[18] [18]

Generalized moving hori- zon estimation for nonlinear systems with robustness to measurement outliers,

W. Cao, C. Liu, Z. Lan, Y . Piao, and S. E. Li, “Generalized moving hori- zon estimation for nonlinear systems with robustness to measurement outliers,” in2023 American Control Conference (ACC), pp. 1614–1621, 2023

work page 2023

[19] [19]

Ro- bust bayesian inference for moving horizon estimation,

W. Cao, C. Liu, Z. Lan, S. E. Li, W. Pan, and A. Alessandri, “Ro- bust bayesian inference for moving horizon estimation,”Automatica, vol. 173, p. 112108, 2025

work page 2025

[20] [20]

Information and accuracy attainable in the estimation of statistical parameters,

C. R. Rao, “Information and accuracy attainable in the estimation of statistical parameters,”Bulletin of the Calcutta Mathematical Society, vol. 37, no. 3, pp. 81–91, 1945

work page 1945

[21] [21]

The utilization of multiple measurements in problems of biological classification,

C. R. Rao, “The utilization of multiple measurements in problems of biological classification,”Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol. 10, no. 2, pp. 159–193, 1948

work page 1948

[22] [22]

Amari and H

S. Amari and H. Nagaoka,Methods of Information Geometry, vol. 191 ofTranslations of Mathematical Monographs. American Mathematical Society (Translations of Mathematical Monographs), 2000

work page 2000

[23] [23]

N. N. Chentsov,Statistical decision rules and optimal inference. Amer- ican Mathematical Society, 1982

work page 1982

[24] [24]

Transformations des signaux al ´eatoires a travers les sys- temes non lin´eaires sans m´emoire,

G. Bonnet, “Transformations des signaux al ´eatoires a travers les sys- temes non lin´eaires sans m´emoire,” inAnnales des T ´el´ecommunications, vol. 19, pp. 203–220, Springer, 1964

work page 1964

[25] [25]

A useful theorem for nonlinear devices having Gaussian inputs,

R. Price, “A useful theorem for nonlinear devices having Gaussian inputs,”IRE Transactions on Information Theory, vol. 4, no. 2, pp. 69– 72, 1958

work page 1958

[26] [26]

Unscented filtering and nonlinear estimation,

S. J. Julier and J. K. Uhlmann, “Unscented filtering and nonlinear estimation,”Proceedings of the IEEE, vol. 92, no. 3, pp. 401–422, 2004

work page 2004

[27] [27]

Natural gradient gaussian approx- imation filter with positive definiteness guarantee,

T. Zhang, W. Cao, and S. E. Li, “Natural gradient gaussian approx- imation filter with positive definiteness guarantee,” in2026 American Control Conference (ACC), 2026

work page 2026

[28] [28]

Tractable structured natural-gradient descent using local parameterizations,

W. Lin, F. Nielsen, K. M. Emtiyaz, and M. Schmidt, “Tractable structured natural-gradient descent using local parameterizations,” in International Conference on Machine Learning, pp. 6680–6691, 2021

work page 2021

[29] [29]

Gaussian variational inference with covariance constraints applied to range-only localization,

A. Goudar, W. Zhao, T. D. Barfoot, and A. P. Schoellig, “Gaussian variational inference with covariance constraints applied to range-only localization,” in2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 2872–2879, IEEE, 2022

work page 2022

[30] [30]

T. D. Barfoot,State estimation for robotics. Cambridge University Press, 2024

work page 2024

[31] [31]

The invariant extended Kalman filter as a stable observer,

A. Barrau and S. Bonnabel, “The invariant extended Kalman filter as a stable observer,”IEEE Transactions on Automatic Control, vol. 62, no. 4, pp. 1797–1812, 2016. 18

work page 2016

[32] [32]

Contact- aided invariant extended Kalman filtering for robot state estimation,

R. Hartley, M. Ghaffari, R. M. Eustice, and J. W. Grizzle, “Contact- aided invariant extended Kalman filtering for robot state estimation,”The International Journal of Robotics Research, vol. 39, no. 4, pp. 402–430, 2020

work page 2020

[33] [33]

Nano- slam: natural gradient gaussian approximation for vehicle slam,

T. Zhang, W. Cao, C. Liu, F. Zhang, W. Wu, and S. E. Li, “Nano- slam: natural gradient gaussian approximation for vehicle slam,” in 2025 IEEE 28th International Conference on Intelligent Transportation Systems (ITSC), pp. 4014–4019, 2025

work page 2025

[34] [34]

Fast-lio: A fast, robust lidar-inertial odometry package by tightly-coupled iterated kalman filter,

W. Xu and F. Zhang, “Fast-lio: A fast, robust lidar-inertial odometry package by tightly-coupled iterated kalman filter,”IEEE Robotics and Automation Letters, vol. 6, no. 2, pp. 3317–3324, 2021

work page 2021

[35] [35]

A code for unscented kalman filtering on manifolds (ukf-m),

M. Brossard, A. Barrau, and S. Bonnabel, “A code for unscented kalman filtering on manifolds (ukf-m),” in2020 IEEE International Conference on Robotics and Automation (ICRA), pp. 5701–5708, 2020

work page 2020

[36] [36]

Convergence and consistency analysis for extended kalman filter based slam,

S. Huang and G. Dissanayake, “Convergence and consistency analysis for extended kalman filter based slam,”IEEE Transactions on robotics, vol. 23, no. 5, pp. 1036–1049, 2007

work page 2007

[37] [37]

Unscented fastslam: a robust and efficient solution to the slam problem,

C. Kim, R. Sakthivel, and W. K. Chung, “Unscented fastslam: a robust and efficient solution to the slam problem,”IEEE Transactions on Robotics, vol. 24, no. 4, pp. 808–820, 2008

work page 2008

[38] [38]

Natural Gradient Gaussian Approximation Filter on Lie Groups for Robot State Estimation

T. Zhang, W. Cao, C. Liu, Y . Lyu, and S. E. Li, “Natural gradient gaussian approximation filter on lie groups for robot state estimation,” arXiv preprint arXiv:2604.10057, 2026

work page internal anchor Pith review Pith/arXiv arXiv 2026

[39] [39]

State estimation for legged robots: Consistent fusion of leg kinematics and IMU,

M. Bloesch, M. Hutter, M. A. Hoepflinger, S. Leutenegger, C. Gehring, C. D. Remy, and R. Siegwart, “State estimation for legged robots: Consistent fusion of leg kinematics and IMU,”Robotics: Science and Systems VIII, p. 17, 2013

work page 2013

[40] [40]

A tutorial on quantitative trajectory eval- uation for visual (-inertial) odometry,

Z. Zhang and D. Scaramuzza, “A tutorial on quantitative trajectory eval- uation for visual (-inertial) odometry,” in2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 7244–7251, IEEE, 2018

work page 2018

[41] [41]

Invariant smoother for legged robot state estimation with dynamic contact event information,

Z. Yoon, J.-H. Kim, and H.-W. Park, “Invariant smoother for legged robot state estimation with dynamic contact event information,”IEEE Transactions on Robotics, vol. 40, pp. 193–212, 2023

work page 2023