Mitigating Overconfidence in Nonlinear Kalman Filters via Covariance Recalibration
Pith reviewed 2026-05-23 23:07 UTC · model grok-4.3
The pith
Nonlinear measurements cause the standard Kalman filter update to underestimate true posterior covariance.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Under nonlinear measurements the conventional Kalman filter update framework inherently tends to underestimate the true posterior covariance. The covariance-recalibrated framework re-approximates the measurement model after the state update to capture the actual effect of the Kalman gain on the posterior covariance; when the recalibration indicates that an update is harmful the update is withdrawn. The framework combines with any existing nonlinear KF and improves state and covariance estimation accuracy.
What carries the argument
The covariance recalibration procedure that re-approximates the measurement model after the state update to better capture the actual effect of the Kalman gain.
If this is right
- The recalibrated framework applies unchanged to the extended, unscented, and cubature Kalman filters.
- An update is withdrawn when recalibration shows it would increase covariance error.
- Both state estimates and covariance estimates improve when the framework is added to existing nonlinear filters.
- Error reductions of several orders of magnitude occur in the tested applications.
Where Pith is reading between the lines
- The same underestimation mechanism may help explain observed divergence of nonlinear filters on strongly nonlinear problems.
- A similar post-update recalibration step could be tested on particle filters and other sequential Bayesian estimators.
- Direct comparison of reported covariances against empirical covariances obtained from repeated Monte Carlo runs on standard nonlinear benchmarks would provide an immediate empirical check.
Load-bearing premise
The conventional Kalman filter update equations can be applied directly to nonlinear measurement models without introducing systematic bias into the reported posterior covariance.
What would settle it
Compute the true posterior covariance by Monte Carlo sampling on a nonlinear measurement model and show that it is larger than the covariance returned by an unmodified nonlinear KF on the same data.
Figures
read the original abstract
The Kalman filter (KF) is an optimal linear state estimator for linear systems, and numerous extensions, including the extended Kalman filter (EKF), unscented Kalman filter (UKF), and cubature Kalman filter (CKF), have been developed for nonlinear systems. Although these nonlinear KFs differ in how they approximate nonlinear transformations, they all retain the same update framework as the linear KF. In this paper, we show that, under nonlinear measurements, this conventional framework inherently tends to underestimate the true posterior covariance, leading to overconfident covariance estimates. To the best of our knowledge, this is the first work to provide a mathematical proof of this systematic covariance underestimation in a general nonlinear KF framework. Motivated by this analysis, we propose a covariance-recalibrated framework that re-approximates the measurement model after the state update to better capture the actual effect of the Kalman gain on the posterior covariance; when recalibration indicates that an update is harmful, the update can be withdrawn. The proposed framework can be combined with essentially any existing nonlinear KF, and simulations across four nonlinear KFs and five applications show that it substantially improves both state and covariance estimation accuracy, often reducing errors by several orders of magnitude. The code and supplementary material are available at https://github.com/Shida-Jiang/A-new-framework-for-nonlinear-Kalman-filters.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript claims that the standard Kalman update equations retained by nonlinear filters (EKF, UKF, CKF, etc.) systematically underestimate the true posterior covariance whenever the measurement function is nonlinear, provides a mathematical proof of this underestimation, and introduces a covariance-recalibration procedure that re-approximates the measurement model after the state update (optionally withdrawing the update). The recalibrated framework is asserted to be compatible with any existing nonlinear KF; simulations across four filters and five applications are reported to yield substantial gains in both state and covariance accuracy.
Significance. A general, assumption-light proof that the conventional update produces overconfident covariances under nonlinearity, together with a practical fix and open code, would be a meaningful contribution to nonlinear filtering. The empirical claim of order-of-magnitude error reductions, if supported by proper baselines and statistics, would strengthen the practical case.
major comments (2)
- [Proof of underestimation (likely §3 or §4)] The central claim of a 'mathematical proof of this systematic covariance underestimation in a general nonlinear KF framework' requires an explicit statement of the function class for which the inequality holds. For arbitrary nonlinear h(·) the true posterior covariance is obtained by integrating a non-Gaussian likelihood against the prior and does not admit a closed-form comparison to the KF update; any strict underestimation result must invoke additional structure (convexity, bounded Hessian, local linearity, etc.). The manuscript must identify this structure and verify that it is not tacitly assumed.
- [Simulation results (likely §5)] The simulation section reports 'often reducing errors by several orders of magnitude' across four filters and five applications, yet the abstract and available description give no indication of error bars, statistical significance tests, or explicit baseline comparisons (e.g., against existing covariance-inflation or robust KF variants). Without these, the magnitude of the reported gains cannot be evaluated and the cross-application claim is not yet secured.
minor comments (1)
- [Notation and framework definition] Notation for the recalibrated quantities (e.g., P_r^+, K_r) should be introduced once and used consistently; currently the distinction between original and recalibrated variables is occasionally ambiguous.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback. The comments help clarify the scope of the theoretical result and strengthen the empirical evaluation. We address each major comment below.
read point-by-point responses
-
Referee: [Proof of underestimation (likely §3 or §4)] The central claim of a 'mathematical proof of this systematic covariance underestimation in a general nonlinear KF framework' requires an explicit statement of the function class for which the inequality holds. For arbitrary nonlinear h(·) the true posterior covariance is obtained by integrating a non-Gaussian likelihood against the prior and does not admit a closed-form comparison to the KF update; any strict underestimation result must invoke additional structure (convexity, bounded Hessian, local linearity, etc.). The manuscript must identify this structure and verify that it is not tacitly assumed.
Authors: We agree that a strict inequality for completely arbitrary nonlinear h(·) cannot be stated in closed form without additional structure, as the true posterior is generally non-Gaussian. Our derivation is performed inside the standard Gaussian approximation framework employed by nonlinear KFs (EKF linearization, UKF/CKF sigma-point moment matching). Under these approximations the conventional update produces a covariance that is smaller than the covariance implied by the same approximated measurement model after the gain is applied. We will revise §3 to explicitly state the function class (C² functions whose Hessian is bounded in a neighborhood of the operating point) and add a short verification paragraph confirming these conditions are the same ones already required for the consistency of EKF/UKF/CKF. A clarifying sentence will also be added that the claimed underestimation holds relative to the filter’s own Gaussian approximation rather than the exact Bayesian posterior. revision: yes
-
Referee: [Simulation results (likely §5)] The simulation section reports 'often reducing errors by several orders of magnitude' across four filters and five applications, yet the abstract and available description give no indication of error bars, statistical significance tests, or explicit baseline comparisons (e.g., against existing covariance-inflation or robust KF variants). Without these, the magnitude of the reported gains cannot be evaluated and the cross-application claim is not yet secured.
Authors: We accept that the current presentation of the numerical results lacks the statistical detail needed to substantiate the magnitude of the improvements. In the revised manuscript we will (i) report mean and standard deviation over 100 Monte-Carlo runs for every metric, (ii) include paired statistical tests (Wilcoxon signed-rank) against the baseline filters, and (iii) add direct comparisons with two representative covariance-inflation schemes and one robust KF variant. These additions will be placed in §5 and summarized in the abstract. revision: yes
Circularity Check
No significant circularity; proof and recalibration are independently derived
full rationale
The paper's central chain consists of (1) a claimed mathematical proof that the standard KF update underestimates posterior covariance for nonlinear measurements, and (2) a recalibration procedure motivated by that analysis. No equations or steps reduce by construction to fitted parameters, self-definitions, or self-citations. The recalibration re-approximates the measurement model post-update and is not statistically forced by the underestimation proof itself. The provided abstract and reader summary contain no load-bearing self-citations or ansatz smuggling. This is the common case of an independent derivation.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Nonlinear Kalman filters retain the same update framework as the linear KF
invented entities (1)
-
covariance-recalibrated framework
no independent evidence
Forward citations
Cited by 2 Pith papers
-
CAR-EnKF: A Covariance-Adaptive and Recalibrated Ensemble Kalman Filter Framework
CAR-EnKF reduces RMSE versus standard EnKF by recalibrating the Kalman gain effect and adding an online-tuned covariance compensation term that activates only under measurement nonlinearity.
-
Assumed Density Filtering and Smoothing with Neural Network Surrogate Models
Assumed density filtering and smoothing for neural network surrogate models is enabled by analytic computation of output moments, yielding more accurate state estimates and improved LQR performance on stochastic Loren...
Reference graph
Works this paper leans on
-
[1]
Brian DO Anderson and John B Moore. Optimal filtering. Courier Corporation, 2005
work page 2005
-
[2]
Critical remarks on the linearised and extended kalman filters with geodetic navigation examples
Dah-Jing Jwo and Ta-Shun Cho. Critical remarks on the linearised and extended kalman filters with geodetic navigation examples. Measurement, 43(9):1077–1089, 2010
work page 2010
-
[3]
Disturbance observer and kalman filter based motion control realization
Thao Tran Phuong, Kiyoshi Ohishi, Chowarit Mitsantisuk, Yuki Yokokura, Kouhei Ohnishi, Roberto Oboe, and Asif Sabanovic. Disturbance observer and kalman filter based motion control realization. IEEJ Journal of Industry Applications, 7(1):1–14, 2018
work page 2018
-
[4]
[9] application of the kalman filter to computational problems in statistics
Emery N Brown and Christopher H Schmid. [9] application of the kalman filter to computational problems in statistics. In Methods in enzymology, volume 240, pages 171–181. Elsevier, 1994
work page 1994
-
[5]
The ensemble kalman filter: a signal processing perspective
Michael Roth, Gustaf Hendeby, Carsten Fritsche, and Fredrik Gustafsson. The ensemble kalman filter: a signal processing perspective. EURASIP Journal on Advances in Signal Processing, 2017:1–16, 2017
work page 2017
-
[6]
Nonlinear Filtering: Methods and Applications
Kumar Pakki Bharani Chandra and Da-Wei Gu. Nonlinear Filtering: Methods and Applications . Springer, 2019
work page 2019
- [7]
-
[8]
New extension of the kalman filter to nonlinear systems
Simon J Julier and Jeffrey K Uhlmann. New extension of the kalman filter to nonlinear systems. In Signal processing, sensor fusion, and target recognition VI, volume 3068, pages 182–193. Spie, 1997
work page 1997
-
[9]
The unscented kalman filter for nonlinear estimation
Eric A Wan and Rudolph Van Der Merwe. The unscented kalman filter for nonlinear estimation. In Proceedings of the IEEE 2000 adaptive systems for signal processing, communications, and control symposium (Cat. No. 00EX373), pages 153–158. Ieee, 2000
work page 2000
-
[10]
Ienkaran Arasaratnam and Simon Haykin. Cubature kalman filters. IEEE Transactions on automatic control, 54(6):1254– 1269, 2009. 11
work page 2009
-
[11]
Discrete-time nonlinear filtering algorithms using gauss– hermite quadrature
Ienkaran Arasaratnam, Simon Haykin, and Robert J Elliott. Discrete-time nonlinear filtering algorithms using gauss– hermite quadrature. Proceedings of the IEEE, 95(5):953–977, 2007
work page 2007
-
[12]
Square-root quadrature kalman filtering
Ienkaran Arasaratnam and Simon Haykin. Square-root quadrature kalman filtering. IEEE Transactions on Signal Processing, 56(6):2589–2593, 2008
work page 2008
-
[13]
Some relations between extended and unscented kalman filters
Fredrik Gustafsson and Gustaf Hendeby. Some relations between extended and unscented kalman filters. IEEE Transactions on Signal Processing, 60(2):545–555, 2011
work page 2011
-
[14]
Analysis of kalman filter approximations for nonlinear measurements
Mark R Morelande and Angel F Garcia-Fernandez. Analysis of kalman filter approximations for nonlinear measurements. IEEE Transactions on Signal Processing, 61(22):5477–5484, 2013
work page 2013
-
[15]
Convergence analysis of nonlinear kalman filters with novel innovation-based method
Shiyuan Wang, Wanli Wang, Badong Chen, and K Tse Chi. Convergence analysis of nonlinear kalman filters with novel innovation-based method. Neurocomputing, 289:188– 194, 2018
work page 2018
-
[16]
Kumar Pakki Bharani Chandra and Da-Wei Gu. Nonlinear filtering. Cham, Switzerland: Springer , 2019
work page 2019
-
[17]
Daisuke Hotta and Yoichiro Ota. Why does enkf suffer from analysis overconfidence? an insight into exploiting the ever- increasing volume of observations. Quarterly Journal of the Royal Meteorological Society, 147(735):1258–1277, 2021
work page 2021
-
[18]
Performance evaluation of ukf-based nonlinear filtering
Kai Xiong, HY Zhang, and CW Chan. Performance evaluation of ukf-based nonlinear filtering. Automatica, 42(2):261–270, 2006
work page 2006
-
[19]
The iterated kalman filter update as a gauss-newton method
Bradley M Bell and Frederick W Cathey. The iterated kalman filter update as a gauss-newton method. IEEE Transactions on Automatic Control, 38(2):294–297, 1993
work page 1993
-
[20]
Iterated unscented kalman filter for passive target tracking
Ronghui Zhan and Jianwei Wan. Iterated unscented kalman filter for passive target tracking. IEEE Transactions on Aerospace and Electronic Systems, 43(3):1155–1163, 2007
work page 2007
-
[21]
Iterated cubature kalman filter and its application
Jing Mu and Yuan-li Cai. Iterated cubature kalman filter and its application. In 2011 IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems, pages 33–37. IEEE, 2011
work page 2011
-
[22]
Performance evaluation of iterated extended kalman filter with variable step-length
Jindˇ rich Havl´ ık and Ondˇ rej Straka. Performance evaluation of iterated extended kalman filter with variable step-length. In Journal of Physics: Conference Series , volume 659, page 012022. IOP Publishing, 2015
work page 2015
-
[23]
Marginalised iterated unscented kalman filter
Lubin Chang, Baiqing Hu, Guobin Chang, and An Li. Marginalised iterated unscented kalman filter. IET Control Theory & Applications, 6(6):847–854, 2012
work page 2012
-
[24]
Alternative framework for the iterated unscented kalman filter
Guobin Chang, Tianhe Xu, and Qianxin Wang. Alternative framework for the iterated unscented kalman filter. IET Signal Processing, 11(3):258–264, 2017
work page 2017
-
[25]
On unscented kalman filtering for state estimation of continuous-time nonlinear systems
Simo Sarkka. On unscented kalman filtering for state estimation of continuous-time nonlinear systems. IEEE Transactions on automatic control, 52(9):1631–1641, 2007
work page 2007
-
[26]
Introduction to kalman filter and its applications
Youngjoo Kim, Hyochoong Bang, et al. Introduction to kalman filter and its applications. Introduction and Implementations of the Kalman Filter , 1:1–16, 2018
work page 2018
-
[27]
Esmaeil Ghahremani and Innocent Kamwa. Online state estimation of a synchronous generator using unscented kalman filter from phasor measurements units. IEEE Transactions on Energy Conversion, 26(4):1099–1108, 2011
work page 2011
-
[28]
Shida Jiang, Junzhe Shi, Manashita Borah, and Scott Moura. Weaknesses and improvements of the extended kalman filter for battery state-of-charge and state-of-health estimation. In 2024 American Control Conference (ACC), pages 1441–1448. IEEE, 2024. A Supplementary proof for Theorem 1 Lemma 3 Let (Ω, F , P) be an underlying probability space. Assume the ra...
work page 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.