Approximations of the Mortensen observer using higher order extended Kalman filters
Pith reviewed 2026-05-07 15:41 UTC · model grok-4.3
The pith
Polynomial approximations to the Mortensen observer arise from truncating higher-order derivatives of the value function along the observer trajectory.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
A polynomial approximation of the minimum energy estimator, also called Mortensen observer, is discussed. The method relies on successive differentiations of an underlying value function and the Hamilton-Jacobi-Bellman equation, respectively. By means of neglecting higher order derivatives of the value function along the unknown observer trajectory, a coupled set of nonlinear tensor structured differential equations is derived. In its simplest form, the approach boils down to the well-known extended Kalman filter. Numerical experiments with polynomials up to the order eight illustrate the potential of the new approach and indicate local convergence to the Mortensen observer.
What carries the argument
The coupled set of nonlinear tensor-structured differential equations obtained by successive differentiations of the value function and Hamilton-Jacobi-Bellman equation while truncating higher-order terms along the observer trajectory.
If this is right
- The first-order case recovers the extended Kalman filter exactly.
- Increasing the polynomial degree yields a family of observers with successively higher accuracy for nonlinear systems.
- The tensor differential equations can be integrated numerically at least up to order eight.
- Experiments indicate local convergence of the approximations to the true Mortensen observer.
Where Pith is reading between the lines
- Efficient solvers for the high-dimensional tensor equations would be required before the method becomes practical for real-time estimation.
- The truncation idea could be combined with existing techniques such as sigma-point filters to improve robustness.
- Error estimates might follow from bounding the neglected higher-order terms once an approximate trajectory is known.
Load-bearing premise
Neglecting higher-order derivatives of the value function along the unknown observer trajectory produces a valid and convergent approximation to the true Mortensen observer.
What would settle it
A numerical test on a nonlinear system in which increasing the polynomial order from one to eight does not reduce the estimation error toward the exact Mortensen observer or causes the derived tensor equations to become unstable.
read the original abstract
A polynomial approximation of the minimum energy estimator, also called Mortensen observer, is discussed. The method relies on successive differentiations of an underlying value function and the Hamilton-Jacobi-Bellman equation, respectively. By means of neglecting higher order derivatives of the value function along the unknown observer trajectory, a coupled set of nonlinear tensor structured differential equations is derived. In its simplest form, the approach boils down to the well-known extended Kalman filter. Numerical experiments with polynomials up to the order eight illustrate the potential of the new approach and indicate local convergence to the Mortensen observer.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a polynomial approximation to the Mortensen minimum-energy observer obtained by successive differentiations of the underlying value function and the Hamilton-Jacobi-Bellman equation. Neglecting higher-order derivatives of the value function along the unknown observer trajectory produces a closed hierarchy of coupled nonlinear tensor ODEs; the lowest-order truncation recovers the extended Kalman filter, while higher-order truncations are examined numerically with polynomials up to degree eight, indicating local convergence.
Significance. If the neglected terms can be shown to remain small and the resulting ODEs are stable, the construction supplies a systematic, computationally feasible hierarchy of observers that extends the EKF while remaining grounded in the minimum-energy framework. The numerical tests up to order eight constitute a concrete strength, furnishing reproducible evidence that the approach can track the true Mortensen observer locally.
major comments (2)
- [§3] §3 (derivation of the tensor ODEs): the truncation that drops higher-order derivatives of the value function along the observer trajectory is introduced without an error estimate, a smallness condition, or a bound on the remainder; because this step is required to close the system and underpins the local-convergence claim, its justification is load-bearing.
- [§5] §5 (numerical experiments): the reported runs with polynomials up to order eight demonstrate apparent convergence, yet supply neither quantitative error norms against the true Mortensen trajectory, nor verification that the neglected derivatives remain negligible, nor stability analysis of the closed tensor ODEs; these omissions prevent confirmation that the observed behavior is not an artifact of the test cases.
minor comments (2)
- [§2] The tensor notation and multi-index conventions are introduced without a compact reference table or example; adding one would improve readability.
- [§5] A brief comparison of computational cost versus accuracy for successive orders would help readers assess practical utility.
Simulated Author's Rebuttal
We thank the referee for the careful reading, the positive assessment of the numerical tests up to order eight, and the identification of points where further justification would strengthen the presentation. We respond to each major comment below.
read point-by-point responses
-
Referee: [§3] §3 (derivation of the tensor ODEs): the truncation that drops higher-order derivatives of the value function along the observer trajectory is introduced without an error estimate, a smallness condition, or a bound on the remainder; because this step is required to close the system and underpins the local-convergence claim, its justification is load-bearing.
Authors: The truncation is introduced explicitly as an approximation that closes the hierarchy while remaining consistent with the minimum-energy derivation; the first-order case recovers the extended Kalman filter, whose practical utility is well established even though general a-priori error bounds are unavailable for arbitrary nonlinear systems. The manuscript does not assert a rigorous remainder estimate for the infinite hierarchy. In revision we will add a dedicated paragraph in §3 that states the local smoothness assumption on the value function under which the neglected terms are expected to be small and that clarifies the approximation character of the closed tensor ODEs. revision: partial
-
Referee: [§5] §5 (numerical experiments): the reported runs with polynomials up to order eight demonstrate apparent convergence, yet supply neither quantitative error norms against the true Mortensen trajectory, nor verification that the neglected derivatives remain negligible, nor stability analysis of the closed tensor ODEs; these omissions prevent confirmation that the observed behavior is not an artifact of the test cases.
Authors: The experiments are intended to illustrate the systematic improvement obtained by increasing polynomial degree rather than to furnish a complete numerical validation against the intractable true Mortensen observer (which would require solving the underlying HJB PDE). We will revise §5 to include (i) quantitative measures of the difference between successive polynomial approximations, (ii) explicit checks that the magnitudes of the dropped derivative terms remain small throughout the reported trajectories, and (iii) a brief discussion of the observed numerical stability of the closed tensor ODEs, which successfully integrate up to order eight in all presented cases. These additions will make the local-convergence indication more quantitative while respecting the computational motivation for the approximation. revision: yes
Circularity Check
Derivation is self-contained; no circular reductions identified
full rationale
The paper begins from the standard Hamilton-Jacobi-Bellman equation for the minimum-energy (Mortensen) observer and applies successive differentiations to generate a hierarchy of tensor ODEs. The approximation step consists of explicitly neglecting higher-order derivatives of the value function along the observer trajectory, which directly yields the claimed coupled system whose lowest-order truncation recovers the EKF. This is a conventional truncation procedure with no reduction of any derived quantity back to fitted parameters, self-definitional loops, or load-bearing self-citations. Numerical experiments up to order eight are presented as empirical illustration of local convergence rather than as a definitional identity. The derivation chain therefore remains independent of its own outputs.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The value function is sufficiently smooth to permit successive differentiations along the observer trajectory
Reference graph
Works this paper leans on
-
[1]
C. O. Aguilar and A. J. Krener,Numerical solutions to the Bellman equation of optimal control, J. Optim. Theory Appl., 160 (2014), pp. 527–552. https://doi.org/10.1007/s10957-013-0403-8
-
[2]
E. G. Al’brekht,On the optimal stabilization of nonlinear systems, J. Appl. Math. Mech., 25 (1961), p. 1254. https://doi.org/10.1016/0021-8928(61)90005-3. [3]B. W. Bader, T. G. Kolda, et al.,Tensor toolbox for matlab, Dec. 2025. www.tensortoolbox.org. [4]R. Bellman,Adaptive Control Processes: A Guided Tour, Princeton University Press, 1961
-
[3]
J. Borggaard and L. Zietsman,On approximating polynomial-quadratic regulator problems, IFAC- PapersOnLine, 54 (2021), pp. 329–334. https://doi.org/10.1016/j.ifacol.2021.06.090
-
[4]
T. Breiten and K. Kunisch,Neural network based nonlinear observers, Syst. Control Lett., 148 (2021). https://doi.org/10.1016/j.sysconle.2020.104829
-
[5]
T. Breiten, K. Kunisch, and L. Pfeiffer,Numerical study of polynomial feedback laws for a bilinear control problem, Math. Control Relat. Fields, 8 (2018), pp. 557–582. https://doi.org/10.3934/mcrf.2018023
-
[6]
,Taylor expansions of the value function associated with a bilinear optimal control problem, Ann. Inst. Henri Poincare (C) Anal. Non Lineaire, 36 (2019), pp. 1361–1399. https://doi.org/10.1016/j.anihpc.2019.01.001
-
[7]
T. Breiten, K. Kunisch, and J. Schröder,Numerical realization of the Mortensen observer via a Hessian-augmented polynomial approximation of the value function, SIAM J. Sci. Comput., 47 (2025), pp. A181–A206. https://doi.org/10.1137/23M1613773
-
[8]
Approximations of the Mortensen ob- server using higher order extended Kalman filters
T. Breiten, J. Ramme, and J. Schröder,Code for the paper "Approximations of the Mortensen ob- server using higher order extended Kalman filters", Apr. 2026. https://doi.org/10.5281/zenodo.19855072
-
[9]
T. Breiten and J. Schröder,Local well-posedness of the Mortensen observer, ESAIM - Control Optim. Calc. Var., 30 (2024). https://doi.org/10.1051/cocv/2024046
-
[10]
L.-P. Chaintron, Á. M. González, L. Mertz, and P. Moireau,Mortensen observer for a class of variational inequalities – lost equivalence with stochastic filtering approaches, ESAIM, Proc. surv., 73 (2023), pp. 130–157. https://doi.org/10.1051/proc/202373130
-
[11]
L.-P. Chaintron, L. Mertz, P. Moireau, and H. Zidani,Constrained non-linear estimation and links with stochastic filtering, 2025. https://doi.org/10.48550/arXiv.2502.01200
-
[12]
N. A. Corbin and B. Kramer,Scalable computation of H∞energy functions for poly- nomial control-affine systems, IEEE Trans. Autom. Control, 70 (2025), pp. 3088–3100. https://doi.org/10.1109/TAC.2024.3494472
-
[13]
W. H. Fleming,Deterministic nonlinear filtering, Ann. Sc. Norm. Super. Pisa, Cl. Sci., 25 (1997), pp. 435–454. 22 TOBIAS BREITEN ⋆, JUSTUS RAMME ⋆, AND JESPER SCHRÖDER†
work page 1997
-
[14]
W. H. Fleming and W. M. McEneaney,A max-plus-based algorithm for a Hamilton–Jacobi– Bellman equation of nonlinear filtering, SIAM J. Control Optim., 38 (2000), pp. 683–710. https://doi.org/10.1137/S0363012998332433
-
[15]
Hackbusch,Tensor spaces and numerical tensor calculus, vol
W. Hackbusch,Tensor spaces and numerical tensor calculus, vol. 56 of Springer Series in Computa- tional Mathematics, Springer, Cham, second ed., 2019. https://doi.org/10.1007/978-3-030-35554-8. [18]M. Hardy,Combinatorics of partial derivatives, Electron. J. Comb., 13 (2006). [19]O. Hijab,Minimum energy estimation, ph.D. dissertation, University of Califor...
-
[16]
O. Hijab,Asymptotic nonlinear filtering and large deviations, in Advances in Filtering and Optimal Sto- chastic Control, Berlin Heidelberg, 1982, Springer, pp. 170–176. https://doi.org/10.1007/BFb0004536
-
[17]
D. Jordan and P. Smith,Nonlinear Ordinary Differential Equations: An Introduction for Scientists and Engineers, vol. 10, Oxford University Press on Demand, fourth ed., 2007. https://doi.org/10.1093/oso/9780199208241.001.0001
-
[18]
B. Kramer, S. Gugercin, J. Borggaard, and L. Balicki,Scalable computation of energy functions for nonlinear balanced truncation, Comput. Methods Appl. Mech. Eng, 427 (2024), p. 117011. https://doi.org/10.1016/j.cma.2024.117011
- [19]
-
[20]
A. J. Krener,The convergence of the minimum energy estimator, in New Trends in Nonlinear Dynamics and Control and their Applications, Springer, Berlin, 2004, pp. 187–208
work page 2004
-
[21]
,Minimum energy estimation and moving horizon estimation, in 2015 54th IEEE Conference on Decision and Control (CDC), 2015, pp. 4952–4957. https://doi.org/10.1109/CDC.2015.7402993
-
[22]
,Minimum Energy Estimation Applied to the Lorenz Attractor, Springer International Publishing, Cham, 2018, pp. 165–182. https://doi.org/10.1007/978-3-030-01959-4
-
[23]
Lukes,Optimal regulation of nonlinear dynamical systems, SIAM J
D. Lukes,Optimal regulation of nonlinear dynamical systems, SIAM J. on Contr., 7 (1969), pp. 75–100. https://doi.org/10.1137/0307007
-
[24]
P. Moireau,A discrete-time optimal filtering approach for non-linear systems as a stable discretiza- tion of the Mortensen observer, ESAIM - Control Optim. Calc. Var., 24 (2018), pp. 1815–1847. https://doi.org/10.1051/cocv/2017077
-
[25]
R. E. Mortensen,Maximum-likelihood recursive nonlinear filtering, J. Optim. Theory Appl., 2 (1968), pp. 386–394. https://doi.org/10.1007/BF00925744
-
[26]
L. Sallandt,Computing high-dimensional value functions of optimal feedback control prob- lems using the tensor-train format, doctoral thesis, Technische Universität Berlin, 2022. https://doi.org/10.14279/depositonce-12786
-
[27]
J. Schröder,Energy-optimal state reconstruction for deterministic nonlinear dynamical systems, doctoral thesis, Technische Universität Berlin, 2025. https://doi.org/10.14279/depositonce-23323
-
[28]
J. Schröder,Local well-posedness of the minimum energy estimator for a defocusing cubic wave equation, J. Differ. Equ., 435 (2025), p. 113258. https://doi.org/10.1016/j.jde.2025.113258
-
[29]
Energy-Optimal State Reconstruction for Deter- ministic Nonlinear Dynamical Systems
J. Schröder and T. Breiten,Code for the thesis “Energy-Optimal State Reconstruction for Deter- ministic Nonlinear Dynamical Systems”, 2024. https://doi.org/10.5281/zenodo.14394797
-
[30]
J. C. Willems,Deterministic least squares filtering, J. Econom., 118 (2004), pp. 341–373. https://doi.org/10.1016/S0304-4076(03)00146-5
-
[31]
A. Wolf,Low rank tensor decompositions for high dimensional data approximation, recovery and prediction, doctoral thesis, Technische Universität Berlin, 2019. https://doi.org/10.14279/depositonce- 8109
-
[32]
X. Wu, B. Jacob, and H. Elbern,Optimal control and observation locations for time- varying systems on a finite-time horizon, SIAM J. Control Optim., 54 (2016), pp. 291–316. https://doi.org/10.1137/15M1014759. ⋆Institute of Mathematics, Technische Universität Berlin, Straße des 17. Juni 136, 10623 Berlin, Germany †Johann Radon Institute, Austrian Academy o...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.