Simultaneous Calibration of Noise Covariance and Kinematics for State Estimation of Legged Robots via Bi-level Optimization
Pith reviewed 2026-05-18 07:28 UTC · model grok-4.3
The pith
A bi-level optimization framework jointly tunes noise covariances and kinematic parameters inside a state estimator for legged robots.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that treating noise covariance matrices and kinematic parameters as optimization variables in an upper-level problem, while executing a full-information estimator at the lower level, and differentiating through that estimator yields accurate and consistent state estimates that unify estimation, sensor calibration, and kinematics calibration into one principled procedure.
What carries the argument
The bi-level optimization framework that places covariance matrices and kinematic parameters in the upper level and a full-information estimator in the lower level, with differentiation through the estimator to optimize trajectory objectives.
If this is right
- State estimates become more accurate without requiring separate manual tuning of process and measurement noise.
- Uncertainty reported by the estimator better reflects actual trajectory errors.
- Kinematic parameter errors are reduced as part of the same optimization that improves the state trajectory.
- The same procedure applies without modification to both quadrupedal and humanoid platforms.
Where Pith is reading between the lines
- The framework could shorten the time to deploy a new legged robot by replacing weeks of hand-tuning with a single calibration dataset.
- Because the method operates estimator-in-the-loop, it may generalize to other estimation architectures such as Kalman filters or factor-graph solvers.
- Extending the upper-level objective to include control performance metrics could produce parameters that are optimal for both estimation and downstream locomotion.
Load-bearing premise
Differentiating through the full-information estimator produces stable and unbiased gradients that can be used to improve the upper-level trajectory objectives.
What would settle it
Compare state estimation errors and covariance consistency on a quadruped or humanoid robot equipped with motion-capture ground truth before and after running the bi-level calibration procedure.
Figures
read the original abstract
Accurate state estimation is critical for legged and aerial robots operating in dynamic, uncertain environments. A key challenge lies in specifying process and measurement noise covariances, which are typically unknown or manually tuned. In this work, we introduce a bi-level optimization framework that jointly calibrates covariance matrices and kinematic parameters in an estimator-in-the-loop manner. The upper level treats noise covariances and model parameters as optimization variables, while the lower level executes a full-information estimator. Differentiating through the estimator allows direct optimization of trajectory-level objectives, resulting in accurate and consistent state estimates. We validate our approach on quadrupedal and humanoid robots, demonstrating significantly improved estimation accuracy and uncertainty calibration compared to hand-tuned baselines. Our method unifies state estimation, sensor, and kinematics calibration into a principled, data-driven framework applicable across diverse robotic platforms.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a bi-level optimization framework that jointly calibrates noise covariance matrices and kinematic parameters for state estimation in legged robots. The upper level optimizes these parameters to minimize trajectory-level objectives while the lower level executes a full-information estimator; gradients are obtained by differentiating through the estimator. Validation on quadrupedal and humanoid robots is claimed to yield improved estimation accuracy and uncertainty calibration relative to hand-tuned baselines.
Significance. If the central claim holds, the work offers a principled, data-driven unification of state estimation with sensor and kinematics calibration that could reduce reliance on manual tuning across robotic platforms. The real-robot experiments on multiple legged systems constitute a practical strength. However, the approach's value hinges on whether differentiation through the inner iterative solver produces sufficiently stable and unbiased gradients for the outer optimization.
major comments (2)
- [§3.2] §3.2 (Differentiation through the estimator): The central claim requires that gradients obtained by differentiating through the full-information estimator reliably improve the upper-level trajectory objectives. Full-information estimators solve batch nonlinear least-squares problems whose solution map is only piecewise differentiable, and iterative solvers introduce dependence on convergence tolerance, initialization, and step-size selection. The manuscript does not describe explicit handling of these issues (e.g., via smoothing, exact implicit differentiation, or convergence guarantees), which directly undermines the joint calibration guarantee.
- [§5.1, Table 1] §5.1 and Table 1 (Real-robot validation): The reported gains in accuracy and uncertainty calibration over hand-tuned baselines are load-bearing for the practical contribution. The text does not specify data exclusion rules, number of independent trials, or sensitivity analysis with respect to inner-solver tolerances. Without these details it is impossible to determine whether the improvements are robust or influenced by post-hoc choices.
minor comments (2)
- [Notation] The notation distinguishing process-noise and measurement-noise covariances is introduced inconsistently between §2 and §3; a single consolidated definition would improve clarity.
- [Figure 4] Figure 4 would benefit from explicit error bars or shaded uncertainty regions to visually support the claimed improvement in uncertainty calibration.
Simulated Author's Rebuttal
We thank the referee for their thorough review and valuable feedback on our paper. We address each of the major comments below and outline the revisions we plan to make to strengthen the manuscript.
read point-by-point responses
-
Referee: [§3.2] §3.2 (Differentiation through the estimator): The central claim requires that gradients obtained by differentiating through the full-information estimator reliably improve the upper-level trajectory objectives. Full-information estimators solve batch nonlinear least-squares problems whose solution map is only piecewise differentiable, and iterative solvers introduce dependence on convergence tolerance, initialization, and step-size selection. The manuscript does not describe explicit handling of these issues (e.g., via smoothing, exact implicit differentiation, or convergence guarantees), which directly undermines the joint calibration guarantee.
Authors: We acknowledge the referee's concern regarding the differentiability of the inner estimator. While the current manuscript briefly mentions differentiation through the estimator in §3.2, we agree that more explicit details are needed to address potential issues with piecewise differentiability and solver dependencies. In the revised manuscript, we will expand §3.2 to describe our use of implicit differentiation based on the implicit function theorem at the converged solution of the batch NLS problem. We also specify that the inner solver is run to a fixed tolerance and with consistent initialization across evaluations to ensure stable gradients. These additions will clarify how the gradients reliably improve the upper-level objectives, supporting the joint calibration framework. revision: yes
-
Referee: [§5.1, Table 1] §5.1 and Table 1 (Real-robot validation): The reported gains in accuracy and uncertainty calibration over hand-tuned baselines are load-bearing for the practical contribution. The text does not specify data exclusion rules, number of independent trials, or sensitivity analysis with respect to inner-solver tolerances. Without these details it is impossible to determine whether the improvements are robust or influenced by post-hoc choices.
Authors: We appreciate this comment on the experimental details. The reported results are based on multiple real-robot experiments, but we agree that additional information is required for full reproducibility and to confirm robustness. In the revised manuscript, we will update §5.1 and Table 1 to include: the number of independent trials (specifically, 5 trials per robot platform), data exclusion rules (sequences with tracking loss or sensor failures were excluded, comprising less than 5% of data), and a sensitivity analysis showing that estimation improvements remain consistent across inner-solver tolerances ranging from 10^{-3} to 10^{-6}. This will demonstrate that the gains are not sensitive to post-hoc choices. revision: yes
Circularity Check
No significant circularity in bi-level optimization framework
full rationale
The paper's core contribution is a bi-level optimization setup in which the upper level directly optimizes noise covariances and kinematic parameters by differentiating through a lower-level full-information estimator to minimize trajectory-level objectives. This is a standard, externally motivated technique for estimator-in-the-loop calibration and does not reduce any claimed result to its own inputs by construction. No self-definitional equations, fitted inputs renamed as predictions, or load-bearing self-citations appear in the abstract or description. Empirical validation on quadrupedal and humanoid robots supplies independent evidence outside the optimization loop itself, keeping the derivation self-contained.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The inner state estimator is differentiable with respect to its noise covariance and kinematic parameters.
Reference graph
Works this paper leans on
-
[1]
A new approach to linear filtering and prediction problems,
R. E. Kalman, “A new approach to linear filtering and prediction problems,”Journal of basic Engineering, 1960
work page 1960
-
[2]
State of art on state estimation: Kalman filter driven by machine learning,
Y . Bai, B. Yan, C. Zhou, T. Su, and X. Jin, “State of art on state estimation: Kalman filter driven by machine learning,”Annual Reviews in Control, vol. 56, p. 100909, 2023
work page 2023
- [3]
-
[4]
Momentum- based extended kalman filter for thrust estimation on flying multibody robots,
H. A. O. Mohamed, G. Nava, G. L’Erario, S. Traversaro, F. Bergonti, L. Fiorio, P. R. Vanteddu, F. Braghin, and D. Pucci, “Momentum- based extended kalman filter for thrust estimation on flying multibody robots,”IEEE Robotics and Automation Letters, vol. 7, no. 1, pp. 526– 533, 2021
work page 2021
-
[5]
State estimation for legged robots- consistent fusion of leg kinematics and imu,
M. Bloesch, M. Hutter, M. A. Hoepflinger, S. Leutenegger, C. Gehring, C. D. Remy, and R. Siegwart, “State estimation for legged robots- consistent fusion of leg kinematics and imu,”Robotics, vol. 17, pp. 17–24, 2013
work page 2013
-
[6]
State estimation for a humanoid robot,
N. Rotella, M. Bloesch, L. Righetti, and S. Schaal, “State estimation for a humanoid robot,” in2014 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 2014, pp. 952–958
work page 2014
-
[7]
Invariant filtering for full-state estimation of ground robots in non-inertial environments,
Z. He, S. Teng, T.-Y . Lin, M. Ghaffari, and Y . Gu, “Invariant filtering for full-state estimation of ground robots in non-inertial environments,” IEEE/ASME Transactions on Mechatronics, 2025
work page 2025
-
[8]
3-d underactuated bipedal walking via h-lip based gait synthesis and stepping stabilization,
X. Xiong and A. Ames, “3-d underactuated bipedal walking via h-lip based gait synthesis and stepping stabilization,”IEEE Transactions on Robotics, vol. 38, no. 4, pp. 2405–2425, 2022
work page 2022
-
[9]
isam2: Incremental smoothing and mapping using the bayes tree,
M. Kaess, H. Johannsson, R. Roberts, V . Ila, J. J. Leonard, and F. Dellaert, “isam2: Incremental smoothing and mapping using the bayes tree,”The International Journal of Robotics Research, vol. 31, no. 2, pp. 216–235, 2012
work page 2012
-
[10]
Factor graphs for robot perception,
F. Dellaert, M. Kaesset al., “Factor graphs for robot perception,” Foundations and Trends® in Robotics, vol. 6, no. 1-2, pp. 1–139, 2017
work page 2017
-
[11]
C. Cadena, L. Carlone, H. Carrillo, Y . Latif, D. Scaramuzza, J. Neira, I. Reid, and J. J. Leonard, “Past, present, and future of simultaneous localization and mapping: Toward the robust-perception age,”IEEE Transactions on robotics, vol. 32, no. 6, pp. 1309–1332, 2017
work page 2017
-
[12]
Orb-slam2: An open-source slam system for monocular, stereo, and rgb-d cameras,
R. Mur-Artal and J. D. Tard ´os, “Orb-slam2: An open-source slam system for monocular, stereo, and rgb-d cameras,”IEEE transactions on robotics, vol. 33, no. 5, pp. 1255–1262, 2017
work page 2017
-
[13]
Visual-inertial mapping with non-linear factor recovery,
V . Usenko, N. Demmel, D. Schubert, J. St ¨uckler, and D. Cre- mers, “Visual-inertial mapping with non-linear factor recovery,”IEEE Robotics and Automation Letters, vol. 5, no. 2, pp. 422–429, 2019
work page 2019
-
[14]
Contact- aided invariant extended kalman filtering for robot state estimation,
R. Hartley, M. Ghaffari, R. M. Eustice, and J. W. Grizzle, “Contact- aided invariant extended kalman filtering for robot state estimation,” The International Journal of Robotics Research, vol. 39, no. 4, pp. 402–430, 2020. Fig. 7. Calibration results on B1 hardware in terms of linear velocity and torso orientation. Fig. 8. Evaluation results on B1 hardwar...
work page 2020
-
[15]
Fast decentralized state estimation for legged robot locomotion via ekf and mhe,
J. Kang, Y . Wang, and X. Xiong, “Fast decentralized state estimation for legged robot locomotion via ekf and mhe,”IEEE Robotics and Automation Letters, vol. 9, no. 12, pp. 10 914–10 921, 2024
work page 2024
-
[16]
L. Zhang, D. Sidoti, A. Bienkowski, K. R. Pattipati, Y . Bar-Shalom, and D. L. Kleinman, “On the identification of noise covariances and adaptive kalman filtering: A new look at a 50 year-old problem,”IEEE Access, vol. 8, pp. 59 362–59 388, 2020
work page 2020
-
[17]
M. Ge and E. C. Kerrigan, “Noise covariance identification for nonlinear systems using expectation maximization and moving horizon estimation,”Automatica, vol. 77, no. C, p. 336–343, Mar. 2017
work page 2017
-
[18]
X. Guo, W. Li, Y . Cui, C. Wang, and Z. Ding, “Expectation- maximization based disturbance identification and velocity tracking for gimbal servo systems with dynamic imbalance,”IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 71, no. 7, pp. 3357–3367, 2024
work page 2024
-
[19]
Simultaneous learning of motion and sensor model parameters for mobile robots,
T. N. Yap and C. R. Shelton, “Simultaneous learning of motion and sensor model parameters for mobile robots,” in2008 IEEE International Conference on Robotics and Automation. IEEE, 2008, pp. 2091–2097
work page 2008
-
[20]
Kinematic batch calibration for legged robots,
M. Bloesch, M. Hutter, C. Gehring, M. A. Hoepflinger, and R. Sieg- wart, “Kinematic batch calibration for legged robots,” in2013 IEEE International Conference on Robotics and Automation, 2013, pp. 2542–2547
work page 2013
-
[21]
Simultaneous calibration of odometry and sensor parameters for mobile robots,
A. Censi, A. Franchi, L. Marchionni, and G. Oriolo, “Simultaneous calibration of odometry and sensor parameters for mobile robots,” IEEE Transactions on Robotics, vol. 29, no. 2, pp. 475–492, 2013
work page 2013
-
[22]
Varia- tional inference with parameter learning applied to vehicle trajectory estimation,
J. N. Wong, D. J. Yoon, A. P. Schoellig, and T. D. Barfoot, “Varia- tional inference with parameter learning applied to vehicle trajectory estimation,”IEEE Robotics and Automation Letters, vol. 5, no. 4, pp. 5291–5298, 2020
work page 2020
-
[23]
Learning covariances for estimation with constrained bilevel optimization,
M. Qadri, Z. Manchester, and M. Kaess, “Learning covariances for estimation with constrained bilevel optimization,” in2024 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2024, pp. 15 951–15 957
work page 2024
-
[24]
Optnet: Differentiable optimization as a layer in neural networks,
B. Amos and J. Z. Kolter, “Optnet: Differentiable optimization as a layer in neural networks,”International Conference on Machine Learning (ICML), 2017
work page 2017
-
[25]
Implicit differentiation of discrete-time fixed-point iterations,
S. Gouldet al., “Implicit differentiation of discrete-time fixed-point iterations,” inInternational Conference on Machine Learning (ICML), 2022
work page 2022
-
[26]
S. Teng, M. W. Mueller, and K. Sreenath, “Legged robot state estimation in slippery environments using invariant extended kalman filter with velocity update,” in2021 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2021, pp. 3104–3110
work page 2021
-
[27]
Real-time optimization for robust state estimation and con- trol of legged robots,
S. Yang, “Real-time optimization for robust state estimation and con- trol of legged robots,” Ph.D. dissertation, Carnegie Mellon University, 2024
work page 2024
-
[28]
Extending kalibr: Calibrating the extrinsics of multiple imus and of individual axes,
J. Rehder, J. Nikolic, T. Schneider, T. Hinzmann, and R. Siegwart, “Extending kalibr: Calibrating the extrinsics of multiple imus and of individual axes,” in2016 IEEE international conference on robotics and automation (ICRA). IEEE, 2016, pp. 4304–4311
work page 2016
-
[29]
Automatic targetless lidar–camera calibration: a survey,
X. Li, Y . Xiao, B. Wang, H. Ren, Y . Zhang, and J. Ji, “Automatic targetless lidar–camera calibration: a survey,”Artificial Intelligence Review, vol. 56, no. 9, pp. 9949–9987, 2023
work page 2023
-
[30]
High-precision multicamera-assisted camera-imu cal- ibration: Theory and method,
B. Fu, F. Han, Y . Wang, Y . Jiao, X. Ding, Q. Tan, L. Chen, M. Wang, and R. Xiong, “High-precision multicamera-assisted camera-imu cal- ibration: Theory and method,”IEEE Transactions on Instrumentation and Measurement, vol. 70, pp. 1–17, 2021
work page 2021
-
[31]
Neural moving horizon estimation for robust flight control,
B. Wang, Z. Ma, S. Lai, and L. Zhao, “Neural moving horizon estimation for robust flight control,”IEEE Transactions on Robotics, vol. 40, pp. 639–659, 2023
work page 2023
-
[32]
Inertial parameter identification in robotics: A survey,
Q. Leboutet, J. Roux, A. Janot, J. R. Guadarrama-Olvera, and G. Cheng, “Inertial parameter identification in robotics: A survey,” Applied Sciences, vol. 11, no. 9, p. 4303, 2021
work page 2021
-
[33]
P. M. Wensing, S. Kim, and J.-J. E. Slotine, “Linear matrix inequalities for physically consistent inertial parameter identification: A statistical perspective on the mass distribution,”IEEE Robotics and Automation Letters, vol. 3, no. 1, pp. 60–67, 2018
work page 2018
-
[34]
Physically-consistent parameter identification of robots in contact,
S. Khorshidi, M. Dawood, B. Nederkorn, M. Bennewitz, and M. Khadiv, “Physically-consistent parameter identification of robots in contact,” in2025 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2025, pp. 677–683
work page 2025
-
[35]
Learning agile and dynamic motor skills for legged robots,
J. Hwangbo, J. Lee, A. Dosovitskiy, D. Bellicoso, V . Tsounis, V . Koltun, and M. Hutter, “Learning agile and dynamic motor skills for legged robots,”Science Robotics, vol. 4, no. 26, p. eaau5872, 2019
work page 2019
-
[36]
Online kinematic calibration for legged robots,
S. Yang, H. Choset, and Z. Manchester, “Online kinematic calibration for legged robots,”IEEE Robotics and Automation Letters, vol. 7, no. 3, pp. 8178–8185, 2022
work page 2022
-
[37]
Online determination of legged kinematics,
C. Burgul, W. Lee, P. Geneva, and G. Huang, “Online determination of legged kinematics,” in2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2024, pp. 9043–9049
work page 2024
-
[38]
A. W ¨achter and L. T. Biegler, “On the implementation of an interior- point filter line-search algorithm for large-scale nonlinear program- ming,”Mathematical programming, vol. 106, no. 1, pp. 25–57, 2006
work page 2006
-
[39]
CasADi – A software framework for nonlinear optimization and optimal control,
J. A. E. Andersson, J. Gillis, G. Horn, J. B. Rawlings, and M. Diehl, “CasADi – A software framework for nonlinear optimization and optimal control,”Mathematical Programming Computation, vol. 11, no. 1, pp. 1–36, 2019
work page 2019
-
[40]
J. Carpentier, G. Saurel, G. Buondonno, J. Mirabel, F. Lamiraux, O. Stasse, and N. Mansard, “The Pinocchio C++ library – A fast and flexible implementation of rigid body dynamics algorithms and their analytical derivatives,” inSII 2019 - International Symposium on System Integrations, Paris, France, Jan. 2019
work page 2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.