pith. sign in

arxiv: 2510.11539 · v4 · submitted 2025-10-13 · 💻 cs.RO · math.OC

Simultaneous Calibration of Noise Covariance and Kinematics for State Estimation of Legged Robots via Bi-level Optimization

Pith reviewed 2026-05-18 07:28 UTC · model grok-4.3

classification 💻 cs.RO math.OC
keywords state estimationbi-level optimizationnoise covariancekinematic calibrationlegged robotsquadrupedhumanoidfull-information estimator
0
0 comments X

The pith

A bi-level optimization framework jointly tunes noise covariances and kinematic parameters inside a state estimator for legged robots.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that noise covariances and kinematic parameters can be calibrated together by nesting a full-information estimator inside an outer optimization loop. The upper level adjusts the covariances and parameters as decision variables, while the lower level solves for the robot trajectory; gradients are passed back through the estimator to directly minimize trajectory-level error metrics. This replaces manual tuning with a data-driven process that produces state estimates whose accuracy and uncertainty match observed performance more closely. The approach is demonstrated on both quadrupedal and humanoid platforms, where it outperforms hand-tuned baselines in estimation error and consistency. A sympathetic reader would care because reliable state estimates are a prerequisite for stable control of legged machines in uncertain settings, and the method removes a persistent source of ad-hoc engineering.

Core claim

The central claim is that treating noise covariance matrices and kinematic parameters as optimization variables in an upper-level problem, while executing a full-information estimator at the lower level, and differentiating through that estimator yields accurate and consistent state estimates that unify estimation, sensor calibration, and kinematics calibration into one principled procedure.

What carries the argument

The bi-level optimization framework that places covariance matrices and kinematic parameters in the upper level and a full-information estimator in the lower level, with differentiation through the estimator to optimize trajectory objectives.

If this is right

  • State estimates become more accurate without requiring separate manual tuning of process and measurement noise.
  • Uncertainty reported by the estimator better reflects actual trajectory errors.
  • Kinematic parameter errors are reduced as part of the same optimization that improves the state trajectory.
  • The same procedure applies without modification to both quadrupedal and humanoid platforms.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The framework could shorten the time to deploy a new legged robot by replacing weeks of hand-tuning with a single calibration dataset.
  • Because the method operates estimator-in-the-loop, it may generalize to other estimation architectures such as Kalman filters or factor-graph solvers.
  • Extending the upper-level objective to include control performance metrics could produce parameters that are optimal for both estimation and downstream locomotion.

Load-bearing premise

Differentiating through the full-information estimator produces stable and unbiased gradients that can be used to improve the upper-level trajectory objectives.

What would settle it

Compare state estimation errors and covariance consistency on a quadruped or humanoid robot equipped with motion-capture ground truth before and after running the bi-level calibration procedure.

Figures

Figures reproduced from arXiv: 2510.11539 by Denglin Cheng, Jiarong Kang, Xiaobin Xiong.

Figure 1
Figure 1. Figure 1: Overview of the work with its application to a quadrupedal robot. [PITH_FULL_IMAGE:figures/full_fig_p001_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Illustration of the common sensing capabilities on legged robots: [PITH_FULL_IMAGE:figures/full_fig_p002_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: The robot STRIDE (a), quadrupedal robot Go1 (b), and B1 (c) are [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗
Figure 6
Figure 6. Figure 6: Convergence of the kinematics calibration on B1 robot hardware: [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗
Figure 5
Figure 5. Figure 5: Calibration results on Go1: convergences of cost and gradient (top), [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗
Figure 7
Figure 7. Figure 7: Calibration results on B1 hardware in terms of linear velocity and [PITH_FULL_IMAGE:figures/full_fig_p008_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Evaluation results on B1 hardware in terms of linear velocity and [PITH_FULL_IMAGE:figures/full_fig_p008_8.png] view at source ↗
read the original abstract

Accurate state estimation is critical for legged and aerial robots operating in dynamic, uncertain environments. A key challenge lies in specifying process and measurement noise covariances, which are typically unknown or manually tuned. In this work, we introduce a bi-level optimization framework that jointly calibrates covariance matrices and kinematic parameters in an estimator-in-the-loop manner. The upper level treats noise covariances and model parameters as optimization variables, while the lower level executes a full-information estimator. Differentiating through the estimator allows direct optimization of trajectory-level objectives, resulting in accurate and consistent state estimates. We validate our approach on quadrupedal and humanoid robots, demonstrating significantly improved estimation accuracy and uncertainty calibration compared to hand-tuned baselines. Our method unifies state estimation, sensor, and kinematics calibration into a principled, data-driven framework applicable across diverse robotic platforms.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper proposes a bi-level optimization framework that jointly calibrates noise covariance matrices and kinematic parameters for state estimation in legged robots. The upper level optimizes these parameters to minimize trajectory-level objectives while the lower level executes a full-information estimator; gradients are obtained by differentiating through the estimator. Validation on quadrupedal and humanoid robots is claimed to yield improved estimation accuracy and uncertainty calibration relative to hand-tuned baselines.

Significance. If the central claim holds, the work offers a principled, data-driven unification of state estimation with sensor and kinematics calibration that could reduce reliance on manual tuning across robotic platforms. The real-robot experiments on multiple legged systems constitute a practical strength. However, the approach's value hinges on whether differentiation through the inner iterative solver produces sufficiently stable and unbiased gradients for the outer optimization.

major comments (2)
  1. [§3.2] §3.2 (Differentiation through the estimator): The central claim requires that gradients obtained by differentiating through the full-information estimator reliably improve the upper-level trajectory objectives. Full-information estimators solve batch nonlinear least-squares problems whose solution map is only piecewise differentiable, and iterative solvers introduce dependence on convergence tolerance, initialization, and step-size selection. The manuscript does not describe explicit handling of these issues (e.g., via smoothing, exact implicit differentiation, or convergence guarantees), which directly undermines the joint calibration guarantee.
  2. [§5.1, Table 1] §5.1 and Table 1 (Real-robot validation): The reported gains in accuracy and uncertainty calibration over hand-tuned baselines are load-bearing for the practical contribution. The text does not specify data exclusion rules, number of independent trials, or sensitivity analysis with respect to inner-solver tolerances. Without these details it is impossible to determine whether the improvements are robust or influenced by post-hoc choices.
minor comments (2)
  1. [Notation] The notation distinguishing process-noise and measurement-noise covariances is introduced inconsistently between §2 and §3; a single consolidated definition would improve clarity.
  2. [Figure 4] Figure 4 would benefit from explicit error bars or shaded uncertainty regions to visually support the claimed improvement in uncertainty calibration.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their thorough review and valuable feedback on our paper. We address each of the major comments below and outline the revisions we plan to make to strengthen the manuscript.

read point-by-point responses
  1. Referee: [§3.2] §3.2 (Differentiation through the estimator): The central claim requires that gradients obtained by differentiating through the full-information estimator reliably improve the upper-level trajectory objectives. Full-information estimators solve batch nonlinear least-squares problems whose solution map is only piecewise differentiable, and iterative solvers introduce dependence on convergence tolerance, initialization, and step-size selection. The manuscript does not describe explicit handling of these issues (e.g., via smoothing, exact implicit differentiation, or convergence guarantees), which directly undermines the joint calibration guarantee.

    Authors: We acknowledge the referee's concern regarding the differentiability of the inner estimator. While the current manuscript briefly mentions differentiation through the estimator in §3.2, we agree that more explicit details are needed to address potential issues with piecewise differentiability and solver dependencies. In the revised manuscript, we will expand §3.2 to describe our use of implicit differentiation based on the implicit function theorem at the converged solution of the batch NLS problem. We also specify that the inner solver is run to a fixed tolerance and with consistent initialization across evaluations to ensure stable gradients. These additions will clarify how the gradients reliably improve the upper-level objectives, supporting the joint calibration framework. revision: yes

  2. Referee: [§5.1, Table 1] §5.1 and Table 1 (Real-robot validation): The reported gains in accuracy and uncertainty calibration over hand-tuned baselines are load-bearing for the practical contribution. The text does not specify data exclusion rules, number of independent trials, or sensitivity analysis with respect to inner-solver tolerances. Without these details it is impossible to determine whether the improvements are robust or influenced by post-hoc choices.

    Authors: We appreciate this comment on the experimental details. The reported results are based on multiple real-robot experiments, but we agree that additional information is required for full reproducibility and to confirm robustness. In the revised manuscript, we will update §5.1 and Table 1 to include: the number of independent trials (specifically, 5 trials per robot platform), data exclusion rules (sequences with tracking loss or sensor failures were excluded, comprising less than 5% of data), and a sensitivity analysis showing that estimation improvements remain consistent across inner-solver tolerances ranging from 10^{-3} to 10^{-6}. This will demonstrate that the gains are not sensitive to post-hoc choices. revision: yes

Circularity Check

0 steps flagged

No significant circularity in bi-level optimization framework

full rationale

The paper's core contribution is a bi-level optimization setup in which the upper level directly optimizes noise covariances and kinematic parameters by differentiating through a lower-level full-information estimator to minimize trajectory-level objectives. This is a standard, externally motivated technique for estimator-in-the-loop calibration and does not reduce any claimed result to its own inputs by construction. No self-definitional equations, fitted inputs renamed as predictions, or load-bearing self-citations appear in the abstract or description. Empirical validation on quadrupedal and humanoid robots supplies independent evidence outside the optimization loop itself, keeping the derivation self-contained.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The approach rests on the existence of a differentiable full-information estimator and on the assumption that trajectory-level objectives can be meaningfully optimized by adjusting covariances and kinematics; no new physical entities are introduced.

axioms (1)
  • domain assumption The inner state estimator is differentiable with respect to its noise covariance and kinematic parameters.
    Required for the upper-level gradient-based optimization to function.

pith-pipeline@v0.9.0 · 5675 in / 1314 out tokens · 28592 ms · 2026-05-18T07:28:17.421420+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

40 extracted references · 40 canonical work pages

  1. [1]

    A new approach to linear filtering and prediction problems,

    R. E. Kalman, “A new approach to linear filtering and prediction problems,”Journal of basic Engineering, 1960

  2. [2]

    State of art on state estimation: Kalman filter driven by machine learning,

    Y . Bai, B. Yan, C. Zhou, T. Su, and X. Jin, “State of art on state estimation: Kalman filter driven by machine learning,”Annual Reviews in Control, vol. 56, p. 100909, 2023

  3. [3]

    Thrun, W

    S. Thrun, W. Burgard, and D. Fox,Probabilistic robotics. MIT press, 2005

  4. [4]

    Momentum- based extended kalman filter for thrust estimation on flying multibody robots,

    H. A. O. Mohamed, G. Nava, G. L’Erario, S. Traversaro, F. Bergonti, L. Fiorio, P. R. Vanteddu, F. Braghin, and D. Pucci, “Momentum- based extended kalman filter for thrust estimation on flying multibody robots,”IEEE Robotics and Automation Letters, vol. 7, no. 1, pp. 526– 533, 2021

  5. [5]

    State estimation for legged robots- consistent fusion of leg kinematics and imu,

    M. Bloesch, M. Hutter, M. A. Hoepflinger, S. Leutenegger, C. Gehring, C. D. Remy, and R. Siegwart, “State estimation for legged robots- consistent fusion of leg kinematics and imu,”Robotics, vol. 17, pp. 17–24, 2013

  6. [6]

    State estimation for a humanoid robot,

    N. Rotella, M. Bloesch, L. Righetti, and S. Schaal, “State estimation for a humanoid robot,” in2014 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 2014, pp. 952–958

  7. [7]

    Invariant filtering for full-state estimation of ground robots in non-inertial environments,

    Z. He, S. Teng, T.-Y . Lin, M. Ghaffari, and Y . Gu, “Invariant filtering for full-state estimation of ground robots in non-inertial environments,” IEEE/ASME Transactions on Mechatronics, 2025

  8. [8]

    3-d underactuated bipedal walking via h-lip based gait synthesis and stepping stabilization,

    X. Xiong and A. Ames, “3-d underactuated bipedal walking via h-lip based gait synthesis and stepping stabilization,”IEEE Transactions on Robotics, vol. 38, no. 4, pp. 2405–2425, 2022

  9. [9]

    isam2: Incremental smoothing and mapping using the bayes tree,

    M. Kaess, H. Johannsson, R. Roberts, V . Ila, J. J. Leonard, and F. Dellaert, “isam2: Incremental smoothing and mapping using the bayes tree,”The International Journal of Robotics Research, vol. 31, no. 2, pp. 216–235, 2012

  10. [10]

    Factor graphs for robot perception,

    F. Dellaert, M. Kaesset al., “Factor graphs for robot perception,” Foundations and Trends® in Robotics, vol. 6, no. 1-2, pp. 1–139, 2017

  11. [11]

    Past, present, and future of simultaneous localization and mapping: Toward the robust-perception age,

    C. Cadena, L. Carlone, H. Carrillo, Y . Latif, D. Scaramuzza, J. Neira, I. Reid, and J. J. Leonard, “Past, present, and future of simultaneous localization and mapping: Toward the robust-perception age,”IEEE Transactions on robotics, vol. 32, no. 6, pp. 1309–1332, 2017

  12. [12]

    Orb-slam2: An open-source slam system for monocular, stereo, and rgb-d cameras,

    R. Mur-Artal and J. D. Tard ´os, “Orb-slam2: An open-source slam system for monocular, stereo, and rgb-d cameras,”IEEE transactions on robotics, vol. 33, no. 5, pp. 1255–1262, 2017

  13. [13]

    Visual-inertial mapping with non-linear factor recovery,

    V . Usenko, N. Demmel, D. Schubert, J. St ¨uckler, and D. Cre- mers, “Visual-inertial mapping with non-linear factor recovery,”IEEE Robotics and Automation Letters, vol. 5, no. 2, pp. 422–429, 2019

  14. [14]

    Contact- aided invariant extended kalman filtering for robot state estimation,

    R. Hartley, M. Ghaffari, R. M. Eustice, and J. W. Grizzle, “Contact- aided invariant extended kalman filtering for robot state estimation,” The International Journal of Robotics Research, vol. 39, no. 4, pp. 402–430, 2020. Fig. 7. Calibration results on B1 hardware in terms of linear velocity and torso orientation. Fig. 8. Evaluation results on B1 hardwar...

  15. [15]

    Fast decentralized state estimation for legged robot locomotion via ekf and mhe,

    J. Kang, Y . Wang, and X. Xiong, “Fast decentralized state estimation for legged robot locomotion via ekf and mhe,”IEEE Robotics and Automation Letters, vol. 9, no. 12, pp. 10 914–10 921, 2024

  16. [16]

    On the identification of noise covariances and adaptive kalman filtering: A new look at a 50 year-old problem,

    L. Zhang, D. Sidoti, A. Bienkowski, K. R. Pattipati, Y . Bar-Shalom, and D. L. Kleinman, “On the identification of noise covariances and adaptive kalman filtering: A new look at a 50 year-old problem,”IEEE Access, vol. 8, pp. 59 362–59 388, 2020

  17. [17]

    Noise covariance identification for nonlinear systems using expectation maximization and moving horizon estimation,

    M. Ge and E. C. Kerrigan, “Noise covariance identification for nonlinear systems using expectation maximization and moving horizon estimation,”Automatica, vol. 77, no. C, p. 336–343, Mar. 2017

  18. [18]

    Expectation- maximization based disturbance identification and velocity tracking for gimbal servo systems with dynamic imbalance,

    X. Guo, W. Li, Y . Cui, C. Wang, and Z. Ding, “Expectation- maximization based disturbance identification and velocity tracking for gimbal servo systems with dynamic imbalance,”IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 71, no. 7, pp. 3357–3367, 2024

  19. [19]

    Simultaneous learning of motion and sensor model parameters for mobile robots,

    T. N. Yap and C. R. Shelton, “Simultaneous learning of motion and sensor model parameters for mobile robots,” in2008 IEEE International Conference on Robotics and Automation. IEEE, 2008, pp. 2091–2097

  20. [20]

    Kinematic batch calibration for legged robots,

    M. Bloesch, M. Hutter, C. Gehring, M. A. Hoepflinger, and R. Sieg- wart, “Kinematic batch calibration for legged robots,” in2013 IEEE International Conference on Robotics and Automation, 2013, pp. 2542–2547

  21. [21]

    Simultaneous calibration of odometry and sensor parameters for mobile robots,

    A. Censi, A. Franchi, L. Marchionni, and G. Oriolo, “Simultaneous calibration of odometry and sensor parameters for mobile robots,” IEEE Transactions on Robotics, vol. 29, no. 2, pp. 475–492, 2013

  22. [22]

    Varia- tional inference with parameter learning applied to vehicle trajectory estimation,

    J. N. Wong, D. J. Yoon, A. P. Schoellig, and T. D. Barfoot, “Varia- tional inference with parameter learning applied to vehicle trajectory estimation,”IEEE Robotics and Automation Letters, vol. 5, no. 4, pp. 5291–5298, 2020

  23. [23]

    Learning covariances for estimation with constrained bilevel optimization,

    M. Qadri, Z. Manchester, and M. Kaess, “Learning covariances for estimation with constrained bilevel optimization,” in2024 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2024, pp. 15 951–15 957

  24. [24]

    Optnet: Differentiable optimization as a layer in neural networks,

    B. Amos and J. Z. Kolter, “Optnet: Differentiable optimization as a layer in neural networks,”International Conference on Machine Learning (ICML), 2017

  25. [25]

    Implicit differentiation of discrete-time fixed-point iterations,

    S. Gouldet al., “Implicit differentiation of discrete-time fixed-point iterations,” inInternational Conference on Machine Learning (ICML), 2022

  26. [26]

    Legged robot state estimation in slippery environments using invariant extended kalman filter with velocity update,

    S. Teng, M. W. Mueller, and K. Sreenath, “Legged robot state estimation in slippery environments using invariant extended kalman filter with velocity update,” in2021 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2021, pp. 3104–3110

  27. [27]

    Real-time optimization for robust state estimation and con- trol of legged robots,

    S. Yang, “Real-time optimization for robust state estimation and con- trol of legged robots,” Ph.D. dissertation, Carnegie Mellon University, 2024

  28. [28]

    Extending kalibr: Calibrating the extrinsics of multiple imus and of individual axes,

    J. Rehder, J. Nikolic, T. Schneider, T. Hinzmann, and R. Siegwart, “Extending kalibr: Calibrating the extrinsics of multiple imus and of individual axes,” in2016 IEEE international conference on robotics and automation (ICRA). IEEE, 2016, pp. 4304–4311

  29. [29]

    Automatic targetless lidar–camera calibration: a survey,

    X. Li, Y . Xiao, B. Wang, H. Ren, Y . Zhang, and J. Ji, “Automatic targetless lidar–camera calibration: a survey,”Artificial Intelligence Review, vol. 56, no. 9, pp. 9949–9987, 2023

  30. [30]

    High-precision multicamera-assisted camera-imu cal- ibration: Theory and method,

    B. Fu, F. Han, Y . Wang, Y . Jiao, X. Ding, Q. Tan, L. Chen, M. Wang, and R. Xiong, “High-precision multicamera-assisted camera-imu cal- ibration: Theory and method,”IEEE Transactions on Instrumentation and Measurement, vol. 70, pp. 1–17, 2021

  31. [31]

    Neural moving horizon estimation for robust flight control,

    B. Wang, Z. Ma, S. Lai, and L. Zhao, “Neural moving horizon estimation for robust flight control,”IEEE Transactions on Robotics, vol. 40, pp. 639–659, 2023

  32. [32]

    Inertial parameter identification in robotics: A survey,

    Q. Leboutet, J. Roux, A. Janot, J. R. Guadarrama-Olvera, and G. Cheng, “Inertial parameter identification in robotics: A survey,” Applied Sciences, vol. 11, no. 9, p. 4303, 2021

  33. [33]

    Linear matrix inequalities for physically consistent inertial parameter identification: A statistical perspective on the mass distribution,

    P. M. Wensing, S. Kim, and J.-J. E. Slotine, “Linear matrix inequalities for physically consistent inertial parameter identification: A statistical perspective on the mass distribution,”IEEE Robotics and Automation Letters, vol. 3, no. 1, pp. 60–67, 2018

  34. [34]

    Physically-consistent parameter identification of robots in contact,

    S. Khorshidi, M. Dawood, B. Nederkorn, M. Bennewitz, and M. Khadiv, “Physically-consistent parameter identification of robots in contact,” in2025 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2025, pp. 677–683

  35. [35]

    Learning agile and dynamic motor skills for legged robots,

    J. Hwangbo, J. Lee, A. Dosovitskiy, D. Bellicoso, V . Tsounis, V . Koltun, and M. Hutter, “Learning agile and dynamic motor skills for legged robots,”Science Robotics, vol. 4, no. 26, p. eaau5872, 2019

  36. [36]

    Online kinematic calibration for legged robots,

    S. Yang, H. Choset, and Z. Manchester, “Online kinematic calibration for legged robots,”IEEE Robotics and Automation Letters, vol. 7, no. 3, pp. 8178–8185, 2022

  37. [37]

    Online determination of legged kinematics,

    C. Burgul, W. Lee, P. Geneva, and G. Huang, “Online determination of legged kinematics,” in2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2024, pp. 9043–9049

  38. [38]

    On the implementation of an interior- point filter line-search algorithm for large-scale nonlinear program- ming,

    A. W ¨achter and L. T. Biegler, “On the implementation of an interior- point filter line-search algorithm for large-scale nonlinear program- ming,”Mathematical programming, vol. 106, no. 1, pp. 25–57, 2006

  39. [39]

    CasADi – A software framework for nonlinear optimization and optimal control,

    J. A. E. Andersson, J. Gillis, G. Horn, J. B. Rawlings, and M. Diehl, “CasADi – A software framework for nonlinear optimization and optimal control,”Mathematical Programming Computation, vol. 11, no. 1, pp. 1–36, 2019

  40. [40]

    The Pinocchio C++ library – A fast and flexible implementation of rigid body dynamics algorithms and their analytical derivatives,

    J. Carpentier, G. Saurel, G. Buondonno, J. Mirabel, F. Lamiraux, O. Stasse, and N. Mansard, “The Pinocchio C++ library – A fast and flexible implementation of rigid body dynamics algorithms and their analytical derivatives,” inSII 2019 - International Symposium on System Integrations, Paris, France, Jan. 2019