TAG-K: Tail-Averaged Greedy Kaczmarz for Computationally Efficient and Performant Online Inertial Parameter Estimation
Pith reviewed 2026-05-18 09:25 UTC · model grok-4.3
The pith
TAG-K extends the Kaczmarz method with greedy selection and tail averaging for faster online inertial parameter estimation in robots.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
TAG-K is a lightweight extension of the Kaczmarz method that combines greedy randomized row selection for rapid convergence with tail averaging for robustness under noise and inconsistency. This design enables fast, stable parameter adaptation while retaining the low per-iteration complexity inherent to the Kaczmarz framework.
What carries the argument
The TAG-K update that performs greedy randomized row selection followed by tail averaging of recent iterates to produce each new inertial parameter estimate.
If this is right
- Delivers 1.5x-1.9x faster solve times on laptop-class CPUs.
- Delivers 4.8x-20.7x faster solve times on embedded microcontrollers.
- Improves robustness to measurement noise compared with standard recursive least squares and Kalman filters.
- Reduces estimation error by 25 percent in the tested scenarios.
- Yields nearly 2x better end-to-end tracking performance in quadrotor flight tasks.
Where Pith is reading between the lines
- The same selection-plus-averaging pattern could transfer to other online regression tasks that run on microcontrollers.
- Higher update rates become feasible in tight control loops because each iteration stays cheap.
- The noise-handling benefit may extend to robots with lower-quality sensors than those used in the quadrotor tests.
Load-bearing premise
The linear regression model for inertial parameters remains valid and the robot motion supplies enough persistent excitation for the updates to converge despite noise.
What would settle it
A controlled test on a quadrotor where excitation drops below the level needed for identifiability while sensor noise is present, checking whether estimation error rises sharply or tracking performance collapses.
Figures
read the original abstract
Accurate online inertial parameter estimation is essential for adaptive robotic control, enabling real-time adjustment to payload changes, environmental interactions, and system wear. Traditional methods often struggle to track abrupt parameter shifts or incur high computational costs, limiting their effectiveness in dynamic environments and for computationally constrained robotic systems. We introduce TAG-K, a lightweight extension of the Kaczmarz method that combines greedy randomized row selection for rapid convergence with tail averaging for robustness under noise and inconsistency. This design enables fast, stable parameter adaptation while retaining the low per-iteration complexity inherent to the Kaczmarz framework. We evaluate TAG-K in synthetic benchmarks and quadrotor tracking tasks against RLS, KF, and other Kaczmarz variants. TAG-K achieves 1.5x-1.9x faster solve times on laptop-class CPUs and 4.8x-20.7x faster solve times on embedded microcontrollers. More importantly, these speedups are paired with improved robustness to measurement noise and a 25% reduction in estimation error, leading to nearly 2x better end-to-end tracking performance. Website, documentation, and code available at: https://a2r-lab.org/TAG-K/.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes TAG-K, a lightweight extension of the Kaczmarz method for online inertial parameter estimation that integrates greedy randomized row selection for rapid convergence with tail averaging for robustness to noise and inconsistency. It claims 1.5x-1.9x faster solve times on CPUs and 4.8x-20.7x on microcontrollers, plus a 25% reduction in estimation error and nearly 2x better end-to-end tracking performance versus RLS, KF, and other Kaczmarz variants, evaluated on synthetic benchmarks and quadrotor tracking tasks.
Significance. If the empirical results hold under fuller validation, the work offers a computationally efficient alternative for real-time adaptive control on resource-constrained platforms, addressing a practical gap in handling dynamic payloads and noise. The low per-iteration complexity and reported speedups on embedded hardware are strengths that could enable broader deployment of online estimation in robotics.
major comments (2)
- [Abstract and Evaluation] Abstract and §4 (Evaluation): The central motivation is fast adaptation to abrupt parameter shifts, yet no post-change convergence times, adaptation latency metrics, or forgetting-factor equivalents are reported for the quadrotor payload-jump experiments. Tail averaging inherently smooths iterates, so it is unclear whether the claimed robustness gains preserve the tracking agility needed to support the abstract's performance claims.
- [Evaluation] §4 and results tables: The reported speedups and 25% error reduction lack accompanying statistical tests, trial counts, or variance measures. Without these, the evidence for consistent superiority over baselines remains moderate and does not fully substantiate the load-bearing performance assertions.
minor comments (2)
- [Method] Ensure all algorithmic parameters (e.g., tail length, greedy selection probability) are explicitly defined with default values in the method section for reproducibility.
- [Figures] Figure captions for timing and error plots should include axis scales, units, and the exact number of Monte Carlo runs used.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback. We address the major comments point by point below and will incorporate revisions to strengthen the manuscript where appropriate.
read point-by-point responses
-
Referee: [Abstract and Evaluation] Abstract and §4 (Evaluation): The central motivation is fast adaptation to abrupt parameter shifts, yet no post-change convergence times, adaptation latency metrics, or forgetting-factor equivalents are reported for the quadrotor payload-jump experiments. Tail averaging inherently smooths iterates, so it is unclear whether the claimed robustness gains preserve the tracking agility needed to support the abstract's performance claims.
Authors: The quadrotor payload-jump experiments were designed to evaluate adaptation to abrupt shifts, with the reported nearly 2x improvement in end-to-end tracking performance serving as evidence that TAG-K retains sufficient agility. The combination of greedy row selection and tail averaging is intended to balance rapid convergence with noise robustness without excessive smoothing. That said, we agree that explicit post-change convergence times, adaptation latency metrics, and direct comparisons to forgetting-factor equivalents in RLS/KF would provide clearer quantification. In the revised manuscript we will add these metrics to §4. revision: yes
-
Referee: [Evaluation] §4 and results tables: The reported speedups and 25% error reduction lack accompanying statistical tests, trial counts, or variance measures. Without these, the evidence for consistent superiority over baselines remains moderate and does not fully substantiate the load-bearing performance assertions.
Authors: We acknowledge that the current results presentation would be strengthened by explicit statistical support. The evaluations were performed over multiple independent trials on both synthetic and hardware platforms, but trial counts, variance, and formal statistical tests were not reported in the tables. In the revision we will add the number of trials, standard deviations or inter-quartile ranges, and appropriate statistical comparisons (e.g., paired t-tests or Wilcoxon signed-rank tests) to substantiate the speedups and error reductions. revision: yes
Circularity Check
No circularity in algorithmic proposal or empirical claims
full rationale
The paper presents TAG-K as a lightweight algorithmic extension of the Kaczmarz method that adds greedy randomized row selection and tail averaging for online inertial parameter estimation. All performance claims (solve-time speedups, noise robustness, 25% error reduction, and tracking improvements) are established through direct empirical comparison against RLS, KF, and other Kaczmarz variants on synthetic benchmarks and quadrotor experiments. No derivation, equation, or first-principles result is shown to reduce to a fitted quantity or prior self-citation by construction; the central contribution is an independent algorithmic design whose validity rests on external experimental benchmarks rather than internal redefinition or self-referential justification.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Optimization-based control for dynamic legged robots,
P. M. Wensing, M. Posa, Y . Hu, A. Escande, N. Mansard, and A. D. Prete, “Optimization-based control for dynamic legged robots,”IEEE Transactions on Robotics, vol. 40, pp. 43–63, 2024
work page 2024
-
[2]
Optimization-based locomotion planning, estimation, and control design for the atlas humanoid robot,
S. Kuindersma, R. Deits, M. Fallon, A. Valenzuela, H. Dai, F. Per- menter, T. Koolen, P. Marion, and R. Tedrake, “Optimization-based locomotion planning, estimation, and control design for the atlas humanoid robot,”Autonomous Robots, vol. 40, no. 3, pp. 429–455, 2016
work page 2016
-
[3]
Taskable agility: Making useful dynamic behavior easier to create,
S. Kuindersma, “Taskable agility: Making useful dynamic behavior easier to create,” 2023, princeton University Seminar
work page 2023
-
[4]
D. Kim, J. Di Carlo, B. Katz, G. Bledt, and S. Kim, “Highly dynamic quadruped locomotion via whole-body impulse control and model predictive control,”arXiv preprint arXiv:1909.06586, 2019
-
[5]
Y .-H. Shin, S. Hong, S. Woo, J. Choe, H. Son, G. Kim, J.-H. Kim, K. Lee, J. Hwangbo, and H.-W. Park, “Design of kaist hound, a quadruped robot platform for fast and efficient locomotion with mixed- integer nonlinear optimization of a gear train,” in2022 International Conference on Robotics and Automation (ICRA). IEEE, 2022, pp. 6614–6620
work page 2022
-
[6]
H. Li and P. M. Wensing, “Cafe-mpc: A cascaded-fidelity model predictive control framework with tuning-free whole-body control,” IEEE Transactions on Robotics, vol. 41, pp. 837–856, 2025
work page 2025
-
[7]
Mpcc++: Model predictive contouring control for time-optimal flight with safety constraints,
M. Krinner, A. Romero, L. Bauersfeld, M. Zeilinger, A. Carron, and D. Scaramuzza, “Mpcc++: Model predictive contouring control for time-optimal flight with safety constraints,” 2024. [Online]. Available: https://arxiv.org/abs/2403.17551
-
[8]
Provably-safe, online system identification,
B. Zhang, Z. Zhou, and R. Vasudevan, “Provably-safe, online system identification,” 2025. [Online]. Available: https://arxiv.org/abs/2504. 21486
work page 2025
-
[9]
D. Baek, B. Peng, S. Gupta, and J. Ramos, “Online learning-based inertial parameter identification of unknown object for model- based control of wheeled humanoids,” 2024. [Online]. Available: https://arxiv.org/abs/2309.09810
-
[10]
I. Lopez-Sanchez, F. Rossomando, R. P ´erez-Alcocer, C. Soria, R. Carelli, and J. Moreno-Valenzuela, “Adaptive trajectory tracking control for quadrotors with disturbances by using generalized regression neural networks,”Neurocomputing, vol. 460, pp. 243–255,
-
[11]
Available: https://www.sciencedirect.com/science/ article/pii/S0925231221010092
[Online]. Available: https://www.sciencedirect.com/science/ article/pii/S0925231221010092
-
[12]
Adaptive control of unmanned aerial vehicles with varying payload and full parametric uncertainties,
I. H. Imran, K. Wood, and A. Montazeri, “Adaptive control of unmanned aerial vehicles with varying payload and full parametric uncertainties,”Electronics, vol. 13, no. 2, 2024. [Online]. Available: https://www.mdpi.com/2079-9292/13/2/347
work page 2024
-
[13]
Adaptive parameter estimation for aerial manipulation,
G. Baraban, M. Sheckells, S. Kim, and M. Kobilarov, “Adaptive parameter estimation for aerial manipulation,” in2020 American Control Conference (ACC), 07 2020, pp. 614–619
work page 2020
-
[14]
Some theorems in least squares,
R. L. Plackett, “Some theorems in least squares,”Biometrika, vol. 37, no. 1/2, pp. 149–157, 1950. [Online]. Available: http: //www.jstor.org/stable/2332158
-
[15]
A. H. Sayed,Fundamentals Of Adaptive Filtering. Wiley,
-
[16]
Available: https://api.semanticscholar.org/CorpusID: 60598442
[Online]. Available: https://api.semanticscholar.org/CorpusID: 60598442
-
[17]
A revisit to block and recursive least squares for parameter estimation,
J. Jiang and Y . Zhang, “A revisit to block and recursive least squares for parameter estimation,”Computers & Electrical Engineering, vol. 30, no. 5, pp. 403–416, 2004. [Online]. Available: https: //www.sciencedirect.com/science/article/pii/S0045790604000217
work page 2004
-
[18]
A robust variable forgetting factor recursive least-squares algorithm for system identification,
C. Paleologu, J. Benesty, and S. Ciochina, “A robust variable forgetting factor recursive least-squares algorithm for system identification,” IEEE Signal Processing Letters, vol. 15, pp. 597–600, 2008
work page 2008
-
[19]
A new approach to linear filtering and prediction problems,
R. E. Kalman, “A new approach to linear filtering and prediction problems,”Journal of Basic Engineering, 1960
work page 1960
-
[20]
F. Gustafsson,On-line Approaches. John Wiley & Sons, Ltd, 2000, ch. 3, pp. 55–87. [Online]. Available: https://onlinelibrary.wiley.com/ doi/abs/10.1002/0470841613.ch3
-
[21]
Application of approximate matrix multiplication to neural networks and distributed slam,
B. Plancher, C. D. Brumar, I. Brumar, L. Pentecost, S. Rama, and D. Brooks, “Application of approximate matrix multiplication to neural networks and distributed slam,” in2019 IEEE High Perfor- mance Extreme Computing Conference (HPEC), Waltham, MA, USA, September 2019, pp. 1–7
work page 2019
-
[22]
Physics-informed neural networks in robotics: A review,
V . Sivtsov, A. Papanikolaou, I. Markovic, I. Petrovic, and F. Bon- signorio, “Physics-informed neural networks in robotics: A review,” 01 2025
work page 2025
-
[23]
Deep learning and system identification,
L. Ljung, C. Andersson, K. Tiels, and T. B. Sch ¨on, “Deep learning and system identification,”IFAC-PapersOnLine, vol. 53, no. 2, pp. 1175– 1181, 2020, 21st IFAC World Congress. [Online]. Available: https: //www.sciencedirect.com/science/article/pii/S2405896320317353
work page 2020
-
[24]
H. Li, H. Chen, Y . Chai, H. Zhao, Y . Zhao, Y . Han, and J. Luo, “Pinn-based predictive control combined with unknown payload iden- tification for robots with prismatic quasi-direct-drives,”IEEE Robotics and Automation Letters, pp. 1–8, 2025
work page 2025
-
[25]
Learning deformable linear object dynamics from a single trajectory,
S. Mamedov, A. R. Geist, R. Viljoen, S. Trimpe, and J. Swevers, “Learning deformable linear object dynamics from a single trajectory,”
-
[26]
Available: https://arxiv.org/abs/2407.03476
[Online]. Available: https://arxiv.org/abs/2407.03476
-
[27]
Particle-grid neural dynamics for learning deformable object models from rgb-d videos,
K. Zhang, B. Li, K. Hauser, and Y . Li, “Particle-grid neural dynamics for learning deformable object models from rgb-d videos,” 2025. [Online]. Available: https://arxiv.org/abs/2506.15680
-
[28]
Differentiable physics models for real-world offline model-based reinforcement learning,
M. Lutter, J. Silberbauer, J. Watson, and J. Peters, “Differentiable physics models for real-world offline model-based reinforcement learning,” 2020. [Online]. Available: https://arxiv.org/abs/2011.01734
-
[29]
G. Qu, C. Yu, S. Low, and A. Wierman, “Combining model- based and model-free methods for nonlinear control: A provably convergent policy gradient approach,” 2020. [Online]. Available: https://arxiv.org/abs/2006.07476
-
[30]
Angenaherte auflosung von systemen linearer glei- chungen,
S. Karczmarz, “Angenaherte auflosung von systemen linearer glei- chungen,”Bull. Int. Acad. Pol. Sic. Let., Cl. Sci. Math. Nat., pp. 355– 357, 1937
work page 1937
-
[31]
A randomized kaczmarz algorithm with exponential convergence,
T. Strohmer and R. Vershynin, “A randomized kaczmarz algorithm with exponential convergence,” 2007
work page 2007
-
[32]
Randomized extended kaczmarz for solving least squares,
A. Zouzias and N. M. Freris, “Randomized extended kaczmarz for solving least squares,”SIAM Journal on Matrix Analysis and Applications, vol. 34, no. 2, p. 773–793, Jan. 2013. [Online]. Available: http://dx.doi.org/10.1137/120889897
-
[33]
The kaczmarz algorithm, row action methods, and statistical learning algorithms,
X. Chen, “The kaczmarz algorithm, row action methods, and statistical learning algorithms,”Frames and harmonic analysis, vol. 706, pp. 115–127, 2018
work page 2018
-
[34]
On greedy randomized kaczmarz method for solving large sparse linear systems,
Z.-Z. Bai and W.-T. Wu, “On greedy randomized kaczmarz method for solving large sparse linear systems,”SIAM Journal on Scientific Computing, vol. 40, no. 1, pp. A592–A606, 2018. [Online]. Available: https://doi.org/10.1137/17M1137747
-
[35]
A. Agrawal, P. Mayer, Z. Kingston, and G. Hollinger, “Constrained nonlinear kaczmarz projection on intersections of manifolds for coor- dinated multi-robot mobile manipulation,” 10 2024
work page 2024
-
[36]
C. P. Neuman and P. K. Khosla,Identification of Robot Dynamics: An Application of Recursive Estimation. Boston, MA: Springer US, 1986, pp. 175–194. [Online]. Available: https://doi.org/10.1007/ 978-1-4757-1895-9 12
work page 1986
-
[37]
An efficient calculation of computed torque control of flexible manipulators,
W. Khalil and F. Boyer, “An efficient calculation of computed torque control of flexible manipulators,” inProceedings of 1995 IEEE Inter- national Conference on Robotics and Automation, vol. 1, 1995, pp. 609–614 vol.1
work page 1995
-
[38]
On-line estimation of inertial parameters using a recursive total least-squares approach,
D. Kubus, T. Kroger, and F. M. Wahl, “On-line estimation of inertial parameters using a recursive total least-squares approach,” in2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008, pp. 3845–3852
work page 2008
-
[39]
S. Rhode and F. Gauterin, “Online estimation of vehicle driving resistance parameters with recursive least squares and recursive total least squares,” in2013 IEEE Intelligent Vehicles Symposium (IV), 2013, pp. 269–276
work page 2013
-
[40]
Gradient-based variable forgetting factor rls algorithm in time-varying environments,
S.-H. Leung and C. So, “Gradient-based variable forgetting factor rls algorithm in time-varying environments,”IEEE Transactions on Signal Processing, vol. 53, no. 8, pp. 3141–3150, 2005
work page 2005
-
[41]
The kernel recursive least-squares algorithm,
Y . Engel, S. Mannor, and R. Meir, “The kernel recursive least-squares algorithm,”IEEE Transactions on Signal Processing, vol. 52, no. 8, pp. 2275–2285, 2004
work page 2004
-
[42]
Adaptive adjustment of noise covariance in kalman filter for dynamic state estimation,
S. Akhlaghi, N. Zhou, and Z. Huang, “Adaptive adjustment of noise covariance in kalman filter for dynamic state estimation,” in2017 IEEE Power & Energy Society General Meeting, 2017, pp. 1–5
work page 2017
-
[43]
The Ensemble Kalman Filter: theoretical formulation and practical implementation,
G. Evensen, “The Ensemble Kalman Filter: theoretical formulation and practical implementation,”Ocean Dynamics, vol. 53, no. 4, pp. 343–367, Nov. 2003
work page 2003
-
[44]
A square-root kalman filter using only qr decompositions,
K. Tracy, “A square-root kalman filter using only qr decompositions,”
-
[45]
Available: https://arxiv.org/abs/2208.06452
[Online]. Available: https://arxiv.org/abs/2208.06452
-
[46]
The square-root unscented kalman filter for state and parameter-estimation,
R. Merwe and E. Wan, “The square-root unscented kalman filter for state and parameter-estimation,” in2001 IEEE International Confer- ence on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), vol. 6, 02 2001, pp. 3461 – 3464 vol.6
work page 2001
-
[47]
Survey of a class of iterative row-action methods: The kaczmarz method,
I. A. Ferreira, J. A. Acebr ´on, and J. Monteiro, “Survey of a class of iterative row-action methods: The kaczmarz method,” 2024. [Online]. Available: https://arxiv.org/abs/2401.02842
-
[48]
Randomized kaczmarz with tail averaging,
E. N. Epperly, G. Goldshlager, and R. J. Webber, “Randomized kaczmarz with tail averaging,” 2025. [Online]. Available: https: //arxiv.org/abs/2411.19877
-
[49]
Sayed,Kalman Filtering and RLS
A. Sayed,Kalman Filtering and RLS. John Wiley & Sons, Ltd, 2008, ch. 31, pp. 501–514. [Online]. Available: https: //onlinelibrary.wiley.com/doi/abs/10.1002/9780470374122.ch46
-
[50]
Bitcraze, “Crazyflie 2.1,” 2023. [Online]. Available: https://www. bitcraze.io/products/crazyflie-2-1/
work page 2023
-
[51]
Crazyswarm: A large nano-quadcopter swarm,
J. Preiss, W. Hoenig, G. Sukhatme, and N. Ayanian, “Crazyswarm: A large nano-quadcopter swarm,” in2017 IEEE International Conference on Robotics and Automation (ICRA), 05 2017, pp. 3299–3304
work page 2017
-
[52]
Crazychoir: Flying swarms of crazyflie quadrotors in ros 2,
L. Pichierri, A. Testa, and G. Notarstefano, “Crazychoir: Flying swarms of crazyflie quadrotors in ros 2,”IEEE Robotics and Automa- tion Letters, vol. 8, no. 8, pp. 4713–4720, 2023
work page 2023
-
[53]
Reinforcement learning- based control of crazyflie 2.x quadrotor,
A. Javeed, V . Jim ´enez, and J. Gr ¨onqvist, “Reinforcement learning- based control of crazyflie 2.x quadrotor,” 06 2023
work page 2023
-
[54]
Tinympc: Model-predictive control on resource-constrained microcontrollers,
K. Nguyen, S. Schoedel, A. Alavill, B. Plancher, and Z. Manch- ester, “Tinympc: Model-predictive control on resource-constrained microcontrollers,” inIEEE International Conference on Robotics and Automation (ICRA), Yokohama, Japan, May. 2024
work page 2024
- [55]
-
[56]
S. M. M, N. Naik, R. M. O. Gemson, and M. R. Ananthasayanam, “Introduction to the kalman filter and tuning its statistics for near optimal estimates and cramer rao bound,” 2015. [Online]. Available: https://arxiv.org/abs/1503.04313
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[57]
Cyclic adaptive matching pursuit,
A. Onose and B. Dumitrescu, “Cyclic adaptive matching pursuit,” in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012, pp. 3745–3748
work page 2012
- [58]
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.