Metric, inertially aligned monocular state estimation via kinetodynamic priors

Jiaqi Yang; Jiaxin Liu; Laurent Kneip; Liang Li; Min Li; Wanting Xu

arxiv: 2511.20496 · v3 · submitted 2025-11-25 · 💻 cs.RO

Metric, inertially aligned monocular state estimation via kinetodynamic priors

Jiaxin Liu , Min Li , Wanting Xu , Liang Li , Jiaqi Yang , Laurent Kneip This is my paper

Pith reviewed 2026-05-17 05:05 UTC · model grok-4.3

classification 💻 cs.RO

keywords monocular visual odometrynon-rigid state estimationkinetodynamic priorsdeformation-force modelB-spline trajectoriesmetric scale recoveryflexible robotic platformsgravity alignment

0 comments

The pith

A learned deformation-force model and B-spline trajectories let Newton's laws recover metric scale and gravity from monocular vision on non-rigid platforms.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a way to estimate full state for flexible robotic systems whose structures deform and break rigid-body assumptions. It learns the link between observed deformation and resulting forces with a multi-layer perceptron, then represents the platform's smooth motion with continuous-time B-splines. By repeatedly applying Newton's second law, the method ties the accelerations implied by visual trajectories to the accelerations predicted from the deformation model. This produces accurate pose estimates and, more importantly, resolves the classic ill-posed problems of metric scale and gravity direction that plague monocular visual odometry. A reader cares because the approach shows that platform physics itself can supply the missing inertial information without extra sensors.

Core claim

The central claim is that capturing elastic properties through a deformation-force model learned by a multi-layer perceptron, combined with continuous-time B-spline kinematic models, allows continuous application of Newton's second law to relate visually derived accelerations to deformation-induced accelerations. This formulation enables robust and accurate pose estimation on non-rigid platforms and permits recovery of inertial sensing properties, demonstrated on a spring-camera system that resolves metric scale and gravity in monocular visual odometry.

What carries the argument

A multi-layer perceptron that learns the deformation-to-force mapping, paired with continuous-time B-spline representations of motion, so that Newton's second law can be enforced between visual trajectory accelerations and predicted deformation accelerations.

If this is right

Pose estimation remains accurate and robust when the platform deforms dynamically rather than staying rigid.
Metric scale and gravity direction become recoverable from monocular vision alone once platform physics is modeled.
The typically ill-posed metric and alignment problems in monocular visual odometry are resolved by the kinetodynamic constraint.
Existing rigid-body state estimation techniques can be extended to non-rigid systems by adding the deformation model and B-spline components.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Flexible robots could rely on monocular cameras for navigation without dedicated inertial sensors if the deformation physics is learned in advance.
The same kinetodynamic prior idea might apply to other deformable settings such as soft robotics or camera systems mounted on compliant structures.
Controlled experiments that vary the stiffness of the spring or the amplitude of deformation would test how sensitive the scale and gravity recovery is to the accuracy of the learned force model.

Load-bearing premise

The multi-layer perceptron accurately predicts the accelerations produced by deformations, and the B-spline model represents the motion faithfully enough for Newton's second law to hold without large modeling error.

What would settle it

If experiments on the spring-camera system show that the recovered metric scale or gravity direction deviates substantially from independent measurements, or if pose estimation accuracy remains poor under deformation compared with rigid-body baselines, the central claim would be falsified.

Figures

Figures reproduced from arXiv: 2511.20496 by Jiaqi Yang, Jiaxin Liu, Laurent Kneip, Liang Li, Min Li, Wanting Xu.

**Figure 2.** Figure 2: Block diagram of this pipeline, mainly contains two part. [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 3.** Figure 3: Hardware setup. A typical non-rigid system usually [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

**Figure 4.** Figure 4: Optimization results for representative trajectories. For improved visualization and comparison, X-Z 2D projections [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗

read the original abstract

Accurate state estimation for flexible robotic systems poses significant challenges, particularly for platforms with dynamically deforming structures that invalidate rigid-body assumptions. This paper addresses this problem and enables the extension of existing rigid-body pose estimation methods to non-rigid systems. Our approach integrates two core components: first, we capture elastic properties using a deformation-force model, efficiently learned via a Multi-Layer Perceptron; second, we resolve the platform's inherently smooth motion using continuous-time B-spline kinematic models. By continuously applying Newton's Second Law, our method formulates the relationship between visually-derived trajectory acceleration and predicted deformation-induced acceleration. We demonstrate that our approach not only enables robust and accurate pose estimation on non-rigid platforms, but also shows that the properly modeled platform physics allow for the recovery of inertial sensing properties. We validate this feasibility on a simple spring-camera system, showing how it robustly resolves the typically ill-posed problem of metric scale and gravity recovery in monocular visual odometry.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper links an MLP deformation model to B-spline kinematics and continuous Newton's law to recover metric scale and gravity on a spring-camera rig, but the fidelity of spline-derived accelerations needs explicit checks.

read the letter

The main point is that this work shows how to bring platform physics into monocular state estimation for a deforming system so that scale and gravity become observable without extra sensors. They learn a deformation-force map with an MLP, represent the motion with continuous B-splines, and enforce F=ma at every point along the trajectory to tie visual accelerations to the predicted deformation effects. The spring-camera testbed then serves as a minimal case to check whether the recovered inertial quantities hold up.

Referee Report

2 major / 2 minor

Summary. The paper proposes integrating an MLP-learned deformation-force model with continuous-time B-spline kinematic representations to enforce Newton's Second Law on visually observed accelerations for non-rigid platforms. This is claimed to enable robust monocular pose estimation while recovering metric scale and gravity direction, demonstrated on a spring-camera rig as a solution to the ill-posed scale/gravity problem in visual odometry.

Significance. If the central derivation and validation hold, the work would provide a physics-grounded route to metric recovery and inertial alignment without external sensors or rigid-body assumptions, which is relevant for flexible robotic systems. The combination of learned deformation priors with spline-based continuous-time modeling is a plausible direction, though its practical impact depends on demonstrating that modeling errors do not propagate into the recovered quantities.

major comments (2)

[Abstract / Method] Abstract and method description: the central claim that Newton's Second Law can be applied to recover metric scale and gravity rests on equating B-spline-derived second derivatives to MLP-predicted deformation accelerations, yet no error bounds, sensitivity analysis to knot spacing, or polynomial degree are supplied; any mismatch can be absorbed into the recovered scale or gravity vector.
[Abstract / Validation] Validation section: the abstract asserts that the approach 'robustly resolves' the metric scale and gravity recovery problem on the spring-camera system, but the provided text contains no quantitative error metrics, baseline comparisons, ablation studies on MLP architecture, or analysis of how well the fitted model satisfies F=ma across trajectories.

minor comments (2)

[Method] Notation for the B-spline control points and MLP input/output dimensions is not introduced with sufficient clarity before the optimization objective is stated.
[Optimization] The manuscript would benefit from an explicit statement of the degrees of freedom in the joint optimization (B-spline coefficients plus MLP weights) and any regularization used to prevent trivial solutions.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive and detailed feedback on our manuscript. We address each major comment below and describe the revisions we will incorporate to strengthen the presentation of the method and its validation.

read point-by-point responses

Referee: [Abstract / Method] Abstract and method description: the central claim that Newton's Second Law can be applied to recover metric scale and gravity rests on equating B-spline-derived second derivatives to MLP-predicted deformation accelerations, yet no error bounds, sensitivity analysis to knot spacing, or polynomial degree are supplied; any mismatch can be absorbed into the recovered scale or gravity vector.

Authors: We agree that the manuscript would benefit from explicit discussion of these aspects. The continuous-time B-spline representation guarantees C2 continuity, so second derivatives are well-defined, and the joint optimization couples the kinematic parameters, scale, gravity, and MLP weights under the F=ma residual. Nevertheless, to directly address the concern about potential absorption of modeling error, we will add a dedicated sensitivity study in the revised version. This will include (i) analytic error propagation bounds from the spline fitting residuals to the recovered scale and gravity, (ii) empirical sweeps over knot spacing and polynomial degree, and (iii) plots demonstrating that the recovered inertial quantities remain stable within the operating range of the spring-camera rig. revision: yes
Referee: [Abstract / Validation] Validation section: the abstract asserts that the approach 'robustly resolves' the metric scale and gravity recovery problem on the spring-camera system, but the provided text contains no quantitative error metrics, baseline comparisons, ablation studies on MLP architecture, or analysis of how well the fitted model satisfies F=ma across trajectories.

Authors: The current draft emphasizes feasibility on the spring-camera platform, but we acknowledge that the validation section lacks the quantitative depth requested. In the revision we will expand the experimental section to report: absolute and relative errors on recovered metric scale and gravity direction; comparisons against standard monocular VO pipelines (e.g., ORB-SLAM3, DSO) run on the same sequences; ablations varying MLP depth/width and activation functions; and quantitative F=ma residual statistics (mean and variance of force imbalance) across all recorded trajectories. These additions will allow readers to assess both accuracy and physical consistency. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation grounded in independent physical law and separately learned model

full rationale

The paper learns an MLP deformation-force model from data and fits continuous-time B-splines to visual observations, then enforces Newton's Second Law to relate spline-derived accelerations to MLP-predicted deformation accelerations. This relationship is used to recover metric scale and gravity direction. Newton's law supplies external grounding independent of the fitted quantities, the MLP is trained separately rather than on the target estimation outputs, and no equations reduce the recovered scale/gravity to a renaming or re-use of the inputs by construction. No self-citation chains or uniqueness theorems are invoked in the provided description. The approach is self-contained against external physical priors.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

The central claim rests on a learned neural model for deformation and the direct applicability of Newton's Second Law to visually observed trajectories; these introduce fitted parameters and domain assumptions not supplied by prior literature.

free parameters (2)

MLP weights and biases for deformation-force model
Efficiently learned via Multi-Layer Perceptron to capture elastic properties
B-spline coefficients or knot placements
Used to represent continuous-time kinematic models of smooth motion

axioms (2)

domain assumption Newton's Second Law can be continuously applied to relate visually-derived trajectory acceleration and deformation-induced acceleration
Invoked to formulate the relationship between observed and predicted accelerations
domain assumption The platform motion is inherently smooth and well-represented by continuous-time B-spline models
Used to resolve the platform's motion prior to applying physics constraints

pith-pipeline@v0.9.0 · 5475 in / 1574 out tokens · 51868 ms · 2026-05-17T05:05:17.346264+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

By continuously applying Newton’s Second Law, our method establishes a physical link between visually-derived trajectory acceleration and predicted deformation-induced acceleration... aaa_opt_ci = T_opt_ci · N((T_opt_bi)^{-1} · T_opt_ci) + g_opt

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

37 extracted references · 37 canonical work pages · 1 internal anchor

[1]

A survey on soft robot adaptability: Implementations, applications, and prospects [survey],

Z. Chen, D. Wu, Q. Guan, D. Hardman, F. Renda, J. Hughes, T. G. Thuruthel, C. Della Santina, B. Mazzolai, H. Zhao,et al., “A survey on soft robot adaptability: Implementations, applications, and prospects [survey],”IEEE Robotics & Automation Magazine, 2025

work page 2025
[2]

Learning soft robot and soft actuator dynamics using deep neural network,

H. P. Thanabalan, “Learning soft robot and soft actuator dynamics using deep neural network,” Master’s thesis, Queen Mary University of London (United Kingdom), 2020

work page 2020
[3]

Sequential non-rigid structure from motion using physical priors,

A. Agudo, F. Moreno-Noguer, B. Calvo, and J. M. M. Montiel, “Sequential non-rigid structure from motion using physical priors,” IEEE transactions on pattern analysis and machine intelligence, vol. 38, no. 5, pp. 979–994, 2015

work page 2015
[4]

Probabilistic modeling and bayesian filtering for improved state estimation for soft robots,

D. Kim, M. Park, and Y .-L. Park, “Probabilistic modeling and bayesian filtering for improved state estimation for soft robots,”IEEE Transac- tions on Robotics, vol. 37, no. 5, pp. 1728–1741, 2021

work page 2021
[5]

Monocular vo scale ambiguity resolution using an ultra low-cost spike rangefinder,

A. El Amin, A. El-Rabbany,et al., “Monocular vo scale ambiguity resolution using an ultra low-cost spike rangefinder,”Positioning, vol. 11, no. 04, p. 45, 2020

work page 2020
[6]

Relative pose for nonrigid multi- perspective cameras: The static case,

M. Li, J. Yang, and L. Kneip, “Relative pose for nonrigid multi- perspective cameras: The static case,” in2024 International Confer- ence on 3D Vision (3DV). IEEE, 2024, pp. 96–105

work page 2024
[7]

Zebedee: Design of a spring- mounted 3-d range sensor with application to mobile mapping,

M. Bosse, R. Zlot, and P. Flick, “Zebedee: Design of a spring- mounted 3-d range sensor with application to mobile mapping,”IEEE Transactions on Robotics, vol. 28, no. 5, pp. 1104–1119, 2012

work page 2012
[8]

Three-dimensional mobile mapping of caves

R. Zlot and M. Bosse, “Three-dimensional mobile mapping of caves.” Journal of Cave & Karst Studies, vol. 76, no. 3, 2014

work page 2014
[9]

Continuous perception for deformable objects understand- ing,

L. Mart ´ınez, J. Ruiz-del Solar, L. Sun, J. P. Siebert, and G. Aragon- Camarasa, “Continuous perception for deformable objects understand- ing,”Robotics and Autonomous Systems, vol. 118, pp. 220–230, 2019

work page 2019
[10]

Mlp based continuous gait recognition of a powered ankle prosthesis with serial elastic actuator,

Y . Li, F. Chen, J. Cao, R. Zhao, X. Yang, X. Yang, and Y . Fan, “Mlp based continuous gait recognition of a powered ankle prosthesis with serial elastic actuator,”arXiv preprint arXiv:2309.08323, 2023

work page arXiv 2023
[11]

Learning physics-informed simulation models for soft robotic manipulation: A case study with dielectric elastomer actuators,

M. Lahariya, C. Innes, C. Develder, and S. Ramamoorthy, “Learning physics-informed simulation models for soft robotic manipulation: A case study with dielectric elastomer actuators,” in2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2022, pp. 11 031–11 038

work page 2022
[12]

Accurate and robust scale recovery for monocular visual odometry based on plane geometry,

R. Tian, Y . Zhang, D. Zhu, S. Liang, S. Coleman, and D. Kerr, “Accurate and robust scale recovery for monocular visual odometry based on plane geometry,” in2021 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2021, pp. 5296–5302

work page 2021
[13]

High compliant series elastic actuation for the robotic leg scarl eth,

M. Hutter, C. D. Remy, M. A. Hoepflinger, and R. Siegwart, “High compliant series elastic actuation for the robotic leg scarl eth,” inField robotics. World Scientific, 2012, pp. 507–514

work page 2012
[14]

Flexible stereo: con- strained, non-rigid, wide-baseline stereo vision for fixed-wing aerial platforms,

T. Hinzmann, T. Taubner, and R. Siegwart, “Flexible stereo: con- strained, non-rigid, wide-baseline stereo vision for fixed-wing aerial platforms,” in2018 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2018, pp. 2550–2557

work page 2018
[15]

Flexible trinocular: Non-rigid multi-camera-imu dense reconstruction for uav navigation and mapping,

T. Hinzmann, C. Cadena, J. Nieto, and R. Siegwart, “Flexible trinocular: Non-rigid multi-camera-imu dense reconstruction for uav navigation and mapping,” in2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2019, pp. 1137– 1142

work page 2019
[16]

Agilicious: Open- source and open-hardware agile quadrotor for vision-based flight,

P. Foehn, E. Kaufmann, A. Romero, R. Penicka, S. Sun, L. Bauersfeld, T. Laengle, G. Cioffi, Y . Song, A. Loquercio,et al., “Agilicious: Open- source and open-hardware agile quadrotor for vision-based flight,” Science robotics, vol. 7, no. 67, p. eabl6259, 2022

work page 2022
[17]

A survey of snake- inspired robot designs,

J. K. Hopkins, B. W. Spranklin, and S. K. Gupta, “A survey of snake- inspired robot designs,”Bioinspiration & biomimetics, vol. 4, no. 2, p. 021001, 2009

work page 2009
[18]

Articulated multi-perspective cameras and their application to truck motion estimation,

X. Peng, J. Cui, and L. Kneip, “Articulated multi-perspective cameras and their application to truck motion estimation,” in2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2019, pp. 2052–2059

work page 2019
[19]

Orb-slam: A versatile and accurate monocular slam system,

R. Mur-Artal, J. M. M. Montiel, and J. D. Tardos, “Orb-slam: A versatile and accurate monocular slam system,”IEEE transactions on robotics, vol. 31, no. 5, pp. 1147–1163, 2015

work page 2015
[20]

Structure-from-motion revisited,

J. L. Sch ¨onberger and J.-M. Frahm, “Structure-from-motion revisited,” inConference on Computer Vision and Pattern Recognition (CVPR), 2016

work page 2016
[21]

Vins-mono: A robust and versatile monoc- ular visual-inertial state estimator,

T. Qin, P. Li, and S. Shen, “Vins-mono: A robust and versatile monoc- ular visual-inertial state estimator,”IEEE transactions on robotics, vol. 34, no. 4, pp. 1004–1020, 2018

work page 2018
[22]

maplab: An Open Framework for Research in Visual-inertial Mapping and Localization,

T. Schneider, M. T. Dymczyk, M. Fehr, K. Egger, S. Lynen, I. Gilitschenski, and R. Siegwart, “maplab: An Open Framework for Research in Visual-inertial Mapping and Localization,”IEEE Robotics and Automation Letters, vol. 3, no. 3, pp. 1418–1425, 2018

work page 2018
[23]

Lio-sam: Tightly-coupled lidar inertial odometry via smoothing and mapping,

T. Shan, B. Englot, D. Meyers, W. Wang, C. Ratti, and D. Rus, “Lio-sam: Tightly-coupled lidar inertial odometry via smoothing and mapping,” in2020 IEEE/RSJ international conference on intelligent robots and systems (IROS). IEEE, 2020, pp. 5135–5142

work page 2020
[24]

Gvins: Tightly coupled gnss–visual– inertial fusion for smooth and consistent state estimation,

S. Cao, X. Lu, and S. Shen, “Gvins: Tightly coupled gnss–visual– inertial fusion for smooth and consistent state estimation,”IEEE Transactions on Robotics, vol. 38, no. 4, pp. 2004–2021, 2022

work page 2004
[25]

Mast3r-slam: Real- time dense slam with 3d reconstruction priors,

R. Murai, E. Dexheimer, and A. J. Davison, “Mast3r-slam: Real- time dense slam with 3d reconstruction priors,” inProceedings of the Computer Vision and Pattern Recognition Conference, 2025, pp. 16 695–16 705

work page 2025
[26]

VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold

D. Maggio, H. Lim, and L. Carlone, “Vggt-slam: Dense rgb slam optimized on the sl (4) manifold,”arXiv preprint arXiv:2505.12549, 2025

work page internal anchor Pith review arXiv 2025
[27]

Robust control of a silicone soft robot using neural networks,

G. Zheng, Y . Zhou, and M. Ju, “Robust control of a silicone soft robot using neural networks,”ISA transactions, vol. 100, pp. 38–45, 2020

work page 2020
[28]

Pinn-ray: A physics-informed neural network to model soft robotic fin ray fingers,

X. Wang, J. J. Dabrowski, J. Pinskier, L. Liow, V . Viswanathan, R. Scalzo, and D. Howard, “Pinn-ray: A physics-informed neural network to model soft robotic fin ray fingers,” in2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2024, pp. 247–254

work page 2024
[29]

M. I. Friswell,Dynamics of rotating machines. Cambridge university press, 2010

work page 2010
[30]

Learning to solve hard minimal problems,

P. Hruby, T. Duff, A. Leykin, and T. Pajdla, “Learning to solve hard minimal problems,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 5532–5542

work page 2022
[31]

L6dnet: Light 6 dof network for robust and precise object pose estimation with small datasets,

M. Gonzalez, A. Kacete, A. Murienne, and E. Marchand, “L6dnet: Light 6 dof network for robust and precise object pose estimation with small datasets,”IEEE Robotics and Automation Letters, vol. 6, no. 2, pp. 2914–2921, 2021

work page 2021
[32]

Survey and research challenges in monocular visual odometry,

A. Neyestani, F. Picariello, A. Basiri, P. Daponte, and L. De Vito, “Survey and research challenges in monocular visual odometry,” in 2023 IEEE International Workshop on Metrology for Living Environ- ment (MetroLivEnv). IEEE, 2023, pp. 107–112

work page 2023
[33]

Efficient derivative computation for cumulative b-splines on lie groups,

C. Sommer, V . Usenko, D. Schubert, N. Demmel, and D. Cremers, “Efficient derivative computation for cumulative b-splines on lie groups,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 11 148–11 156

work page 2020
[34]

Square root marginalization for sliding-window bundle adjustment,

N. Demmel, D. Schubert, C. Sommer, D. Cremers, and V . Usenko, “Square root marginalization for sliding-window bundle adjustment,” inProceedings of the IEEE/CVF International Conference on Com- puter Vision, 2021, pp. 13 260–13 268

work page 2021
[35]

Structure-from-motion revisited,

J. L. Schonberger and J.-M. Frahm, “Structure-from-motion revisited,” inProceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 4104–4113

work page 2016
[36]

Ceres Solver,

S. Agarwal, K. Mierle, and T. C. S. Team, “Ceres Solver,” 10 2023. [Online]. Available: https://github.com/ceres-solver/ceres-solver

work page 2023
[37]

evo: Python package for the evaluation of odometry and slam

M. Grupp, “evo: Python package for the evaluation of odometry and slam.” https://github.com/MichaelGrupp/evo, 2017

work page 2017

[1] [1]

A survey on soft robot adaptability: Implementations, applications, and prospects [survey],

Z. Chen, D. Wu, Q. Guan, D. Hardman, F. Renda, J. Hughes, T. G. Thuruthel, C. Della Santina, B. Mazzolai, H. Zhao,et al., “A survey on soft robot adaptability: Implementations, applications, and prospects [survey],”IEEE Robotics & Automation Magazine, 2025

work page 2025

[2] [2]

Learning soft robot and soft actuator dynamics using deep neural network,

H. P. Thanabalan, “Learning soft robot and soft actuator dynamics using deep neural network,” Master’s thesis, Queen Mary University of London (United Kingdom), 2020

work page 2020

[3] [3]

Sequential non-rigid structure from motion using physical priors,

A. Agudo, F. Moreno-Noguer, B. Calvo, and J. M. M. Montiel, “Sequential non-rigid structure from motion using physical priors,” IEEE transactions on pattern analysis and machine intelligence, vol. 38, no. 5, pp. 979–994, 2015

work page 2015

[4] [4]

Probabilistic modeling and bayesian filtering for improved state estimation for soft robots,

D. Kim, M. Park, and Y .-L. Park, “Probabilistic modeling and bayesian filtering for improved state estimation for soft robots,”IEEE Transac- tions on Robotics, vol. 37, no. 5, pp. 1728–1741, 2021

work page 2021

[5] [5]

Monocular vo scale ambiguity resolution using an ultra low-cost spike rangefinder,

A. El Amin, A. El-Rabbany,et al., “Monocular vo scale ambiguity resolution using an ultra low-cost spike rangefinder,”Positioning, vol. 11, no. 04, p. 45, 2020

work page 2020

[6] [6]

Relative pose for nonrigid multi- perspective cameras: The static case,

M. Li, J. Yang, and L. Kneip, “Relative pose for nonrigid multi- perspective cameras: The static case,” in2024 International Confer- ence on 3D Vision (3DV). IEEE, 2024, pp. 96–105

work page 2024

[7] [7]

Zebedee: Design of a spring- mounted 3-d range sensor with application to mobile mapping,

M. Bosse, R. Zlot, and P. Flick, “Zebedee: Design of a spring- mounted 3-d range sensor with application to mobile mapping,”IEEE Transactions on Robotics, vol. 28, no. 5, pp. 1104–1119, 2012

work page 2012

[8] [8]

Three-dimensional mobile mapping of caves

R. Zlot and M. Bosse, “Three-dimensional mobile mapping of caves.” Journal of Cave & Karst Studies, vol. 76, no. 3, 2014

work page 2014

[9] [9]

Continuous perception for deformable objects understand- ing,

L. Mart ´ınez, J. Ruiz-del Solar, L. Sun, J. P. Siebert, and G. Aragon- Camarasa, “Continuous perception for deformable objects understand- ing,”Robotics and Autonomous Systems, vol. 118, pp. 220–230, 2019

work page 2019

[10] [10]

Mlp based continuous gait recognition of a powered ankle prosthesis with serial elastic actuator,

Y . Li, F. Chen, J. Cao, R. Zhao, X. Yang, X. Yang, and Y . Fan, “Mlp based continuous gait recognition of a powered ankle prosthesis with serial elastic actuator,”arXiv preprint arXiv:2309.08323, 2023

work page arXiv 2023

[11] [11]

Learning physics-informed simulation models for soft robotic manipulation: A case study with dielectric elastomer actuators,

M. Lahariya, C. Innes, C. Develder, and S. Ramamoorthy, “Learning physics-informed simulation models for soft robotic manipulation: A case study with dielectric elastomer actuators,” in2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2022, pp. 11 031–11 038

work page 2022

[12] [12]

Accurate and robust scale recovery for monocular visual odometry based on plane geometry,

R. Tian, Y . Zhang, D. Zhu, S. Liang, S. Coleman, and D. Kerr, “Accurate and robust scale recovery for monocular visual odometry based on plane geometry,” in2021 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2021, pp. 5296–5302

work page 2021

[13] [13]

High compliant series elastic actuation for the robotic leg scarl eth,

M. Hutter, C. D. Remy, M. A. Hoepflinger, and R. Siegwart, “High compliant series elastic actuation for the robotic leg scarl eth,” inField robotics. World Scientific, 2012, pp. 507–514

work page 2012

[14] [14]

Flexible stereo: con- strained, non-rigid, wide-baseline stereo vision for fixed-wing aerial platforms,

T. Hinzmann, T. Taubner, and R. Siegwart, “Flexible stereo: con- strained, non-rigid, wide-baseline stereo vision for fixed-wing aerial platforms,” in2018 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2018, pp. 2550–2557

work page 2018

[15] [15]

Flexible trinocular: Non-rigid multi-camera-imu dense reconstruction for uav navigation and mapping,

T. Hinzmann, C. Cadena, J. Nieto, and R. Siegwart, “Flexible trinocular: Non-rigid multi-camera-imu dense reconstruction for uav navigation and mapping,” in2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2019, pp. 1137– 1142

work page 2019

[16] [16]

Agilicious: Open- source and open-hardware agile quadrotor for vision-based flight,

P. Foehn, E. Kaufmann, A. Romero, R. Penicka, S. Sun, L. Bauersfeld, T. Laengle, G. Cioffi, Y . Song, A. Loquercio,et al., “Agilicious: Open- source and open-hardware agile quadrotor for vision-based flight,” Science robotics, vol. 7, no. 67, p. eabl6259, 2022

work page 2022

[17] [17]

A survey of snake- inspired robot designs,

J. K. Hopkins, B. W. Spranklin, and S. K. Gupta, “A survey of snake- inspired robot designs,”Bioinspiration & biomimetics, vol. 4, no. 2, p. 021001, 2009

work page 2009

[18] [18]

Articulated multi-perspective cameras and their application to truck motion estimation,

X. Peng, J. Cui, and L. Kneip, “Articulated multi-perspective cameras and their application to truck motion estimation,” in2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2019, pp. 2052–2059

work page 2019

[19] [19]

Orb-slam: A versatile and accurate monocular slam system,

R. Mur-Artal, J. M. M. Montiel, and J. D. Tardos, “Orb-slam: A versatile and accurate monocular slam system,”IEEE transactions on robotics, vol. 31, no. 5, pp. 1147–1163, 2015

work page 2015

[20] [20]

Structure-from-motion revisited,

J. L. Sch ¨onberger and J.-M. Frahm, “Structure-from-motion revisited,” inConference on Computer Vision and Pattern Recognition (CVPR), 2016

work page 2016

[21] [21]

Vins-mono: A robust and versatile monoc- ular visual-inertial state estimator,

T. Qin, P. Li, and S. Shen, “Vins-mono: A robust and versatile monoc- ular visual-inertial state estimator,”IEEE transactions on robotics, vol. 34, no. 4, pp. 1004–1020, 2018

work page 2018

[22] [22]

maplab: An Open Framework for Research in Visual-inertial Mapping and Localization,

T. Schneider, M. T. Dymczyk, M. Fehr, K. Egger, S. Lynen, I. Gilitschenski, and R. Siegwart, “maplab: An Open Framework for Research in Visual-inertial Mapping and Localization,”IEEE Robotics and Automation Letters, vol. 3, no. 3, pp. 1418–1425, 2018

work page 2018

[23] [23]

Lio-sam: Tightly-coupled lidar inertial odometry via smoothing and mapping,

T. Shan, B. Englot, D. Meyers, W. Wang, C. Ratti, and D. Rus, “Lio-sam: Tightly-coupled lidar inertial odometry via smoothing and mapping,” in2020 IEEE/RSJ international conference on intelligent robots and systems (IROS). IEEE, 2020, pp. 5135–5142

work page 2020

[24] [24]

Gvins: Tightly coupled gnss–visual– inertial fusion for smooth and consistent state estimation,

S. Cao, X. Lu, and S. Shen, “Gvins: Tightly coupled gnss–visual– inertial fusion for smooth and consistent state estimation,”IEEE Transactions on Robotics, vol. 38, no. 4, pp. 2004–2021, 2022

work page 2004

[25] [25]

Mast3r-slam: Real- time dense slam with 3d reconstruction priors,

R. Murai, E. Dexheimer, and A. J. Davison, “Mast3r-slam: Real- time dense slam with 3d reconstruction priors,” inProceedings of the Computer Vision and Pattern Recognition Conference, 2025, pp. 16 695–16 705

work page 2025

[26] [26]

VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold

D. Maggio, H. Lim, and L. Carlone, “Vggt-slam: Dense rgb slam optimized on the sl (4) manifold,”arXiv preprint arXiv:2505.12549, 2025

work page internal anchor Pith review arXiv 2025

[27] [27]

Robust control of a silicone soft robot using neural networks,

G. Zheng, Y . Zhou, and M. Ju, “Robust control of a silicone soft robot using neural networks,”ISA transactions, vol. 100, pp. 38–45, 2020

work page 2020

[28] [28]

Pinn-ray: A physics-informed neural network to model soft robotic fin ray fingers,

X. Wang, J. J. Dabrowski, J. Pinskier, L. Liow, V . Viswanathan, R. Scalzo, and D. Howard, “Pinn-ray: A physics-informed neural network to model soft robotic fin ray fingers,” in2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2024, pp. 247–254

work page 2024

[29] [29]

M. I. Friswell,Dynamics of rotating machines. Cambridge university press, 2010

work page 2010

[30] [30]

Learning to solve hard minimal problems,

P. Hruby, T. Duff, A. Leykin, and T. Pajdla, “Learning to solve hard minimal problems,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 5532–5542

work page 2022

[31] [31]

L6dnet: Light 6 dof network for robust and precise object pose estimation with small datasets,

M. Gonzalez, A. Kacete, A. Murienne, and E. Marchand, “L6dnet: Light 6 dof network for robust and precise object pose estimation with small datasets,”IEEE Robotics and Automation Letters, vol. 6, no. 2, pp. 2914–2921, 2021

work page 2021

[32] [32]

Survey and research challenges in monocular visual odometry,

A. Neyestani, F. Picariello, A. Basiri, P. Daponte, and L. De Vito, “Survey and research challenges in monocular visual odometry,” in 2023 IEEE International Workshop on Metrology for Living Environ- ment (MetroLivEnv). IEEE, 2023, pp. 107–112

work page 2023

[33] [33]

Efficient derivative computation for cumulative b-splines on lie groups,

C. Sommer, V . Usenko, D. Schubert, N. Demmel, and D. Cremers, “Efficient derivative computation for cumulative b-splines on lie groups,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 11 148–11 156

work page 2020

[34] [34]

Square root marginalization for sliding-window bundle adjustment,

N. Demmel, D. Schubert, C. Sommer, D. Cremers, and V . Usenko, “Square root marginalization for sliding-window bundle adjustment,” inProceedings of the IEEE/CVF International Conference on Com- puter Vision, 2021, pp. 13 260–13 268

work page 2021

[35] [35]

Structure-from-motion revisited,

J. L. Schonberger and J.-M. Frahm, “Structure-from-motion revisited,” inProceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 4104–4113

work page 2016

[36] [36]

Ceres Solver,

S. Agarwal, K. Mierle, and T. C. S. Team, “Ceres Solver,” 10 2023. [Online]. Available: https://github.com/ceres-solver/ceres-solver

work page 2023

[37] [37]

evo: Python package for the evaluation of odometry and slam

M. Grupp, “evo: Python package for the evaluation of odometry and slam.” https://github.com/MichaelGrupp/evo, 2017

work page 2017