Learning partially observed systems with neural Hamiltonian ordinary differential equations
Pith reviewed 2026-05-25 05:06 UTC · model grok-4.3
The pith
Neural Hamiltonian ODEs learn partially observed dynamical systems by enforcing energy conservation.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
NHODE merges Hamiltonian neural networks with neural ODEs so that the learned vector field respects energy conservation even when only a subset of coordinates is observed. The training loss is defined solely on the visible variables; the Hamiltonian formulation plus optional symmetry transformations and separable energy terms then determine the evolution of the unobserved variables. On linear and nonlinear oscillators as well as the three-body problem, models with more embedded structure produce lower error and remain stable far beyond the training horizon, while unstructured neural ODEs become unstable.
What carries the argument
The NHODE model, which uses a neural network to parameterize a Hamiltonian whose gradients define the ODE right-hand side, combined with a flexible neural-ODE integrator that permits supervision only on observed coordinates.
If this is right
- Accuracy and long-horizon stability increase as more physical structure (energy conservation, symmetries, separability) is embedded.
- Latent variables can be reconstructed without any direct supervision on them.
- The same framework remains stable on chaotic systems where purely data-driven models diverge.
- Training remains possible when the loss is restricted to the observed subset of the state.
Where Pith is reading between the lines
- The same inductive bias could be tested on systems that conserve quantities other than energy, such as angular momentum.
- Real sensor data with measurement noise would provide a practical test of whether the learned Hamiltonian remains identifiable.
- The approach may transfer to control tasks where only partial state feedback is available.
- Adding dissipative terms or time-dependent Hamiltonians would be a direct next extension.
Load-bearing premise
The underlying system must obey Hamiltonian mechanics whose total energy can be recovered from the observed variables alone.
What would settle it
A direct comparison in which an unstructured neural ODE matches or exceeds NHODE long-horizon accuracy on the three-body problem after identical training would falsify the claimed benefit of the Hamiltonian constraint.
Figures
read the original abstract
When learning dynamical systems from data, embedding physical structure can constrain the solution space and improve generalization, but many physics-informed models assume access to the full system state. This limits their use in partially observed settings, where some state variables are completely unobserved and must be inferred without direct supervision. Here, we present neural Hamiltonian ordinary differential equations (NHODE), a framework that combines Hamiltonian neural networks (HNNs) with neural ordinary differential equations (neural ODEs) to learn partially observed dynamical systems from data. The Hamiltonian structure enforces energy conservation by construction, while the neural ODE framework enables a flexible training procedure that allows the loss to be defined only on observed variables. We also incorporate additional physical constraints through symmetry-aware coordinate transformations and separable energy formulations. The framework is evaluated on systems of increasing complexity, from linear and nonlinear mass-spring systems to the chaotic three-body problem. Across all examples, increasing the amount of embedded physical structure improves the accuracy and long-horizon stability of the predictions. Even in the most challenging regimes, the NHODE framework captures both observed and latent dynamics, whereas purely data-driven baselines become unstable.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces neural Hamiltonian ordinary differential equations (NHODE), which integrate Hamiltonian neural networks with neural ODEs to learn partially observed dynamical systems. The Hamiltonian structure enforces energy conservation by construction, while the neural ODE formulation permits the training loss to be defined exclusively on observed variables. Additional inductive biases are incorporated via symmetry-aware coordinate transformations and separable energy formulations. The method is evaluated on systems of increasing complexity, including linear and nonlinear mass-spring oscillators and the chaotic three-body problem, with the central claim that greater embedding of physical structure yields improved accuracy and long-horizon stability, and that NHODE recovers both observed and latent dynamics where purely data-driven baselines become unstable.
Significance. If the empirical results are robust, the work provides a principled route to physics-informed learning under partial observability, a setting that arises frequently in applications such as robotics and experimental physics. The explicit combination of conservation-law constraints with flexible neural-ODE training on incomplete data is a clear technical contribution. The evaluation across a progression of Hamiltonian systems, including a chaotic example, supplies a useful test bed for assessing long-term stability.
minor comments (3)
- [Abstract] Abstract: the summary statement would be strengthened by the inclusion of at least one quantitative performance metric (e.g., long-horizon prediction error or stability horizon) together with a brief description of the baselines and training protocol.
- [§3] The manuscript should clarify in §3 or §4 whether the latent-state inference is performed by augmenting the observed state with additional neural-ODE variables or by an explicit encoder; the current description leaves this architectural choice ambiguous.
- [Figures 4–6] Figure captions for the three-body experiments should report the integration horizon and the precise definition of “stability” used to generate the reported trajectories.
Simulated Author's Rebuttal
We thank the referee for the positive assessment of our manuscript, the recognition of its technical contribution, and the recommendation for minor revision. The report does not list any specific major comments.
Circularity Check
No significant circularity
full rationale
The provided abstract and context describe NHODE as combining HNNs (energy conservation by construction) with neural ODEs for training on observed variables only, with added symmetry constraints. No equations, derivations, or self-citations are exhibited that reduce any prediction or result to fitted inputs or prior author work by construction. The framework uses standard optimization with physical inductive biases; the central claims remain independent of any internal reduction and are self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
free parameters (1)
- Neural network parameters
axioms (1)
- domain assumption The system is Hamiltonian with conserved total energy
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.lean; IndisputableMonolith/Foundation/BranchSelection.lean; IndisputableMonolith/Foundation/AlphaCoordinateFixation.leanwashburn_uniqueness_aczel; branch_selection; alpha_pin_under_high_calibration echoes?
echoesECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.
The Hamiltonian structure enforces energy conservation by construction... NHODEpot... only the potential energy is learned... T(p)=p^{2}/2m... translational symmetry... pairwise distances... conservation of linear and angular momentum
-
IndisputableMonolith/Foundation/AlexanderDuality.leanalexander_duality_circle_linking echoes?
echoesECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.
three-body problem... chaotic... NHODEpot,rel... only model variant that consistently produce non-diverging trajectories... conserves energy, linear momentum, and angular momentum
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Greydanus, S., Dzamba, M. & Yosinski, J. “Hamiltonian Neural Networks”. In:Advances in Neural Information Processing Systems. Ed. by H. Wallach et al. Vol. 32. Curran Associates, Inc., 2019.url:https : / / proceedings . neurips . cc / paper _ files / paper / 2019 / file / 26cd8ecadce0d4efd6cc8a8725cbd1f8-Paper.pdf
work page 2019
-
[2]
Neural Ordinary Differ- ential Equations
Chen, R. T. Q., Rubanova, Y., Bettencourt, J. & Duvenaud, D. K. “Neural Ordinary Differ- ential Equations”. In:Advances in Neural Information Processing Systems. Ed. by S. Bengio et al. Vol. 31. Curran Associates, Inc., 2018.url:https : / / proceedings . neurips . cc / paper_files/paper/2018/file/69386f6bb1dfed68692a24c8686939b9-Paper.pdf
work page 2018
-
[3]
A Proposal on Machine Learning via Dynamical Systems
E, W. “A Proposal on Machine Learning via Dynamical Systems”. In:Communications in Mathematics and Statistics5.1 (Mar. 2017), pp. 1–11.doi:10.1007/s40304-017-0103-z
-
[4]
Stable architectures for deep neural networks
Haber, E. & Ruthotto, L. “Stable architectures for deep neural networks”. In:Inverse Prob- lems34.1 (Dec. 2017), p. 014004.doi:10.1088/1361-6420/aa9a90
-
[5]
A Neural Network Approach for Identification of Continuous- Time Nonlinear Dynamic Systems
Chu, S. R. & Shoureshi, R. “A Neural Network Approach for Identification of Continuous- Time Nonlinear Dynamic Systems”. In:1991 American Control Conference. 1991, pp. 1–5. doi:10.23919/ACC.1991.4791308
-
[6]
Discrete- Vs. Continuous-Time Nonlinear Signal Processing of Cu Electrodissolution Data
Rico-Mart´ ınez, R., Krischer, K., Kevrekidis, I. G., Kube, M. C. & Hudson, J. L. “Discrete- Vs. Continuous-Time Nonlinear Signal Processing of Cu Electrodissolution Data”. In:Chemical Engineering Communications118.1 (1992), pp. 25–48.doi:10.1080/00986449208936084
-
[7]
On Neural Differential Equations
Kidger, P. “On Neural Differential Equations”. PhD thesis. University of Oxford, 2021. arXiv: 2202.02435
-
[8]
Which priors matter? Benchmarking models for learning latent dynamics
Botev, A. et al. “Which priors matter? Benchmarking models for learning latent dynamics”. In:Thirty-fifth Conference on Neural Information Processing Systems Datasets and Bench- marks Track. 2021. arXiv:2111.05458. 22
-
[9]
SNODE: Spectral Discretization of Neural ODEs for System Identification
Quaglino, A., Gallieri, M., Masci, J. & Koutn´ ık, J. “SNODE: Spectral Discretization of Neural ODEs for System Identification”. In:International Conference on Learning Representations
-
[10]
LocalEigenmotionControlforNearRectilinearHaloOrbits,
Rahman, A., Drgoˇ na, J., Tuor, A. & Strube, J. “Neural Ordinary Differential Equations for Nonlinear System Identification”. In:2022 American Control Conference (ACC). 2022, pp. 3979–3984.doi:10.23919/ACC53348.2022.9867586
-
[11]
& Yaguchi, T.FINDE: Neural Differential Equations for Finding and Preserv- ing Invariant Quantities
Matsubara, T. & Yaguchi, T.FINDE: Neural Differential Equations for Finding and Preserv- ing Invariant Quantities. Mar. 2023. arXiv:2210.00272
-
[12]
et al.Universal Differential Equations for Scientific Machine Learning
Rackauckas, C. et al.Universal Differential Equations for Scientific Machine Learning. Nov
-
[13]
arXiv:2001.04385 [cs.LG]
work page internal anchor Pith review Pith/arXiv arXiv 2001
-
[14]
et al.DiffEqFlux.jl - A Julia Library for Neural Differential Equations
Rackauckas, C. et al.DiffEqFlux.jl - A Julia Library for Neural Differential Equations. Feb
-
[15]
arXiv:1902.02376 [cs.LG]
work page internal anchor Pith review Pith/arXiv arXiv 1902
-
[16]
et al.Bayesian Neural Ordinary Differential Equations
Dandekar, R. et al.Bayesian Neural Ordinary Differential Equations. Feb. 2022. arXiv:2012. 07244
work page 2022
-
[17]
On learning Hamiltonian systems from data
Bertalan, T., Dietrich, F., Mezi´ c, I. & Kevrekidis, I. G. “On learning Hamiltonian systems from data”. In:Chaos: An Interdisciplinary Journal of Nonlinear Science29.12 (Dec. 2019), p. 121107.doi:10.1063/1.5128231
-
[18]
Symplectic Recurrent Neural Networks
Chen, Z., Zhang, J., Arjovsky, M. & Bottou, L. “Symplectic Recurrent Neural Networks”. In: International Conference on Learning Representations. 2020. arXiv:1909.13334
-
[19]
Symplectic learning for Hamiltonian neural networks
David, M. & M´ ehats, F. “Symplectic learning for Hamiltonian neural networks”. In:Journal of Computational Physics494 (Dec. 2023), p. 112495.doi:10.1016/j.jcp.2023.112495
-
[20]
Pseudo-Hamiltonian neural networks with state-dependent external forces
Eidnes, S., Stasik, A. J., Sterud, C., Bøhn, E. & Riemer-Sørensen, S. “Pseudo-Hamiltonian neural networks with state-dependent external forces”. In:Physica D: Nonlinear Phenomena 446 (Apr. 2023), p. 133673.doi:10.1016/j.physd.2023.133673
-
[21]
Learning dynamical systems from noisy data with inverse-explicit integrators
Celledoni, E., Eidnes, S. & Myhr, H. N. “Learning dynamical systems from noisy data with inverse-explicit integrators”. In:Physica D: Nonlinear Phenomena472 (Feb. 2025), p. 134471. doi:10.1016/j.physd.2024.134471
-
[22]
& Tang, Y.Deep Hamiltonian networks based on symplectic integrators
Zhu, A., Jin, P. & Tang, Y.Deep Hamiltonian networks based on symplectic integrators. Apr
- [23]
-
[24]
Symplectic integration of learned Hamiltonian systems
Offen, C. & Ober-Bl¨ obaum, S. “Symplectic integration of learned Hamiltonian systems”. In: Chaos: An Interdisciplinary Journal of Nonlinear Science32.1 (Jan. 2022), p. 013122.doi: 10.1063/5.0065913
-
[25]
Hamiltonian Generative Networks
Toth, P. et al. “Hamiltonian Generative Networks”. In:International Conference on Learning Representations. Feb. 2020. arXiv:1909.13789
-
[26]
Learning governing equations of unobserved states in dynamical systems
Grigorian, G., George, S. V. & Arridge, S. “Learning governing equations of unobserved states in dynamical systems”. In:Physica D: Nonlinear Phenomena472 (Feb. 2025), p. 134499.doi: 10.1016/j.physd.2024.134499
-
[27]
& Di Meglio, F.Recognition Models to Learn Dynamics from Partial Observations with Neural ODEs
Buisson-Fenet, M., Morgenthaler, V., Trimpe, S. & Di Meglio, F.Recognition Models to Learn Dynamics from Partial Observations with Neural ODEs. Jan. 2023. arXiv:2205.12550 [eess.SY]. 23
-
[28]
Learning Physics Informed Neural ODEs with Partial Measurements
Ghanem, P. et al. “Learning Physics Informed Neural ODEs with Partial Measurements”. In: Proceedings of the AAAI Conference on Artificial Intelligence39.16 (Apr. 2025), pp. 16799– 16807.doi:10.1609/aaai.v39i16.33846
-
[29]
Malani, S. et al. “Some of the variables, some of the parameters, some of the times, with some physics known: Identification with partial information”. In:Computers & Chemical Engineering178 (Oct. 2023), p. 108343.doi:10.1016/j.compchemeng.2023.108343
-
[30]
Symplectic ODE-Net: Learning Hamiltonian Dy- namics with Control
Zhong, Y. D., Dey, B. & Chakraborty, A. “Symplectic ODE-Net: Learning Hamiltonian Dy- namics with Control”. In:International Conference on Learning Representations. 2020. arXiv: 1909.12077
-
[31]
& Battaglia, P.Hamiltonian Graph Networks with ODE Integrators
Sanchez-Gonzalez, A., Bapst, V., Cranmer, K. & Battaglia, P.Hamiltonian Graph Networks with ODE Integrators. Sept. 2019. arXiv:1909.12790
-
[32]
Dynamical Evolution of Clusters of Galaxies, I
Aarseth, S. J. & Hoyle, F. “Dynamical Evolution of Clusters of Galaxies, I”. In:Monthly Notices of the Royal Astronomical Society126.3 (June 1963), pp. 223–255.doi:10.1093/ mnras/126.3.223
work page 1963
-
[33]
Dehnen, W. “Towards optimal softening in three-dimensional N-body codes — I. Minimizing the force error”. In:Monthly Notices of the Royal Astronomical Society324.2 (June 2001), pp. 273–291.doi:10.1046/j.1365-8711.2001.04237.x
-
[34]
Physics-informed machine learning
Karniadakis, G. E. et al. “Physics-informed machine learning”. In:Nature Reviews Physics 3.6 (June 2021), pp. 422–440.doi:10.1038/s42254-021-00314-5
-
[35]
Port-Hamiltonian Neural Networks for Learning Explicit Time-Dependent Dynamical Systems
Desai, S. A., Mattheakis, M., Sondak, D., Protopapas, P. & Roberts, S. J. “Port-Hamiltonian Neural Networks for Learning Explicit Time-Dependent Dynamical Systems”. In:Phys. Rev. E104 (3 Sept. 2021), p. 034312.doi:10.1103/PhysRevE.104.034312
-
[36]
et al.Lagrangian Neural Networks
Cranmer, M. et al.Lagrangian Neural Networks. July 2020. arXiv:2003.04630 [cs.LG]
-
[37]
Discovering Symbolic Models from Deep Learning with Inductive Bi- ases
Cranmer, M. et al. “Discovering Symbolic Models from Deep Learning with Inductive Bi- ases”. In:Advances in Neural Information Processing Systems. Ed. by H. Larochelle, M. Ran- zato, R. Hadsell, M. Balcan & H. Lin. Vol. 33. Curran Associates, Inc., 2020, pp. 17429– 17442.url:https : / / proceedings . neurips . cc / paper _ files / paper / 2020 / file / c9...
work page 2020
-
[38]
Artificial neural networks for solving ordinary and partial differential equations
Lagaris, I., Likas, A. & Fotiadis, D. “Artificial neural networks for solving ordinary and partial differential equations”. In:IEEE Transactions on Neural Networks9.5 (1998), pp. 987–1000. doi:10.1109/72.712178
-
[39]
Raissi, M., Perdikaris, P. & Karniadakis, G. E. “Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial dif- ferential equations”. In:Journal of Computational Physics378 (Feb. 2019), pp. 686–707.doi: 10.1016/j.jcp.2018.10.045
-
[40]
Runge–Kutta pairs of order 5(4) satisfying only the first column simplifying assumption
Tsitouras, C. “Runge–Kutta pairs of order 5(4) satisfying only the first column simplifying assumption”. In:Computers & Mathematics with Applications62.2 (July 2011), pp. 770–775. doi:10.1016/j.camwa.2011.06.002
-
[41]
Adam: A Method for Stochastic Optimization
Kingma, D. & Ba, J. “Adam: A Method for Stochastic Optimization”. In:International Conference on Learning Representations(Dec. 2014)
work page 2014
-
[42]
Meltzer, S., Eidnes, S. & Stasik, A. J.Neural Hamiltonian ordinary differential equations. Version 1.0. 2026.doi:10.5281/zenodo.20283041. 24
-
[43]
et al.JAX: composable transformations of Python+NumPy programs
Bradbury, J. et al.JAX: composable transformations of Python+NumPy programs. Ver- sion 0.3.13. 2018.url:http://github.com/jax-ml/jax
work page 2018
-
[44]
Equinox: neural networks in JAX via callable PyTrees and filtered transformations
Kidger, P. & Garcia, C. “Equinox: neural networks in JAX via callable PyTrees and filtered transformations”. In:Differentiable Programming workshop at Neural Information Processing Systems 2021(2021). arXiv:2111.00254. 25
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.