pith. sign in

arxiv: 2605.23510 · v1 · pith:6WTF65A3new · submitted 2026-05-22 · 💻 cs.LG

Learning partially observed systems with neural Hamiltonian ordinary differential equations

Pith reviewed 2026-05-25 05:06 UTC · model grok-4.3

classification 💻 cs.LG
keywords neural Hamiltonian ODEpartially observed dynamical systemsenergy conservationphysics-informed learninglatent dynamicsHamiltonian neural networksneural ODElong-horizon prediction
0
0 comments X

The pith

Neural Hamiltonian ODEs learn partially observed dynamical systems by enforcing energy conservation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents NHODE as a way to learn dynamical systems when some state variables are never observed directly. It embeds Hamiltonian structure so that energy is conserved by construction, allowing the loss to be computed only on the visible variables while the model still infers the hidden ones. The authors test the approach on mass-spring systems and the chaotic three-body problem, showing that adding more physical constraints steadily raises accuracy and prevents instability over long time horizons. Purely data-driven baselines diverge in the same regimes. The central point is that the conserved-energy prior supplies enough inductive bias to recover latent dynamics from partial data alone.

Core claim

NHODE merges Hamiltonian neural networks with neural ODEs so that the learned vector field respects energy conservation even when only a subset of coordinates is observed. The training loss is defined solely on the visible variables; the Hamiltonian formulation plus optional symmetry transformations and separable energy terms then determine the evolution of the unobserved variables. On linear and nonlinear oscillators as well as the three-body problem, models with more embedded structure produce lower error and remain stable far beyond the training horizon, while unstructured neural ODEs become unstable.

What carries the argument

The NHODE model, which uses a neural network to parameterize a Hamiltonian whose gradients define the ODE right-hand side, combined with a flexible neural-ODE integrator that permits supervision only on observed coordinates.

If this is right

  • Accuracy and long-horizon stability increase as more physical structure (energy conservation, symmetries, separability) is embedded.
  • Latent variables can be reconstructed without any direct supervision on them.
  • The same framework remains stable on chaotic systems where purely data-driven models diverge.
  • Training remains possible when the loss is restricted to the observed subset of the state.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same inductive bias could be tested on systems that conserve quantities other than energy, such as angular momentum.
  • Real sensor data with measurement noise would provide a practical test of whether the learned Hamiltonian remains identifiable.
  • The approach may transfer to control tasks where only partial state feedback is available.
  • Adding dissipative terms or time-dependent Hamiltonians would be a direct next extension.

Load-bearing premise

The underlying system must obey Hamiltonian mechanics whose total energy can be recovered from the observed variables alone.

What would settle it

A direct comparison in which an unstructured neural ODE matches or exceeds NHODE long-horizon accuracy on the three-body problem after identical training would falsify the claimed benefit of the Hamiltonian constraint.

Figures

Figures reproduced from arXiv: 2605.23510 by Alexander Johannes Stasik, S{\o}lve Eidnes, Sunniva Meltzer.

Figure 1
Figure 1. Figure 1: Training pipeline for neural Hamiltonian ODEs. From an observed history, a neural [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Dynamical systems investigated in this study. From left to right, we have a linear [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Predictions for the linear two-mass two-spring system. Panel (a) shows the predicted [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Model prediction for the triangular nonlinear mass-spring system. Panel (a) shows a grid [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Model prediction for the three body problem. Panel (a) shows a grid of snapshots, where [PITH_FULL_IMAGE:figures/full_fig_p009_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Model predictions for the linear two–mass two–spring system when initial conditions of [PITH_FULL_IMAGE:figures/full_fig_p011_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Testing the optimal value of the Plummer softening parameter in the three body problem. [PITH_FULL_IMAGE:figures/full_fig_p020_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Long-horizon rollouts for the three-body problem, showing the predicted position of the [PITH_FULL_IMAGE:figures/full_fig_p021_8.png] view at source ↗
read the original abstract

When learning dynamical systems from data, embedding physical structure can constrain the solution space and improve generalization, but many physics-informed models assume access to the full system state. This limits their use in partially observed settings, where some state variables are completely unobserved and must be inferred without direct supervision. Here, we present neural Hamiltonian ordinary differential equations (NHODE), a framework that combines Hamiltonian neural networks (HNNs) with neural ordinary differential equations (neural ODEs) to learn partially observed dynamical systems from data. The Hamiltonian structure enforces energy conservation by construction, while the neural ODE framework enables a flexible training procedure that allows the loss to be defined only on observed variables. We also incorporate additional physical constraints through symmetry-aware coordinate transformations and separable energy formulations. The framework is evaluated on systems of increasing complexity, from linear and nonlinear mass-spring systems to the chaotic three-body problem. Across all examples, increasing the amount of embedded physical structure improves the accuracy and long-horizon stability of the predictions. Even in the most challenging regimes, the NHODE framework captures both observed and latent dynamics, whereas purely data-driven baselines become unstable.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 3 minor

Summary. The manuscript introduces neural Hamiltonian ordinary differential equations (NHODE), which integrate Hamiltonian neural networks with neural ODEs to learn partially observed dynamical systems. The Hamiltonian structure enforces energy conservation by construction, while the neural ODE formulation permits the training loss to be defined exclusively on observed variables. Additional inductive biases are incorporated via symmetry-aware coordinate transformations and separable energy formulations. The method is evaluated on systems of increasing complexity, including linear and nonlinear mass-spring oscillators and the chaotic three-body problem, with the central claim that greater embedding of physical structure yields improved accuracy and long-horizon stability, and that NHODE recovers both observed and latent dynamics where purely data-driven baselines become unstable.

Significance. If the empirical results are robust, the work provides a principled route to physics-informed learning under partial observability, a setting that arises frequently in applications such as robotics and experimental physics. The explicit combination of conservation-law constraints with flexible neural-ODE training on incomplete data is a clear technical contribution. The evaluation across a progression of Hamiltonian systems, including a chaotic example, supplies a useful test bed for assessing long-term stability.

minor comments (3)
  1. [Abstract] Abstract: the summary statement would be strengthened by the inclusion of at least one quantitative performance metric (e.g., long-horizon prediction error or stability horizon) together with a brief description of the baselines and training protocol.
  2. [§3] The manuscript should clarify in §3 or §4 whether the latent-state inference is performed by augmenting the observed state with additional neural-ODE variables or by an explicit encoder; the current description leaves this architectural choice ambiguous.
  3. [Figures 4–6] Figure captions for the three-body experiments should report the integration horizon and the precise definition of “stability” used to generate the reported trajectories.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive assessment of our manuscript, the recognition of its technical contribution, and the recommendation for minor revision. The report does not list any specific major comments.

Circularity Check

0 steps flagged

No significant circularity

full rationale

The provided abstract and context describe NHODE as combining HNNs (energy conservation by construction) with neural ODEs for training on observed variables only, with added symmetry constraints. No equations, derivations, or self-citations are exhibited that reduce any prediction or result to fitted inputs or prior author work by construction. The framework uses standard optimization with physical inductive biases; the central claims remain independent of any internal reduction and are self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The framework rests on the domain assumption that the target systems are Hamiltonian. Neural network parameters are the standard fitted objects. No new physical entities are postulated.

free parameters (1)
  • Neural network parameters
    Weights of the networks approximating the Hamiltonian and dynamics, optimized via gradient descent on observed data.
axioms (1)
  • domain assumption The system is Hamiltonian with conserved total energy
    Invoked to enforce conservation by construction and to enable inference of unobserved states.

pith-pipeline@v0.9.0 · 5730 in / 1178 out tokens · 31859 ms · 2026-05-25T05:06:01.940928+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

44 extracted references · 44 canonical work pages · 2 internal anchors

  1. [1]

    Hamiltonian Neural Networks

    Greydanus, S., Dzamba, M. & Yosinski, J. “Hamiltonian Neural Networks”. In:Advances in Neural Information Processing Systems. Ed. by H. Wallach et al. Vol. 32. Curran Associates, Inc., 2019.url:https : / / proceedings . neurips . cc / paper _ files / paper / 2019 / file / 26cd8ecadce0d4efd6cc8a8725cbd1f8-Paper.pdf

  2. [2]

    Neural Ordinary Differ- ential Equations

    Chen, R. T. Q., Rubanova, Y., Bettencourt, J. & Duvenaud, D. K. “Neural Ordinary Differ- ential Equations”. In:Advances in Neural Information Processing Systems. Ed. by S. Bengio et al. Vol. 31. Curran Associates, Inc., 2018.url:https : / / proceedings . neurips . cc / paper_files/paper/2018/file/69386f6bb1dfed68692a24c8686939b9-Paper.pdf

  3. [3]

    A Proposal on Machine Learning via Dynamical Systems

    E, W. “A Proposal on Machine Learning via Dynamical Systems”. In:Communications in Mathematics and Statistics5.1 (Mar. 2017), pp. 1–11.doi:10.1007/s40304-017-0103-z

  4. [4]

    Stable architectures for deep neural networks

    Haber, E. & Ruthotto, L. “Stable architectures for deep neural networks”. In:Inverse Prob- lems34.1 (Dec. 2017), p. 014004.doi:10.1088/1361-6420/aa9a90

  5. [5]

    A Neural Network Approach for Identification of Continuous- Time Nonlinear Dynamic Systems

    Chu, S. R. & Shoureshi, R. “A Neural Network Approach for Identification of Continuous- Time Nonlinear Dynamic Systems”. In:1991 American Control Conference. 1991, pp. 1–5. doi:10.23919/ACC.1991.4791308

  6. [6]

    Discrete- Vs. Continuous-Time Nonlinear Signal Processing of Cu Electrodissolution Data

    Rico-Mart´ ınez, R., Krischer, K., Kevrekidis, I. G., Kube, M. C. & Hudson, J. L. “Discrete- Vs. Continuous-Time Nonlinear Signal Processing of Cu Electrodissolution Data”. In:Chemical Engineering Communications118.1 (1992), pp. 25–48.doi:10.1080/00986449208936084

  7. [7]

    On Neural Differential Equations

    Kidger, P. “On Neural Differential Equations”. PhD thesis. University of Oxford, 2021. arXiv: 2202.02435

  8. [8]

    Which priors matter? Benchmarking models for learning latent dynamics

    Botev, A. et al. “Which priors matter? Benchmarking models for learning latent dynamics”. In:Thirty-fifth Conference on Neural Information Processing Systems Datasets and Bench- marks Track. 2021. arXiv:2111.05458. 22

  9. [9]

    SNODE: Spectral Discretization of Neural ODEs for System Identification

    Quaglino, A., Gallieri, M., Masci, J. & Koutn´ ık, J. “SNODE: Spectral Discretization of Neural ODEs for System Identification”. In:International Conference on Learning Representations

  10. [10]

    LocalEigenmotionControlforNearRectilinearHaloOrbits,

    Rahman, A., Drgoˇ na, J., Tuor, A. & Strube, J. “Neural Ordinary Differential Equations for Nonlinear System Identification”. In:2022 American Control Conference (ACC). 2022, pp. 3979–3984.doi:10.23919/ACC53348.2022.9867586

  11. [11]

    & Yaguchi, T.FINDE: Neural Differential Equations for Finding and Preserv- ing Invariant Quantities

    Matsubara, T. & Yaguchi, T.FINDE: Neural Differential Equations for Finding and Preserv- ing Invariant Quantities. Mar. 2023. arXiv:2210.00272

  12. [12]

    et al.Universal Differential Equations for Scientific Machine Learning

    Rackauckas, C. et al.Universal Differential Equations for Scientific Machine Learning. Nov

  13. [13]

    arXiv:2001.04385 [cs.LG]

  14. [14]

    et al.DiffEqFlux.jl - A Julia Library for Neural Differential Equations

    Rackauckas, C. et al.DiffEqFlux.jl - A Julia Library for Neural Differential Equations. Feb

  15. [15]

    arXiv:1902.02376 [cs.LG]

  16. [16]

    et al.Bayesian Neural Ordinary Differential Equations

    Dandekar, R. et al.Bayesian Neural Ordinary Differential Equations. Feb. 2022. arXiv:2012. 07244

  17. [17]

    On learning Hamiltonian systems from data

    Bertalan, T., Dietrich, F., Mezi´ c, I. & Kevrekidis, I. G. “On learning Hamiltonian systems from data”. In:Chaos: An Interdisciplinary Journal of Nonlinear Science29.12 (Dec. 2019), p. 121107.doi:10.1063/1.5128231

  18. [18]

    Symplectic Recurrent Neural Networks

    Chen, Z., Zhang, J., Arjovsky, M. & Bottou, L. “Symplectic Recurrent Neural Networks”. In: International Conference on Learning Representations. 2020. arXiv:1909.13334

  19. [19]

    Symplectic learning for Hamiltonian neural networks

    David, M. & M´ ehats, F. “Symplectic learning for Hamiltonian neural networks”. In:Journal of Computational Physics494 (Dec. 2023), p. 112495.doi:10.1016/j.jcp.2023.112495

  20. [20]

    Pseudo-Hamiltonian neural networks with state-dependent external forces

    Eidnes, S., Stasik, A. J., Sterud, C., Bøhn, E. & Riemer-Sørensen, S. “Pseudo-Hamiltonian neural networks with state-dependent external forces”. In:Physica D: Nonlinear Phenomena 446 (Apr. 2023), p. 133673.doi:10.1016/j.physd.2023.133673

  21. [21]

    Learning dynamical systems from noisy data with inverse-explicit integrators

    Celledoni, E., Eidnes, S. & Myhr, H. N. “Learning dynamical systems from noisy data with inverse-explicit integrators”. In:Physica D: Nonlinear Phenomena472 (Feb. 2025), p. 134471. doi:10.1016/j.physd.2024.134471

  22. [22]

    & Tang, Y.Deep Hamiltonian networks based on symplectic integrators

    Zhu, A., Jin, P. & Tang, Y.Deep Hamiltonian networks based on symplectic integrators. Apr

  23. [23]

    arXiv:2004.13830 [math.NA]

  24. [24]

    Symplectic integration of learned Hamiltonian systems

    Offen, C. & Ober-Bl¨ obaum, S. “Symplectic integration of learned Hamiltonian systems”. In: Chaos: An Interdisciplinary Journal of Nonlinear Science32.1 (Jan. 2022), p. 013122.doi: 10.1063/5.0065913

  25. [25]

    Hamiltonian Generative Networks

    Toth, P. et al. “Hamiltonian Generative Networks”. In:International Conference on Learning Representations. Feb. 2020. arXiv:1909.13789

  26. [26]

    Learning governing equations of unobserved states in dynamical systems

    Grigorian, G., George, S. V. & Arridge, S. “Learning governing equations of unobserved states in dynamical systems”. In:Physica D: Nonlinear Phenomena472 (Feb. 2025), p. 134499.doi: 10.1016/j.physd.2024.134499

  27. [27]

    & Di Meglio, F.Recognition Models to Learn Dynamics from Partial Observations with Neural ODEs

    Buisson-Fenet, M., Morgenthaler, V., Trimpe, S. & Di Meglio, F.Recognition Models to Learn Dynamics from Partial Observations with Neural ODEs. Jan. 2023. arXiv:2205.12550 [eess.SY]. 23

  28. [28]

    Learning Physics Informed Neural ODEs with Partial Measurements

    Ghanem, P. et al. “Learning Physics Informed Neural ODEs with Partial Measurements”. In: Proceedings of the AAAI Conference on Artificial Intelligence39.16 (Apr. 2025), pp. 16799– 16807.doi:10.1609/aaai.v39i16.33846

  29. [29]

    Some of the variables, some of the parameters, some of the times, with some physics known: Identification with partial information

    Malani, S. et al. “Some of the variables, some of the parameters, some of the times, with some physics known: Identification with partial information”. In:Computers & Chemical Engineering178 (Oct. 2023), p. 108343.doi:10.1016/j.compchemeng.2023.108343

  30. [30]

    Symplectic ODE-Net: Learning Hamiltonian Dy- namics with Control

    Zhong, Y. D., Dey, B. & Chakraborty, A. “Symplectic ODE-Net: Learning Hamiltonian Dy- namics with Control”. In:International Conference on Learning Representations. 2020. arXiv: 1909.12077

  31. [31]

    & Battaglia, P.Hamiltonian Graph Networks with ODE Integrators

    Sanchez-Gonzalez, A., Bapst, V., Cranmer, K. & Battaglia, P.Hamiltonian Graph Networks with ODE Integrators. Sept. 2019. arXiv:1909.12790

  32. [32]

    Dynamical Evolution of Clusters of Galaxies, I

    Aarseth, S. J. & Hoyle, F. “Dynamical Evolution of Clusters of Galaxies, I”. In:Monthly Notices of the Royal Astronomical Society126.3 (June 1963), pp. 223–255.doi:10.1093/ mnras/126.3.223

  33. [33]

    A., Kirk, J

    Dehnen, W. “Towards optimal softening in three-dimensional N-body codes — I. Minimizing the force error”. In:Monthly Notices of the Royal Astronomical Society324.2 (June 2001), pp. 273–291.doi:10.1046/j.1365-8711.2001.04237.x

  34. [34]

    Physics-informed machine learning

    Karniadakis, G. E. et al. “Physics-informed machine learning”. In:Nature Reviews Physics 3.6 (June 2021), pp. 422–440.doi:10.1038/s42254-021-00314-5

  35. [35]

    Port-Hamiltonian Neural Networks for Learning Explicit Time-Dependent Dynamical Systems

    Desai, S. A., Mattheakis, M., Sondak, D., Protopapas, P. & Roberts, S. J. “Port-Hamiltonian Neural Networks for Learning Explicit Time-Dependent Dynamical Systems”. In:Phys. Rev. E104 (3 Sept. 2021), p. 034312.doi:10.1103/PhysRevE.104.034312

  36. [36]

    et al.Lagrangian Neural Networks

    Cranmer, M. et al.Lagrangian Neural Networks. July 2020. arXiv:2003.04630 [cs.LG]

  37. [37]

    Discovering Symbolic Models from Deep Learning with Inductive Bi- ases

    Cranmer, M. et al. “Discovering Symbolic Models from Deep Learning with Inductive Bi- ases”. In:Advances in Neural Information Processing Systems. Ed. by H. Larochelle, M. Ran- zato, R. Hadsell, M. Balcan & H. Lin. Vol. 33. Curran Associates, Inc., 2020, pp. 17429– 17442.url:https : / / proceedings . neurips . cc / paper _ files / paper / 2020 / file / c9...

  38. [38]

    Artificial neural networks for solving ordinary and partial differential equations

    Lagaris, I., Likas, A. & Fotiadis, D. “Artificial neural networks for solving ordinary and partial differential equations”. In:IEEE Transactions on Neural Networks9.5 (1998), pp. 987–1000. doi:10.1109/72.712178

  39. [39]

    Raissi, P

    Raissi, M., Perdikaris, P. & Karniadakis, G. E. “Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial dif- ferential equations”. In:Journal of Computational Physics378 (Feb. 2019), pp. 686–707.doi: 10.1016/j.jcp.2018.10.045

  40. [40]

    Runge–Kutta pairs of order 5(4) satisfying only the first column simplifying assumption

    Tsitouras, C. “Runge–Kutta pairs of order 5(4) satisfying only the first column simplifying assumption”. In:Computers & Mathematics with Applications62.2 (July 2011), pp. 770–775. doi:10.1016/j.camwa.2011.06.002

  41. [41]

    Adam: A Method for Stochastic Optimization

    Kingma, D. & Ba, J. “Adam: A Method for Stochastic Optimization”. In:International Conference on Learning Representations(Dec. 2014)

  42. [42]

    & Stasik, A

    Meltzer, S., Eidnes, S. & Stasik, A. J.Neural Hamiltonian ordinary differential equations. Version 1.0. 2026.doi:10.5281/zenodo.20283041. 24

  43. [43]

    et al.JAX: composable transformations of Python+NumPy programs

    Bradbury, J. et al.JAX: composable transformations of Python+NumPy programs. Ver- sion 0.3.13. 2018.url:http://github.com/jax-ml/jax

  44. [44]

    Equinox: neural networks in JAX via callable PyTrees and filtered transformations

    Kidger, P. & Garcia, C. “Equinox: neural networks in JAX via callable PyTrees and filtered transformations”. In:Differentiable Programming workshop at Neural Information Processing Systems 2021(2021). arXiv:2111.00254. 25