Learning partially observed systems with neural Hamiltonian ordinary differential equations

Alexander Johannes Stasik; S{\o}lve Eidnes; Sunniva Meltzer

arxiv: 2605.23510 · v1 · pith:6WTF65A3new · submitted 2026-05-22 · 💻 cs.LG

Learning partially observed systems with neural Hamiltonian ordinary differential equations

Sunniva Meltzer , S{\o}lve Eidnes , Alexander Johannes Stasik This is my paper

Pith reviewed 2026-05-25 05:06 UTC · model grok-4.3

classification 💻 cs.LG

keywords neural Hamiltonian ODEpartially observed dynamical systemsenergy conservationphysics-informed learninglatent dynamicsHamiltonian neural networksneural ODElong-horizon prediction

0 comments

The pith

Neural Hamiltonian ODEs learn partially observed dynamical systems by enforcing energy conservation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents NHODE as a way to learn dynamical systems when some state variables are never observed directly. It embeds Hamiltonian structure so that energy is conserved by construction, allowing the loss to be computed only on the visible variables while the model still infers the hidden ones. The authors test the approach on mass-spring systems and the chaotic three-body problem, showing that adding more physical constraints steadily raises accuracy and prevents instability over long time horizons. Purely data-driven baselines diverge in the same regimes. The central point is that the conserved-energy prior supplies enough inductive bias to recover latent dynamics from partial data alone.

Core claim

NHODE merges Hamiltonian neural networks with neural ODEs so that the learned vector field respects energy conservation even when only a subset of coordinates is observed. The training loss is defined solely on the visible variables; the Hamiltonian formulation plus optional symmetry transformations and separable energy terms then determine the evolution of the unobserved variables. On linear and nonlinear oscillators as well as the three-body problem, models with more embedded structure produce lower error and remain stable far beyond the training horizon, while unstructured neural ODEs become unstable.

What carries the argument

The NHODE model, which uses a neural network to parameterize a Hamiltonian whose gradients define the ODE right-hand side, combined with a flexible neural-ODE integrator that permits supervision only on observed coordinates.

If this is right

Accuracy and long-horizon stability increase as more physical structure (energy conservation, symmetries, separability) is embedded.
Latent variables can be reconstructed without any direct supervision on them.
The same framework remains stable on chaotic systems where purely data-driven models diverge.
Training remains possible when the loss is restricted to the observed subset of the state.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same inductive bias could be tested on systems that conserve quantities other than energy, such as angular momentum.
Real sensor data with measurement noise would provide a practical test of whether the learned Hamiltonian remains identifiable.
The approach may transfer to control tasks where only partial state feedback is available.
Adding dissipative terms or time-dependent Hamiltonians would be a direct next extension.

Load-bearing premise

The underlying system must obey Hamiltonian mechanics whose total energy can be recovered from the observed variables alone.

What would settle it

A direct comparison in which an unstructured neural ODE matches or exceeds NHODE long-horizon accuracy on the three-body problem after identical training would falsify the claimed benefit of the Hamiltonian constraint.

Figures

Figures reproduced from arXiv: 2605.23510 by Alexander Johannes Stasik, S{\o}lve Eidnes, Sunniva Meltzer.

**Figure 2.** Figure 2: Dynamical systems investigated in this study. From left to right, we have a linear [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗

**Figure 3.** Figure 3: Predictions for the linear two-mass two-spring system. Panel (a) shows the predicted [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

**Figure 4.** Figure 4: Model prediction for the triangular nonlinear mass-spring system. Panel (a) shows a grid [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗

**Figure 5.** Figure 5: Model prediction for the three body problem. Panel (a) shows a grid of snapshots, where [PITH_FULL_IMAGE:figures/full_fig_p009_5.png] view at source ↗

**Figure 6.** Figure 6: Model predictions for the linear two–mass two–spring system when initial conditions of [PITH_FULL_IMAGE:figures/full_fig_p011_6.png] view at source ↗

**Figure 7.** Figure 7: Testing the optimal value of the Plummer softening parameter in the three body problem. [PITH_FULL_IMAGE:figures/full_fig_p020_7.png] view at source ↗

**Figure 8.** Figure 8: Long-horizon rollouts for the three-body problem, showing the predicted position of the [PITH_FULL_IMAGE:figures/full_fig_p021_8.png] view at source ↗

read the original abstract

When learning dynamical systems from data, embedding physical structure can constrain the solution space and improve generalization, but many physics-informed models assume access to the full system state. This limits their use in partially observed settings, where some state variables are completely unobserved and must be inferred without direct supervision. Here, we present neural Hamiltonian ordinary differential equations (NHODE), a framework that combines Hamiltonian neural networks (HNNs) with neural ordinary differential equations (neural ODEs) to learn partially observed dynamical systems from data. The Hamiltonian structure enforces energy conservation by construction, while the neural ODE framework enables a flexible training procedure that allows the loss to be defined only on observed variables. We also incorporate additional physical constraints through symmetry-aware coordinate transformations and separable energy formulations. The framework is evaluated on systems of increasing complexity, from linear and nonlinear mass-spring systems to the chaotic three-body problem. Across all examples, increasing the amount of embedded physical structure improves the accuracy and long-horizon stability of the predictions. Even in the most challenging regimes, the NHODE framework captures both observed and latent dynamics, whereas purely data-driven baselines become unstable.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

NHODE adds Hamiltonian structure and symmetry constraints to neural ODEs for partial observations, but the abstract gives no numbers so the gains are hard to judge.

read the letter

The core idea is to embed a Hamiltonian neural network inside a neural ODE so the model can be trained only on observed variables while the latent dynamics are carried by the conserved energy. They also add symmetry-aware coordinate changes and separable energy terms. That combination is the main new piece relative to plain neural ODEs or standard HNNs. It targets a practical gap: many real systems give only partial state data, and the method tries to infer the rest without direct supervision on the hidden variables. The abstract says the physics constraints help long-horizon stability on linear and nonlinear springs and on the three-body problem, where pure data-driven baselines blow up. That direction makes sense and the construction looks internally consistent. The main soft spot is the lack of any reported metrics, error bars, or baseline details in the abstract, so it is impossible to tell how large the improvement actually is or whether the latent inference is accurate. The claim that the system must obey Hamiltonian mechanics with learnable energy from partial data is also doing heavy lifting; if that assumption is off, the advantage disappears. This paper is aimed at people already working on physics-informed neural ODEs who need to handle missing state variables. It is worth sending to peer review because the problem is real and the framework is a clean extension, even if the current evidence is thin.

Referee Report

0 major / 3 minor

Summary. The manuscript introduces neural Hamiltonian ordinary differential equations (NHODE), which integrate Hamiltonian neural networks with neural ODEs to learn partially observed dynamical systems. The Hamiltonian structure enforces energy conservation by construction, while the neural ODE formulation permits the training loss to be defined exclusively on observed variables. Additional inductive biases are incorporated via symmetry-aware coordinate transformations and separable energy formulations. The method is evaluated on systems of increasing complexity, including linear and nonlinear mass-spring oscillators and the chaotic three-body problem, with the central claim that greater embedding of physical structure yields improved accuracy and long-horizon stability, and that NHODE recovers both observed and latent dynamics where purely data-driven baselines become unstable.

Significance. If the empirical results are robust, the work provides a principled route to physics-informed learning under partial observability, a setting that arises frequently in applications such as robotics and experimental physics. The explicit combination of conservation-law constraints with flexible neural-ODE training on incomplete data is a clear technical contribution. The evaluation across a progression of Hamiltonian systems, including a chaotic example, supplies a useful test bed for assessing long-term stability.

minor comments (3)

[Abstract] Abstract: the summary statement would be strengthened by the inclusion of at least one quantitative performance metric (e.g., long-horizon prediction error or stability horizon) together with a brief description of the baselines and training protocol.
[§3] The manuscript should clarify in §3 or §4 whether the latent-state inference is performed by augmenting the observed state with additional neural-ODE variables or by an explicit encoder; the current description leaves this architectural choice ambiguous.
[Figures 4–6] Figure captions for the three-body experiments should report the integration horizon and the precise definition of “stability” used to generate the reported trajectories.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive assessment of our manuscript, the recognition of its technical contribution, and the recommendation for minor revision. The report does not list any specific major comments.

Circularity Check

0 steps flagged

No significant circularity

full rationale

The provided abstract and context describe NHODE as combining HNNs (energy conservation by construction) with neural ODEs for training on observed variables only, with added symmetry constraints. No equations, derivations, or self-citations are exhibited that reduce any prediction or result to fitted inputs or prior author work by construction. The framework uses standard optimization with physical inductive biases; the central claims remain independent of any internal reduction and are self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The framework rests on the domain assumption that the target systems are Hamiltonian. Neural network parameters are the standard fitted objects. No new physical entities are postulated.

free parameters (1)

Neural network parameters
Weights of the networks approximating the Hamiltonian and dynamics, optimized via gradient descent on observed data.

axioms (1)

domain assumption The system is Hamiltonian with conserved total energy
Invoked to enforce conservation by construction and to enable inference of unobserved states.

pith-pipeline@v0.9.0 · 5730 in / 1178 out tokens · 31859 ms · 2026-05-25T05:06:01.940928+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean; IndisputableMonolith/Foundation/BranchSelection.lean; IndisputableMonolith/Foundation/AlphaCoordinateFixation.lean washburn_uniqueness_aczel; branch_selection; alpha_pin_under_high_calibration echoes

?

echoes
ECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.

The Hamiltonian structure enforces energy conservation by construction... NHODEpot... only the potential energy is learned... T(p)=p^{2}/2m... translational symmetry... pairwise distances... conservation of linear and angular momentum
IndisputableMonolith/Foundation/AlexanderDuality.lean alexander_duality_circle_linking echoes

?

echoes
ECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.

three-body problem... chaotic... NHODEpot,rel... only model variant that consistently produce non-diverging trajectories... conserves energy, linear momentum, and angular momentum

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

44 extracted references · 44 canonical work pages · 2 internal anchors

[1]

Hamiltonian Neural Networks

Greydanus, S., Dzamba, M. & Yosinski, J. “Hamiltonian Neural Networks”. In:Advances in Neural Information Processing Systems. Ed. by H. Wallach et al. Vol. 32. Curran Associates, Inc., 2019.url:https : / / proceedings . neurips . cc / paper _ files / paper / 2019 / file / 26cd8ecadce0d4efd6cc8a8725cbd1f8-Paper.pdf

work page 2019
[2]

Neural Ordinary Differ- ential Equations

Chen, R. T. Q., Rubanova, Y., Bettencourt, J. & Duvenaud, D. K. “Neural Ordinary Differ- ential Equations”. In:Advances in Neural Information Processing Systems. Ed. by S. Bengio et al. Vol. 31. Curran Associates, Inc., 2018.url:https : / / proceedings . neurips . cc / paper_files/paper/2018/file/69386f6bb1dfed68692a24c8686939b9-Paper.pdf

work page 2018
[3]

A Proposal on Machine Learning via Dynamical Systems

E, W. “A Proposal on Machine Learning via Dynamical Systems”. In:Communications in Mathematics and Statistics5.1 (Mar. 2017), pp. 1–11.doi:10.1007/s40304-017-0103-z

work page doi:10.1007/s40304-017-0103-z 2017
[4]

Stable architectures for deep neural networks

Haber, E. & Ruthotto, L. “Stable architectures for deep neural networks”. In:Inverse Prob- lems34.1 (Dec. 2017), p. 014004.doi:10.1088/1361-6420/aa9a90

work page doi:10.1088/1361-6420/aa9a90 2017
[5]

A Neural Network Approach for Identification of Continuous- Time Nonlinear Dynamic Systems

Chu, S. R. & Shoureshi, R. “A Neural Network Approach for Identification of Continuous- Time Nonlinear Dynamic Systems”. In:1991 American Control Conference. 1991, pp. 1–5. doi:10.23919/ACC.1991.4791308

work page doi:10.23919/acc.1991.4791308 1991
[6]

Discrete- Vs. Continuous-Time Nonlinear Signal Processing of Cu Electrodissolution Data

Rico-Mart´ ınez, R., Krischer, K., Kevrekidis, I. G., Kube, M. C. & Hudson, J. L. “Discrete- Vs. Continuous-Time Nonlinear Signal Processing of Cu Electrodissolution Data”. In:Chemical Engineering Communications118.1 (1992), pp. 25–48.doi:10.1080/00986449208936084

work page doi:10.1080/00986449208936084 1992
[7]

On Neural Differential Equations

Kidger, P. “On Neural Differential Equations”. PhD thesis. University of Oxford, 2021. arXiv: 2202.02435

work page arXiv 2021
[8]

Which priors matter? Benchmarking models for learning latent dynamics

Botev, A. et al. “Which priors matter? Benchmarking models for learning latent dynamics”. In:Thirty-fifth Conference on Neural Information Processing Systems Datasets and Bench- marks Track. 2021. arXiv:2111.05458. 22

work page arXiv 2021
[9]

SNODE: Spectral Discretization of Neural ODEs for System Identification

Quaglino, A., Gallieri, M., Masci, J. & Koutn´ ık, J. “SNODE: Spectral Discretization of Neural ODEs for System Identification”. In:International Conference on Learning Representations

work page
[10]

LocalEigenmotionControlforNearRectilinearHaloOrbits,

Rahman, A., Drgoˇ na, J., Tuor, A. & Strube, J. “Neural Ordinary Differential Equations for Nonlinear System Identification”. In:2022 American Control Conference (ACC). 2022, pp. 3979–3984.doi:10.23919/ACC53348.2022.9867586

work page doi:10.23919/acc53348.2022.9867586 2022
[11]

& Yaguchi, T.FINDE: Neural Differential Equations for Finding and Preserv- ing Invariant Quantities

Matsubara, T. & Yaguchi, T.FINDE: Neural Differential Equations for Finding and Preserv- ing Invariant Quantities. Mar. 2023. arXiv:2210.00272

work page arXiv 2023
[12]

et al.Universal Differential Equations for Scientific Machine Learning

Rackauckas, C. et al.Universal Differential Equations for Scientific Machine Learning. Nov

work page
[13]

arXiv:2001.04385 [cs.LG]

work page internal anchor Pith review Pith/arXiv arXiv 2001
[14]

et al.DiffEqFlux.jl - A Julia Library for Neural Differential Equations

Rackauckas, C. et al.DiffEqFlux.jl - A Julia Library for Neural Differential Equations. Feb

work page
[15]

arXiv:1902.02376 [cs.LG]

work page internal anchor Pith review Pith/arXiv arXiv 1902
[16]

et al.Bayesian Neural Ordinary Differential Equations

Dandekar, R. et al.Bayesian Neural Ordinary Differential Equations. Feb. 2022. arXiv:2012. 07244

work page 2022
[17]

On learning Hamiltonian systems from data

Bertalan, T., Dietrich, F., Mezi´ c, I. & Kevrekidis, I. G. “On learning Hamiltonian systems from data”. In:Chaos: An Interdisciplinary Journal of Nonlinear Science29.12 (Dec. 2019), p. 121107.doi:10.1063/1.5128231

work page doi:10.1063/1.5128231 2019
[18]

Symplectic Recurrent Neural Networks

Chen, Z., Zhang, J., Arjovsky, M. & Bottou, L. “Symplectic Recurrent Neural Networks”. In: International Conference on Learning Representations. 2020. arXiv:1909.13334

work page arXiv 2020
[19]

Symplectic learning for Hamiltonian neural networks

David, M. & M´ ehats, F. “Symplectic learning for Hamiltonian neural networks”. In:Journal of Computational Physics494 (Dec. 2023), p. 112495.doi:10.1016/j.jcp.2023.112495

work page doi:10.1016/j.jcp.2023.112495 2023
[20]

Pseudo-Hamiltonian neural networks with state-dependent external forces

Eidnes, S., Stasik, A. J., Sterud, C., Bøhn, E. & Riemer-Sørensen, S. “Pseudo-Hamiltonian neural networks with state-dependent external forces”. In:Physica D: Nonlinear Phenomena 446 (Apr. 2023), p. 133673.doi:10.1016/j.physd.2023.133673

work page doi:10.1016/j.physd.2023.133673 2023
[21]

Learning dynamical systems from noisy data with inverse-explicit integrators

Celledoni, E., Eidnes, S. & Myhr, H. N. “Learning dynamical systems from noisy data with inverse-explicit integrators”. In:Physica D: Nonlinear Phenomena472 (Feb. 2025), p. 134471. doi:10.1016/j.physd.2024.134471

work page doi:10.1016/j.physd.2024.134471 2025
[22]

& Tang, Y.Deep Hamiltonian networks based on symplectic integrators

Zhu, A., Jin, P. & Tang, Y.Deep Hamiltonian networks based on symplectic integrators. Apr

work page
[23]

arXiv:2004.13830 [math.NA]

work page arXiv 2004
[24]

Symplectic integration of learned Hamiltonian systems

Offen, C. & Ober-Bl¨ obaum, S. “Symplectic integration of learned Hamiltonian systems”. In: Chaos: An Interdisciplinary Journal of Nonlinear Science32.1 (Jan. 2022), p. 013122.doi: 10.1063/5.0065913

work page doi:10.1063/5.0065913 2022
[25]

Hamiltonian Generative Networks

Toth, P. et al. “Hamiltonian Generative Networks”. In:International Conference on Learning Representations. Feb. 2020. arXiv:1909.13789

work page arXiv 2020
[26]

Learning governing equations of unobserved states in dynamical systems

Grigorian, G., George, S. V. & Arridge, S. “Learning governing equations of unobserved states in dynamical systems”. In:Physica D: Nonlinear Phenomena472 (Feb. 2025), p. 134499.doi: 10.1016/j.physd.2024.134499

work page doi:10.1016/j.physd.2024.134499 2025
[27]

& Di Meglio, F.Recognition Models to Learn Dynamics from Partial Observations with Neural ODEs

Buisson-Fenet, M., Morgenthaler, V., Trimpe, S. & Di Meglio, F.Recognition Models to Learn Dynamics from Partial Observations with Neural ODEs. Jan. 2023. arXiv:2205.12550 [eess.SY]. 23

work page arXiv 2023
[28]

Learning Physics Informed Neural ODEs with Partial Measurements

Ghanem, P. et al. “Learning Physics Informed Neural ODEs with Partial Measurements”. In: Proceedings of the AAAI Conference on Artificial Intelligence39.16 (Apr. 2025), pp. 16799– 16807.doi:10.1609/aaai.v39i16.33846

work page doi:10.1609/aaai.v39i16.33846 2025
[29]

Some of the variables, some of the parameters, some of the times, with some physics known: Identification with partial information

Malani, S. et al. “Some of the variables, some of the parameters, some of the times, with some physics known: Identification with partial information”. In:Computers & Chemical Engineering178 (Oct. 2023), p. 108343.doi:10.1016/j.compchemeng.2023.108343

work page doi:10.1016/j.compchemeng.2023.108343 2023
[30]

Symplectic ODE-Net: Learning Hamiltonian Dy- namics with Control

Zhong, Y. D., Dey, B. & Chakraborty, A. “Symplectic ODE-Net: Learning Hamiltonian Dy- namics with Control”. In:International Conference on Learning Representations. 2020. arXiv: 1909.12077

work page arXiv 2020
[31]

& Battaglia, P.Hamiltonian Graph Networks with ODE Integrators

Sanchez-Gonzalez, A., Bapst, V., Cranmer, K. & Battaglia, P.Hamiltonian Graph Networks with ODE Integrators. Sept. 2019. arXiv:1909.12790

work page arXiv 2019
[32]

Dynamical Evolution of Clusters of Galaxies, I

Aarseth, S. J. & Hoyle, F. “Dynamical Evolution of Clusters of Galaxies, I”. In:Monthly Notices of the Royal Astronomical Society126.3 (June 1963), pp. 223–255.doi:10.1093/ mnras/126.3.223

work page 1963
[33]

A., Kirk, J

Dehnen, W. “Towards optimal softening in three-dimensional N-body codes — I. Minimizing the force error”. In:Monthly Notices of the Royal Astronomical Society324.2 (June 2001), pp. 273–291.doi:10.1046/j.1365-8711.2001.04237.x

work page doi:10.1046/j.1365-8711.2001.04237.x 2001
[34]

Physics-informed machine learning

Karniadakis, G. E. et al. “Physics-informed machine learning”. In:Nature Reviews Physics 3.6 (June 2021), pp. 422–440.doi:10.1038/s42254-021-00314-5

work page doi:10.1038/s42254-021-00314-5 2021
[35]

Port-Hamiltonian Neural Networks for Learning Explicit Time-Dependent Dynamical Systems

Desai, S. A., Mattheakis, M., Sondak, D., Protopapas, P. & Roberts, S. J. “Port-Hamiltonian Neural Networks for Learning Explicit Time-Dependent Dynamical Systems”. In:Phys. Rev. E104 (3 Sept. 2021), p. 034312.doi:10.1103/PhysRevE.104.034312

work page doi:10.1103/physreve.104.034312 2021
[36]

et al.Lagrangian Neural Networks

Cranmer, M. et al.Lagrangian Neural Networks. July 2020. arXiv:2003.04630 [cs.LG]

work page arXiv 2020
[37]

Discovering Symbolic Models from Deep Learning with Inductive Bi- ases

Cranmer, M. et al. “Discovering Symbolic Models from Deep Learning with Inductive Bi- ases”. In:Advances in Neural Information Processing Systems. Ed. by H. Larochelle, M. Ran- zato, R. Hadsell, M. Balcan & H. Lin. Vol. 33. Curran Associates, Inc., 2020, pp. 17429– 17442.url:https : / / proceedings . neurips . cc / paper _ files / paper / 2020 / file / c9...

work page 2020
[38]

Artificial neural networks for solving ordinary and partial differential equations

Lagaris, I., Likas, A. & Fotiadis, D. “Artificial neural networks for solving ordinary and partial differential equations”. In:IEEE Transactions on Neural Networks9.5 (1998), pp. 987–1000. doi:10.1109/72.712178

work page doi:10.1109/72.712178 1998
[39]

Raissi, P

Raissi, M., Perdikaris, P. & Karniadakis, G. E. “Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial dif- ferential equations”. In:Journal of Computational Physics378 (Feb. 2019), pp. 686–707.doi: 10.1016/j.jcp.2018.10.045

work page doi:10.1016/j.jcp.2018.10.045 2019
[40]

Runge–Kutta pairs of order 5(4) satisfying only the first column simplifying assumption

Tsitouras, C. “Runge–Kutta pairs of order 5(4) satisfying only the first column simplifying assumption”. In:Computers & Mathematics with Applications62.2 (July 2011), pp. 770–775. doi:10.1016/j.camwa.2011.06.002

work page doi:10.1016/j.camwa.2011.06.002 2011
[41]

Adam: A Method for Stochastic Optimization

Kingma, D. & Ba, J. “Adam: A Method for Stochastic Optimization”. In:International Conference on Learning Representations(Dec. 2014)

work page 2014
[42]

& Stasik, A

Meltzer, S., Eidnes, S. & Stasik, A. J.Neural Hamiltonian ordinary differential equations. Version 1.0. 2026.doi:10.5281/zenodo.20283041. 24

work page doi:10.5281/zenodo.20283041 2026
[43]

et al.JAX: composable transformations of Python+NumPy programs

Bradbury, J. et al.JAX: composable transformations of Python+NumPy programs. Ver- sion 0.3.13. 2018.url:http://github.com/jax-ml/jax

work page 2018
[44]

Equinox: neural networks in JAX via callable PyTrees and filtered transformations

Kidger, P. & Garcia, C. “Equinox: neural networks in JAX via callable PyTrees and filtered transformations”. In:Differentiable Programming workshop at Neural Information Processing Systems 2021(2021). arXiv:2111.00254. 25

work page arXiv 2021

[1] [1]

Hamiltonian Neural Networks

Greydanus, S., Dzamba, M. & Yosinski, J. “Hamiltonian Neural Networks”. In:Advances in Neural Information Processing Systems. Ed. by H. Wallach et al. Vol. 32. Curran Associates, Inc., 2019.url:https : / / proceedings . neurips . cc / paper _ files / paper / 2019 / file / 26cd8ecadce0d4efd6cc8a8725cbd1f8-Paper.pdf

work page 2019

[2] [2]

Neural Ordinary Differ- ential Equations

Chen, R. T. Q., Rubanova, Y., Bettencourt, J. & Duvenaud, D. K. “Neural Ordinary Differ- ential Equations”. In:Advances in Neural Information Processing Systems. Ed. by S. Bengio et al. Vol. 31. Curran Associates, Inc., 2018.url:https : / / proceedings . neurips . cc / paper_files/paper/2018/file/69386f6bb1dfed68692a24c8686939b9-Paper.pdf

work page 2018

[3] [3]

A Proposal on Machine Learning via Dynamical Systems

E, W. “A Proposal on Machine Learning via Dynamical Systems”. In:Communications in Mathematics and Statistics5.1 (Mar. 2017), pp. 1–11.doi:10.1007/s40304-017-0103-z

work page doi:10.1007/s40304-017-0103-z 2017

[4] [4]

Stable architectures for deep neural networks

Haber, E. & Ruthotto, L. “Stable architectures for deep neural networks”. In:Inverse Prob- lems34.1 (Dec. 2017), p. 014004.doi:10.1088/1361-6420/aa9a90

work page doi:10.1088/1361-6420/aa9a90 2017

[5] [5]

A Neural Network Approach for Identification of Continuous- Time Nonlinear Dynamic Systems

Chu, S. R. & Shoureshi, R. “A Neural Network Approach for Identification of Continuous- Time Nonlinear Dynamic Systems”. In:1991 American Control Conference. 1991, pp. 1–5. doi:10.23919/ACC.1991.4791308

work page doi:10.23919/acc.1991.4791308 1991

[6] [6]

Discrete- Vs. Continuous-Time Nonlinear Signal Processing of Cu Electrodissolution Data

Rico-Mart´ ınez, R., Krischer, K., Kevrekidis, I. G., Kube, M. C. & Hudson, J. L. “Discrete- Vs. Continuous-Time Nonlinear Signal Processing of Cu Electrodissolution Data”. In:Chemical Engineering Communications118.1 (1992), pp. 25–48.doi:10.1080/00986449208936084

work page doi:10.1080/00986449208936084 1992

[7] [7]

On Neural Differential Equations

Kidger, P. “On Neural Differential Equations”. PhD thesis. University of Oxford, 2021. arXiv: 2202.02435

work page arXiv 2021

[8] [8]

Which priors matter? Benchmarking models for learning latent dynamics

Botev, A. et al. “Which priors matter? Benchmarking models for learning latent dynamics”. In:Thirty-fifth Conference on Neural Information Processing Systems Datasets and Bench- marks Track. 2021. arXiv:2111.05458. 22

work page arXiv 2021

[9] [9]

SNODE: Spectral Discretization of Neural ODEs for System Identification

Quaglino, A., Gallieri, M., Masci, J. & Koutn´ ık, J. “SNODE: Spectral Discretization of Neural ODEs for System Identification”. In:International Conference on Learning Representations

work page

[10] [10]

LocalEigenmotionControlforNearRectilinearHaloOrbits,

Rahman, A., Drgoˇ na, J., Tuor, A. & Strube, J. “Neural Ordinary Differential Equations for Nonlinear System Identification”. In:2022 American Control Conference (ACC). 2022, pp. 3979–3984.doi:10.23919/ACC53348.2022.9867586

work page doi:10.23919/acc53348.2022.9867586 2022

[11] [11]

& Yaguchi, T.FINDE: Neural Differential Equations for Finding and Preserv- ing Invariant Quantities

Matsubara, T. & Yaguchi, T.FINDE: Neural Differential Equations for Finding and Preserv- ing Invariant Quantities. Mar. 2023. arXiv:2210.00272

work page arXiv 2023

[12] [12]

et al.Universal Differential Equations for Scientific Machine Learning

Rackauckas, C. et al.Universal Differential Equations for Scientific Machine Learning. Nov

work page

[13] [13]

arXiv:2001.04385 [cs.LG]

work page internal anchor Pith review Pith/arXiv arXiv 2001

[14] [14]

et al.DiffEqFlux.jl - A Julia Library for Neural Differential Equations

Rackauckas, C. et al.DiffEqFlux.jl - A Julia Library for Neural Differential Equations. Feb

work page

[15] [15]

arXiv:1902.02376 [cs.LG]

work page internal anchor Pith review Pith/arXiv arXiv 1902

[16] [16]

et al.Bayesian Neural Ordinary Differential Equations

Dandekar, R. et al.Bayesian Neural Ordinary Differential Equations. Feb. 2022. arXiv:2012. 07244

work page 2022

[17] [17]

On learning Hamiltonian systems from data

Bertalan, T., Dietrich, F., Mezi´ c, I. & Kevrekidis, I. G. “On learning Hamiltonian systems from data”. In:Chaos: An Interdisciplinary Journal of Nonlinear Science29.12 (Dec. 2019), p. 121107.doi:10.1063/1.5128231

work page doi:10.1063/1.5128231 2019

[18] [18]

Symplectic Recurrent Neural Networks

Chen, Z., Zhang, J., Arjovsky, M. & Bottou, L. “Symplectic Recurrent Neural Networks”. In: International Conference on Learning Representations. 2020. arXiv:1909.13334

work page arXiv 2020

[19] [19]

Symplectic learning for Hamiltonian neural networks

David, M. & M´ ehats, F. “Symplectic learning for Hamiltonian neural networks”. In:Journal of Computational Physics494 (Dec. 2023), p. 112495.doi:10.1016/j.jcp.2023.112495

work page doi:10.1016/j.jcp.2023.112495 2023

[20] [20]

Pseudo-Hamiltonian neural networks with state-dependent external forces

Eidnes, S., Stasik, A. J., Sterud, C., Bøhn, E. & Riemer-Sørensen, S. “Pseudo-Hamiltonian neural networks with state-dependent external forces”. In:Physica D: Nonlinear Phenomena 446 (Apr. 2023), p. 133673.doi:10.1016/j.physd.2023.133673

work page doi:10.1016/j.physd.2023.133673 2023

[21] [21]

Learning dynamical systems from noisy data with inverse-explicit integrators

Celledoni, E., Eidnes, S. & Myhr, H. N. “Learning dynamical systems from noisy data with inverse-explicit integrators”. In:Physica D: Nonlinear Phenomena472 (Feb. 2025), p. 134471. doi:10.1016/j.physd.2024.134471

work page doi:10.1016/j.physd.2024.134471 2025

[22] [22]

& Tang, Y.Deep Hamiltonian networks based on symplectic integrators

Zhu, A., Jin, P. & Tang, Y.Deep Hamiltonian networks based on symplectic integrators. Apr

work page

[23] [23]

arXiv:2004.13830 [math.NA]

work page arXiv 2004

[24] [24]

Symplectic integration of learned Hamiltonian systems

Offen, C. & Ober-Bl¨ obaum, S. “Symplectic integration of learned Hamiltonian systems”. In: Chaos: An Interdisciplinary Journal of Nonlinear Science32.1 (Jan. 2022), p. 013122.doi: 10.1063/5.0065913

work page doi:10.1063/5.0065913 2022

[25] [25]

Hamiltonian Generative Networks

Toth, P. et al. “Hamiltonian Generative Networks”. In:International Conference on Learning Representations. Feb. 2020. arXiv:1909.13789

work page arXiv 2020

[26] [26]

Learning governing equations of unobserved states in dynamical systems

Grigorian, G., George, S. V. & Arridge, S. “Learning governing equations of unobserved states in dynamical systems”. In:Physica D: Nonlinear Phenomena472 (Feb. 2025), p. 134499.doi: 10.1016/j.physd.2024.134499

work page doi:10.1016/j.physd.2024.134499 2025

[27] [27]

& Di Meglio, F.Recognition Models to Learn Dynamics from Partial Observations with Neural ODEs

Buisson-Fenet, M., Morgenthaler, V., Trimpe, S. & Di Meglio, F.Recognition Models to Learn Dynamics from Partial Observations with Neural ODEs. Jan. 2023. arXiv:2205.12550 [eess.SY]. 23

work page arXiv 2023

[28] [28]

Learning Physics Informed Neural ODEs with Partial Measurements

Ghanem, P. et al. “Learning Physics Informed Neural ODEs with Partial Measurements”. In: Proceedings of the AAAI Conference on Artificial Intelligence39.16 (Apr. 2025), pp. 16799– 16807.doi:10.1609/aaai.v39i16.33846

work page doi:10.1609/aaai.v39i16.33846 2025

[29] [29]

Some of the variables, some of the parameters, some of the times, with some physics known: Identification with partial information

Malani, S. et al. “Some of the variables, some of the parameters, some of the times, with some physics known: Identification with partial information”. In:Computers & Chemical Engineering178 (Oct. 2023), p. 108343.doi:10.1016/j.compchemeng.2023.108343

work page doi:10.1016/j.compchemeng.2023.108343 2023

[30] [30]

Symplectic ODE-Net: Learning Hamiltonian Dy- namics with Control

Zhong, Y. D., Dey, B. & Chakraborty, A. “Symplectic ODE-Net: Learning Hamiltonian Dy- namics with Control”. In:International Conference on Learning Representations. 2020. arXiv: 1909.12077

work page arXiv 2020

[31] [31]

& Battaglia, P.Hamiltonian Graph Networks with ODE Integrators

Sanchez-Gonzalez, A., Bapst, V., Cranmer, K. & Battaglia, P.Hamiltonian Graph Networks with ODE Integrators. Sept. 2019. arXiv:1909.12790

work page arXiv 2019

[32] [32]

Dynamical Evolution of Clusters of Galaxies, I

Aarseth, S. J. & Hoyle, F. “Dynamical Evolution of Clusters of Galaxies, I”. In:Monthly Notices of the Royal Astronomical Society126.3 (June 1963), pp. 223–255.doi:10.1093/ mnras/126.3.223

work page 1963

[33] [33]

A., Kirk, J

Dehnen, W. “Towards optimal softening in three-dimensional N-body codes — I. Minimizing the force error”. In:Monthly Notices of the Royal Astronomical Society324.2 (June 2001), pp. 273–291.doi:10.1046/j.1365-8711.2001.04237.x

work page doi:10.1046/j.1365-8711.2001.04237.x 2001

[34] [34]

Physics-informed machine learning

Karniadakis, G. E. et al. “Physics-informed machine learning”. In:Nature Reviews Physics 3.6 (June 2021), pp. 422–440.doi:10.1038/s42254-021-00314-5

work page doi:10.1038/s42254-021-00314-5 2021

[35] [35]

Port-Hamiltonian Neural Networks for Learning Explicit Time-Dependent Dynamical Systems

Desai, S. A., Mattheakis, M., Sondak, D., Protopapas, P. & Roberts, S. J. “Port-Hamiltonian Neural Networks for Learning Explicit Time-Dependent Dynamical Systems”. In:Phys. Rev. E104 (3 Sept. 2021), p. 034312.doi:10.1103/PhysRevE.104.034312

work page doi:10.1103/physreve.104.034312 2021

[36] [36]

et al.Lagrangian Neural Networks

Cranmer, M. et al.Lagrangian Neural Networks. July 2020. arXiv:2003.04630 [cs.LG]

work page arXiv 2020

[37] [37]

Discovering Symbolic Models from Deep Learning with Inductive Bi- ases

Cranmer, M. et al. “Discovering Symbolic Models from Deep Learning with Inductive Bi- ases”. In:Advances in Neural Information Processing Systems. Ed. by H. Larochelle, M. Ran- zato, R. Hadsell, M. Balcan & H. Lin. Vol. 33. Curran Associates, Inc., 2020, pp. 17429– 17442.url:https : / / proceedings . neurips . cc / paper _ files / paper / 2020 / file / c9...

work page 2020

[38] [38]

Artificial neural networks for solving ordinary and partial differential equations

Lagaris, I., Likas, A. & Fotiadis, D. “Artificial neural networks for solving ordinary and partial differential equations”. In:IEEE Transactions on Neural Networks9.5 (1998), pp. 987–1000. doi:10.1109/72.712178

work page doi:10.1109/72.712178 1998

[39] [39]

Raissi, P

Raissi, M., Perdikaris, P. & Karniadakis, G. E. “Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial dif- ferential equations”. In:Journal of Computational Physics378 (Feb. 2019), pp. 686–707.doi: 10.1016/j.jcp.2018.10.045

work page doi:10.1016/j.jcp.2018.10.045 2019

[40] [40]

Runge–Kutta pairs of order 5(4) satisfying only the first column simplifying assumption

Tsitouras, C. “Runge–Kutta pairs of order 5(4) satisfying only the first column simplifying assumption”. In:Computers & Mathematics with Applications62.2 (July 2011), pp. 770–775. doi:10.1016/j.camwa.2011.06.002

work page doi:10.1016/j.camwa.2011.06.002 2011

[41] [41]

Adam: A Method for Stochastic Optimization

Kingma, D. & Ba, J. “Adam: A Method for Stochastic Optimization”. In:International Conference on Learning Representations(Dec. 2014)

work page 2014

[42] [42]

& Stasik, A

Meltzer, S., Eidnes, S. & Stasik, A. J.Neural Hamiltonian ordinary differential equations. Version 1.0. 2026.doi:10.5281/zenodo.20283041. 24

work page doi:10.5281/zenodo.20283041 2026

[43] [43]

et al.JAX: composable transformations of Python+NumPy programs

Bradbury, J. et al.JAX: composable transformations of Python+NumPy programs. Ver- sion 0.3.13. 2018.url:http://github.com/jax-ml/jax

work page 2018

[44] [44]

Equinox: neural networks in JAX via callable PyTrees and filtered transformations

Kidger, P. & Garcia, C. “Equinox: neural networks in JAX via callable PyTrees and filtered transformations”. In:Differentiable Programming workshop at Neural Information Processing Systems 2021(2021). arXiv:2111.00254. 25

work page arXiv 2021