Co-Learning Port-Hamiltonian Systems and Optimal Energy-Shaping Control
Pith reviewed 2026-05-08 03:03 UTC · model grok-4.3
The pith
A framework co-learns port-Hamiltonian models and energy-shaping controllers from trajectory data to produce inherently passive and stable closed-loop behavior.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By parameterizing a port-Hamiltonian model and an energy-balancing passivity-based controller with neural networks and alternating between model refinement on policy-generated trajectories and controller re-optimization, the method produces a controller that renders the closed-loop system passive and provably stable while preserving the plant's natural energy structure.
What carries the argument
Alternating optimization of neural-network-parameterized port-Hamiltonian dynamics and energy-balancing passivity-based controllers, combined with dissipation regularization to enforce energy decay.
Load-bearing premise
The true dynamics admit an accurate port-Hamiltonian representation that the neural networks can capture, and the alternating optimization converges to a solution that preserves the stability and passivity guarantees.
What would settle it
A closed-loop experiment in which total energy increases over time or the system becomes unstable under the learned controller would show that the passivity and stability claims do not hold.
Figures
read the original abstract
We develop a physics-informed learning framework for energy-shaping control of port-Hamiltonian (pH) systems from trajectory data. The proposed approach co-learns a pH system model and an optimal energy-balancing passivity-based controller (EB-PBC) through alternating optimization with policy-aware data collection. At each iteration, the system model is refined using trajectory data collected under the current control policy, and the controller is re-optimized on the updated model. Both components are parameterized by neural networks that embed the pH dynamics and EB-PBC structure, ensuring interpretability in terms of energy interactions. The learned controller renders the closed-loop system inherently passive and provably stable, and exploits passive plant dynamics without canceling the natural potential. A dissipation regularization enforces strict energy decay during training, thereby enhancing robustness to sim-to-real gaps. The proposed framework is validated on state-regulation and swing-up tasks for planar and torsional pendulum systems.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript develops a physics-informed co-learning framework that alternates between fitting a neural-network-parameterized port-Hamiltonian (pH) model to trajectory data and re-optimizing an energy-balancing passivity-based controller (EB-PBC) on the updated model. Both the pH dynamics (skew-symmetric J, positive-semidefinite R, convex Hamiltonian H) and the EB-PBC law are embedded in the network architectures. A dissipation regularization term is added during training. The central claim is that the resulting controller renders the closed-loop system inherently passive and provably stable while exploiting rather than canceling the plant's natural potential; the method is demonstrated on state-regulation and swing-up tasks for planar and torsional pendulums.
Significance. If the stability and passivity guarantees can be shown to transfer from the learned model to the true plant under bounded approximation error, the approach would provide a principled route to structure-preserving, interpretable controllers for underactuated mechanical systems without requiring exact first-principles models. The combination of alternating optimization, policy-aware data collection, and dissipation regularization directly targets the sim-to-real gap while preserving energy-based passivity arguments.
major comments (2)
- [Abstract] Abstract: The claim that the learned controller is 'provably stable' and renders the closed-loop system 'inherently passive' is load-bearing for the contribution, yet the provided text establishes these properties only with respect to the learned pH model. No Lipschitz bounds on the neural-network approximation of J, R, and H, nor any robustness margin for the EB-PBC design under model mismatch, are referenced; without such arguments the transfer of stability to the true plant remains unproven.
- [Abstract] Abstract and alternating-optimization description: The framework relies on the alternating loop converging to a fixed point that preserves the pH structure (skew-symmetry of J, R ≽ 0, convexity of H) so that the EB-PBC passivity proof continues to hold. No convergence analysis, contraction mapping, or even empirical monitoring of these structural invariants across iterations is supplied; this omission directly affects whether the optimality and stability claims survive the co-learning procedure.
minor comments (1)
- [Abstract] The abstract states that the controller 'exploits passive plant dynamics without canceling the natural potential,' but the precise mechanism (e.g., how the learned Hamiltonian is used inside the EB-PBC law) is not expanded; a short clarifying sentence or equation reference would improve readability.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments. The points raised concerning the scope of the stability and passivity claims, as well as the convergence properties of the alternating optimization, are important for clarifying the manuscript's contributions. We address each major comment below and specify the revisions that will be made.
read point-by-point responses
-
Referee: [Abstract] Abstract: The claim that the learned controller is 'provably stable' and renders the closed-loop system 'inherently passive' is load-bearing for the contribution, yet the provided text establishes these properties only with respect to the learned pH model. No Lipschitz bounds on the neural-network approximation of J, R, and H, nor any robustness margin for the EB-PBC design under model mismatch, are referenced; without such arguments the transfer of stability to the true plant remains unproven.
Authors: We agree that the passivity and stability properties are formally established only for the learned port-Hamiltonian model, since the EB-PBC design and closed-loop analysis are performed with respect to the parameterized dynamics. The abstract will be revised to explicitly qualify these guarantees as holding for the learned model. The dissipation regularization term is introduced precisely to promote robustness against model mismatch, and the empirical validation on the pendulum systems provides supporting evidence for practical transfer. We will add a dedicated discussion paragraph addressing approximation errors and the absence of explicit robustness margins, while avoiding any unsubstantiated transfer claims. revision: partial
-
Referee: [Abstract] Abstract and alternating-optimization description: The framework relies on the alternating loop converging to a fixed point that preserves the pH structure (skew-symmetry of J, R ≽ 0, convexity of H) so that the EB-PBC passivity proof continues to hold. No convergence analysis, contraction mapping, or even empirical monitoring of these structural invariants across iterations is supplied; this omission directly affects whether the optimality and stability claims survive the co-learning procedure.
Authors: The neural-network architectures are explicitly constructed to enforce the required pH structure (skew-symmetric interconnection matrix, positive-semidefinite dissipation matrix, and convex Hamiltonian) at every iteration by design. While a theoretical convergence analysis of the alternating procedure is not provided, we will augment the manuscript with empirical monitoring of the structural invariants—specifically, the skew-symmetry residual norm, the minimum eigenvalue of the dissipation matrix, and convexity checks on the Hamiltonian—across co-learning iterations. These results will be reported to demonstrate that the invariants are preserved in practice, thereby supporting the applicability of the EB-PBC stability arguments to the learned models. revision: partial
Circularity Check
No significant circularity; derivation grounded in external pH theory
full rationale
The paper parameterizes both the pH model and EB-PBC controller with neural networks that embed the standard port-Hamiltonian structure (skew-symmetric J, positive-semidefinite R, convex H) and energy-balancing passivity-based control form. Stability and passivity claims follow directly from classical pH passivity theory applied to the learned model, which is fitted to external trajectory data via alternating optimization. No central quantity is defined in terms of itself, no fitted parameter is renamed as a prediction, and no load-bearing step reduces to a self-citation chain or ansatz smuggled from prior author work. The dissipation regularization is an added training term, not a definitional loop. The framework remains self-contained against the independent benchmarks of pH theory and trajectory data.
Axiom & Free-Parameter Ledger
free parameters (1)
- dissipation regularization weight
axioms (2)
- domain assumption System dynamics admit a port-Hamiltonian representation
- domain assumption EB-PBC applied to a pH plant yields passivity and stability
Reference graph
Works this paper leans on
-
[1]
Port-hamiltonian systems theory: An introductory overview,
A. Van Der Schaft and D. Jeltsema, “Port-hamiltonian systems theory: An introductory overview,”Foundations and Trends®in Systems and Control, vol. 1, no. 2-3, pp. 173–378, 2014
work page 2014
-
[2]
Putting energy back in control,
R. Ortega, A. J. Van Der Schaft, I. Mareels, and B. Maschke, “Putting energy back in control,”IEEE Control Systems Magazine, vol. 21, no. 2, pp. 18–33, 2002
work page 2002
-
[3]
Van der Schaft,L 2-Gain and Passivity Techniques in Nonlinear Control
A. Van der Schaft,L 2-Gain and Passivity Techniques in Nonlinear Control. Springer, 2000
work page 2000
-
[4]
Control by interconnection and standard passivity-based control of port-Hamiltonian systems,
R. Ortega, A. van der Schaft, F. Castanos, and A. Astolfi, “Control by interconnection and standard passivity-based control of port-Hamiltonian systems,”IEEE Transactions on Automatic Control, vol. 53, no. 11, pp. 2527–2542, 2008
work page 2008
-
[5]
Reinforcement learning for port-Hamiltonian systems,
O. Sprangers, R. Babuˇ ska, S. P. Nageshrao, and G. A. D. Lopes, “Reinforcement learning for port-Hamiltonian systems,” IEEE Transactions on Cybernetics, vol. 45, no. 5, pp. 1017–1027, 2015. 16
work page 2015
-
[6]
A. Gheibi, A. R. Ghiasi, S. Ghaemi, and M. A. Badamchizadeh, “Designing of robust adaptive passivity-based controller based on reinforcement learning for nonlinear port-Hamiltonian model with disturbance,”International Journal of Control, vol. 93, no. 8, pp. 1754–1764, 2020
work page 2020
-
[7]
Optimal energy shaping via neural approxima- tors,
S. Massaroli, M. Poli, F. Califano, J. Park, A. Yamashita, and H. Asama, “Optimal energy shaping via neural approxima- tors,”SIAM Journal on Applied Dynamical Systems, vol. 21, no. 3, pp. 2126–2147, 2022
work page 2022
-
[8]
Total energy shaping with neural interconnection and damping assignment-passivity based control,
S. S.-E. Plaza, R. Reyes-B´ aez, and B. Jayawardhana, “Total energy shaping with neural interconnection and damping assignment-passivity based control,” inLearning for Dynamics and Control Conference, pp. 520–531, PMLR, 2022
work page 2022
-
[9]
Y. Okura, K. Fujimoto, I. Maruta, A. Saito, and H. Ikeda, “Bayesian inference for path following control of port- Hamiltonian systems with training trajectory data,”SICE Journal of Control, Measurement, and System Integration, vol. 13, no. 2, pp. 40–46, 2020
work page 2020
-
[10]
Physics-informed multi-agent reinforcement learning for distributed multi-robot problems,
E. Sebasti´ an, T. Duong, N. Atanasov, E. Montijano, and C. Sag¨ u´ es, “Physics-informed multi-agent reinforcement learning for distributed multi-robot problems,”IEEE Transactions on Robotics, 2025
work page 2025
-
[11]
Dissipation obstacle hampers control-by-interconnection methodology,
M. Zhang, R. Ortega, D. Jeltsema, and H. Su, “Dissipation obstacle hampers control-by-interconnection methodology,” IFAC-PapersOnLine, vol. 48, no. 13, pp. 123–128, 2015. 5th IFAC Workshop on Lagrangian and Hamiltonian Methods for Nonlinear Control LHMNC 2015
work page 2015
-
[12]
S. Greydanus, M. Dzamba, and J. Yosinski, “Hamiltonian neural networks,”Advances in Neural Information Processing Systems, vol. 32, 2019
work page 2019
-
[13]
Symplectic ODE-Net: Learning Hamiltonian dynamics with control,
Y. D. Zhong, B. Dey, and A. Chakraborty, “Symplectic ODE-Net: Learning Hamiltonian dynamics with control,” in International Conference on Learning Representations, 2020
work page 2020
-
[14]
Port-hamiltonian neural ODE networks on Lie groups for robot dynamics learning and control,
T. Duong, A. Altawaitan, J. Stanley, and N. Atanasov, “Port-hamiltonian neural ODE networks on Lie groups for robot dynamics learning and control,”IEEE Transactions on Robotics, vol. 40, pp. 3695–3715, 2024
work page 2024
-
[15]
Stochastic port-Hamiltonian neural networks: Universal approximation with passivity guarantees,
L. Di Persio, M. Ehrhardt, and A. Khedher, “Stochastic port-Hamiltonian neural networks: Universal approximation with passivity guarantees,”arXiv preprint arXiv:2603.10078, 2025
-
[16]
Learning subsystem dynamics in nonlinear systems via port-hamiltonian neural networks,
G. Van Otterdijk, S. Moradi, S. Weiland, R. T´ oth, N. Jaensson, and M. Schoukens, “Learning subsystem dynamics in nonlinear systems via port-hamiltonian neural networks,” in2025 IEEE 64th Conference on Decision and Control (CDC), pp. 2071–2076, IEEE, 2025
work page 2071
-
[17]
Learning switching port-hamiltonian systems with uncertainty quantification,
T. Beckers, T. Z. Jiahao, and G. J. Pappas, “Learning switching port-hamiltonian systems with uncertainty quantification,” IFAC-PapersOnLine, vol. 56, no. 2, pp. 525–532, 2023
work page 2023
-
[18]
Gaussian process port-hamiltonian systems: Bayesian learning with physics prior,
T. Beckers, J. Seidman, P. Perdikaris, and G. J. Pappas, “Gaussian process port-hamiltonian systems: Bayesian learning with physics prior,” in2022 IEEE 61st Conference on Decision and Control (CDC), pp. 1447–1453, IEEE, 2022
work page 2022
-
[19]
Stable port-hamiltonian neural networks,
F. J. Roth, D. K. Klein, M. Kannapinn, J. Peters, and O. Weeger, “Stable port-hamiltonian neural networks,”arXiv preprint arXiv:2502.02480, 2025
-
[20]
Learning neural koopman operators with dissipativity guarantees,
Y. Xu, S. Sivaranjani, and V. Gupta, “Learning neural koopman operators with dissipativity guarantees,” in2025 IEEE 64th Conference on Decision and Control (CDC), pp. 2064–2070, IEEE, 2025
work page 2064
-
[21]
Control-oriented system identification: Classical, learning, and physics-informed approaches,
S. Sivaranjani, Y. Shi, N. Atanasov, T. Duong, J. Feng, T. Martin, Y. Xu, V. Gupta, and F. Allg¨ ower, “Control-oriented system identification: Classical, learning, and physics-informed approaches,”arXiv preprint arXiv:2512.06315, 2025
-
[22]
Lyapunov-stable neural-network control
H. Dai, B. Landry, L. Yang, M. Pavone, and R. Tedrake, “Lyapunov-stable neural-network control,”arXiv preprint arXiv:2109.14152, 2021
-
[23]
Y.-C. Chang, N. Roohi, and S. Gao, “Neural lyapunov control,”Advances in neural information processing systems, vol. 32, 2019
work page 2019
-
[24]
A survey on physics informed reinforcement learning: Review and open problems,
C. Banerjee, K. Nguyen, C. Fookes, and M. Raissi, “A survey on physics informed reinforcement learning: Review and open problems,”Expert Systems with Applications, vol. 287, p. 128166, 2025
work page 2025
-
[25]
Neural ordinary differential equations,
R. T. Q. Chen, Y. Rubanova, J. Bettencourt, and D. K. Duvenaud, “Neural ordinary differential equations,” inAdvances in Neural Information Processing Systems, vol. 31, 2018
work page 2018
-
[26]
Multi-objective loss balancing for physics-informed deep learning,
R. Bischof and M. A. Kraus, “Multi-objective loss balancing for physics-informed deep learning,”Computer Methods in Applied Mechanics and Engineering, vol. 439, p. 117914, 2025
work page 2025
-
[27]
Understanding and mitigating gradient flow pathologies in physics-informed neural networks,
S. Wang, Y. Teng, and P. Perdikaris, “Understanding and mitigating gradient flow pathologies in physics-informed neural networks,”SIAM Journal on Scientific Computing, vol. 43, no. 5, pp. A3055–A3081, 2021. 17
work page 2021
-
[28]
Automatic differentiation in machine learning: a survey,
A. G. Baydin, B. A. Pearlmutter, A. A. Radul, and J. M. Siskind, “Automatic differentiation in machine learning: a survey,”Journal of Machine Learning Research, vol. 18, no. 153, pp. 1–43, 2018
work page 2018
-
[29]
SGDR: Stochastic gradient descent with warm restarts,
I. Loshchilov and F. Hutter, “SGDR: Stochastic gradient descent with warm restarts,” inInternational Conference on Learning Representations, 2017
work page 2017
-
[30]
J. J. Mason, C. Allen-Blanchette, N. Zolman, E. Davison, and N. E. Leonard, “Learning to predict 3D rotational dynamics from images of a rigid body with unknown mass distribution,”Aerospace, vol. 10, no. 11, p. 921, 2023
work page 2023
-
[31]
Unsupervised learning of Lagrangian dynamics from images for prediction and control,
Y. D. Zhong and N. Leonard, “Unsupervised learning of Lagrangian dynamics from images for prediction and control,” Advances in Neural Information Processing Systems, vol. 33, pp. 10741–10752, 2020. 18
work page 2020
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.