pith. sign in

arxiv: 2511.21597 · v2 · pith:3GDKRFR4new · submitted 2025-11-26 · 🧮 math.NA · cs.NA

Low-Rank Solvers for Energy-Conserving Hamiltonian Boundary Value Methods

Pith reviewed 2026-05-21 19:03 UTC · model grok-4.3

classification 🧮 math.NA cs.NA
keywords Hamiltonian Boundary Value Methodslow-rank matrix equationsKrylov solversNewton-Krylov methodsenergy conservationHamiltonian systemswave equationsadaptive time-stepping
0
0 comments X

The pith

HBVM stage equations reformulate as low-rank matrix equations that admit efficient Krylov solvers.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper establishes that the stage equations arising in Hamiltonian Boundary Value Methods can be rewritten as matrix equations whose right-hand side has low rank. The authors use this algebraic structure to construct Krylov projection solvers for the linear case and to accelerate simplified Newton iterations as well as preconditioned Newton-Krylov schemes for nonlinear problems. Adaptive time-stepping is incorporated to maintain robustness. Tests on semi-discretized wave equations show that the resulting methods remain efficient while preserving the long-term energy conservation that makes HBVMs attractive for Hamiltonian simulations.

Core claim

The stage equations of energy-conserving Hamiltonian Boundary Value Methods reformulate as matrix equations with a low-rank right-hand side. This structure is exploited via Krylov projection solvers for linear systems and within simplified Newton iterations and as a preconditioner in Newton-Krylov frameworks for nonlinear systems, combined with adaptive time-stepping, to yield efficient and robust solutions as demonstrated on semi-discretized wave equations.

What carries the argument

The low-rank right-hand side in the reformulated HBVM stage matrix equations, which is exploited to build projection-based solvers that avoid full matrix operations.

If this is right

  • Linear stage systems are solved with Krylov methods whose cost depends on the low rank rather than the full problem size.
  • Nonlinear systems converge faster through simplified Newton steps that leverage the same low-rank property.
  • Preconditioning in Newton-Krylov iterations becomes cheaper and more effective.
  • Adaptive time-stepping ensures reliable performance without sacrificing the energy-conserving property of the underlying integrator.
  • Large-scale simulations of Hamiltonian systems, such as wave equations, become computationally feasible over long times.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The approach may generalize to other boundary value methods or multi-stage integrators that possess analogous low-rank structures.
  • Combining this solver with spatial adaptivity could further reduce costs in problems where both time and space features vary.
  • Similar low-rank reformulations might appear in related geometric integrators, suggesting a broader class of problems where this technique applies.
  • Verification on additional Hamiltonian systems like rigid-body or molecular dynamics models would test the generality of the savings.

Load-bearing premise

The right-hand side of the stage matrix equations exhibits a sufficiently low rank for the systems that arise from typical semi-discretizations of Hamiltonian problems.

What would settle it

Numerical experiments in which the effective rank of the right-hand side scales linearly with the spatial mesh size, causing Krylov iteration counts to match those of standard full-rank solvers.

Figures

Figures reproduced from arXiv: 2511.21597 by Fabio Durastante, Mariarosa Mazza.

Figure 1
Figure 1. Figure 1: Linear case. We report here the number of iteration of the Krylov projection method [PITH_FULL_IMAGE:figures/full_fig_p014_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Linear case. We report here the time per step of the overall HBVM( [PITH_FULL_IMAGE:figures/full_fig_p015_2.png] view at source ↗
Figure 4
Figure 4. Figure 4: Nonlinear test case 1 (𝑠 = 2, 𝑘 = 3). The top figure reports an histogram of the number of Newton iterations per time-step on the left 𝑦-axis, while on the right there is the residual measured as the norm of ∥F (Φ) ∥𝐹. On the left plot of the second row we have an heatmap with the number of FGMRES iterations preconditioned with the matrix equation solver preconditioner per time step VS Newton iteration; on… view at source ↗
Figure 5
Figure 5. Figure 5: Nonlinear test case 2 (𝑠 = 3, 𝑘 = 6). The top figure reports an histogram of the number of Newton iterations per time-step (averaged every 5 time-steps) on the left 𝑦-axis, while on the right there is the residual measured as the norm of ∥F (Φ) ∥𝐹 for each time-step. On the left plot of the second row we have an heatmap with the number of FGMRES iterations preconditioned with the matrix equation solver pre… view at source ↗
read the original abstract

We study energy-conserving Hamiltonian Boundary Value Methods (HBVMs) for Hamiltonian systems, which arise in applications where long-term preservation of energy and symplecticity is essential. HBVMs are multi-stage schemes whose stage equations reformulate as matrix equations with a low-rank right-hand side. For linear systems, we exploit this structure directly via Krylov projection solvers. For nonlinear systems, we leverage it within simplified Newton iterations and as a preconditioner in a Newton--Krylov framework, combined with adaptive time-stepping for robust convergence. Numerical experiments on semi-discretized wave equations demonstrate the efficiency and robustness of the proposed approach.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript develops low-rank solvers for energy-conserving Hamiltonian Boundary Value Methods (HBVMs). Stage equations are reformulated as matrix equations whose right-hand side has low rank; this structure is exploited via Krylov projection methods for linear systems, simplified Newton iterations and Newton-Krylov preconditioning for nonlinear systems, and adaptive time-stepping. Numerical experiments on semi-discretized wave equations are presented to illustrate efficiency and robustness.

Significance. If the low-rank property of the right-hand side persists under typical semi-discretizations and remains small relative to the spatial dimension, the approach could yield substantial savings in the solution of HBVM stage equations while preserving the energy-conserving and symplectic properties of the integrator. This would be particularly valuable for long-time integration of Hamiltonian PDEs such as wave equations.

major comments (2)
  1. [§3 (reformulation of stage equations)] The central performance claim rests on the assumption that the effective numerical rank of the right-hand side remains small compared with the dimension of the semi-discretized spatial operator. No a priori bound or scaling analysis is given for how this rank behaves with mesh size, number of stages, or the HBVM collocation matrix.
  2. [§5 (numerical experiments)] Numerical experiments assert efficiency and robustness on wave equations, yet the reported results contain no tables or plots of effective rank, iteration counts versus spatial degrees of freedom, or timing comparisons that would confirm the low-rank structure delivers the claimed savings.
minor comments (2)
  1. [Abstract] The abstract would be strengthened by a single quantitative statement (e.g., observed rank or iteration reduction factor) drawn from the experiments.
  2. [§2–§4] Notation for the low-rank factorization of the right-hand side should be introduced once and used consistently across the linear and nonlinear sections.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the thorough review and valuable suggestions. We address each major comment below and will revise the manuscript to strengthen the presentation of the low-rank structure and its computational benefits.

read point-by-point responses
  1. Referee: [§3 (reformulation of stage equations)] The central performance claim rests on the assumption that the effective numerical rank of the right-hand side remains small compared with the dimension of the semi-discretized spatial operator. No a priori bound or scaling analysis is given for how this rank behaves with mesh size, number of stages, or the HBVM collocation matrix.

    Authors: We agree that an explicit discussion of the rank bound would clarify the theoretical foundation. In the reformulation of the stage equations, the right-hand side takes the form of a product involving the HBVM collocation matrix, which has rank at most equal to the number of stages s. This yields an a priori bound on the rank that is independent of the spatial mesh size. We will add a dedicated remark in §3 stating this bound, discussing its scaling with s and the properties of the collocation matrix, and noting that the effective numerical rank observed in practice remains close to this theoretical value for the wave-equation discretizations considered. revision: yes

  2. Referee: [§5 (numerical experiments)] Numerical experiments assert efficiency and robustness on wave equations, yet the reported results contain no tables or plots of effective rank, iteration counts versus spatial degrees of freedom, or timing comparisons that would confirm the low-rank structure delivers the claimed savings.

    Authors: We acknowledge that the current numerical section would benefit from more direct evidence of the savings. In the revised manuscript we will expand §5 with additional tables and figures that report the effective numerical rank as a function of spatial degrees of freedom, solver iteration counts versus mesh refinement, and wall-clock timing comparisons between the proposed low-rank solvers and standard dense implementations. These additions will quantify the computational advantage while preserving the energy-conserving properties of the HBVM. revision: yes

Circularity Check

0 steps flagged

No circularity: standard low-rank Krylov exploitation on HBVM reformulation

full rationale

The paper reformulates HBVM stage equations as matrix equations possessing a low-rank right-hand side and then applies established Krylov projection methods for the linear case together with simplified Newton and Newton-Krylov preconditioning for the nonlinear case. These steps rest on well-known linear-algebra facts about low-rank matrices and iterative solvers rather than on any self-definition, parameter fitting that is later relabeled as prediction, or load-bearing self-citation chains. The reported numerical results on semi-discretized wave equations function as external validation of computational savings; they do not enter the derivation of the solver itself. Consequently the claimed derivation chain remains independent of the experimental outcomes and receives the lowest circularity score.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on the standard domain assumption that HBVMs exactly conserve energy for Hamiltonian systems and on the algebraic fact that the stage right-hand side admits a low-rank factorization; no new entities or fitted parameters are introduced in the abstract.

axioms (2)
  • domain assumption Hamiltonian systems conserve a quadratic or general energy functional that HBVMs are constructed to preserve exactly.
    Invoked implicitly when the paper states that HBVMs are energy-conserving schemes for Hamiltonian systems.
  • domain assumption The stage equations of an HBVM can be rewritten as a matrix equation whose right-hand side has low numerical rank.
    This algebraic structure is the starting point for all proposed solvers.

pith-pipeline@v0.9.0 · 5633 in / 1440 out tokens · 87632 ms · 2026-05-21T19:03:46.787120+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

39 extracted references · 39 canonical work pages

  1. [1]

    Beckermann, B.: An error analysis for rational Galerkin projection applied to the Sylvester equation. SIAM J. Numer. Anal.49(6), 2430–2450 (2011). DOI 10.1137/110824590. URL https://doi.org/10.1137/110824590

  2. [2]

    In: Numerical methods for ordinary differential equations (L’Aquila, 1987),Lecture Notes in Math., vol

    Bellen, A.: Parallelism across the steps for difference and differential equations. In: Numerical methods for ordinary differential equations (L’Aquila, 1987),Lecture Notes in Math., vol. 1386, pp. 22–35. Springer, Berlin, Berlin (1989). DOI 10.1007/BFb0089229. URL https://doi.org/10. 1007/BFb0089229

  3. [3]

    Bellen, A., Vermiglio, R., Zennaro, M.: Parallel ODE-solvers with stepsize control. J. Comput. Appl. Math.31(2), 277–293 (1990). DOI 10.1016/0377-0427(90)90170-5. URL https://doi. org/10.1016/0377-0427(90)90170-5

  4. [4]

    Bergermann, K., Stoll, M.: Adaptive rational Krylov methods for exponential Runge-Kutta integrators. SIAM J. Matrix Anal. Appl.45(1), 744–770 (2024). DOI 10.1137/23M1559439. URL https://doi.org/10.1137/23M1559439

  5. [5]

    Berljafa, M., G¨ uttel, S.: The RKFIT algorithm for nonlinear rational approximation. SIAM J. Sci. Comput.39(5), A2049–A2071 (2017). DOI 10.1137/15M1025426. URL https://doi.org/10. 1137/15M1025426

  6. [6]

    Brugnano, L., Gurioli, G., Iavernaro, F.: Numerical solution of FDE-IVPs by using frac- tional HBVMs: the fhbvm code. Numer. Algorithms99(1), 463–489 (2025). DOI 10.1007/s11075-024-01884-y. URLhttps://doi.org/10.1007/s11075-024-01884-y

  7. [7]

    Monographs and Research Notes in Mathematics

    Brugnano, L., Iavernaro, F.: Line integral methods for conservative problems. Monographs and Research Notes in Mathematics. CRC Press, Boca Raton, FL (2016). DOI https://doi.org/10.1201/ b19319

  8. [8]

    Brugnano, L., Iavernaro, F., Trigiante, D.: Hamiltonian boundary value methods (energy preserving discrete line integral methods). JNAIAM. J. Numer. Anal. Ind. Appl. Math.5(1-2), 17–37 (2010)

  9. [9]

    Brugnano, L., Iavernaro, F., Trigiante, D.: A note on the efficient implementation of Hamiltonian BVMs. J. Comput. Appl. Math.236(3), 375–383 (2011). DOI 10.1016/j.cam.2011.07.022. URL https://doi.org/10.1016/j.cam.2011.07.022

  10. [10]

    SIAM Journal on Scientific Computing46(2), A798–A824 (2024)

    Casulli, A., Robol, L.: An Efficient Block Rational Krylov Solver for Sylvester Equations with Adaptive Pole Selection. SIAM Journal on Scientific Computing46(2), A798–A824 (2024). DOI 10.1137/23M1548463. URLhttps://doi.org/10.1137/23M1548463

  11. [11]

    Software Impacts15(2023)

    D’ Ambra, P., Durastante, F., Filippone, S.: Parallel sparse computation toolkit[formula presented]. Software Impacts15(2023). DOI 10.1016/j.simpa.2022.100463. URL http://doi.org/10. 1016/j.simpa.2022.100463 Low-Rank Solvers for HBVMs 21

  12. [12]

    ACM Trans

    Davis, T.A.: Algorithm 832: UMFPACK V4.3—an unsymmetric-pattern multifrontal method. ACM Trans. Math. Software30(2), 196–199 (2004). DOI 10.1145/992200.992206. URL https://doi.org/10.1145/992200.992206

  13. [13]

    Proceedings of the Royal Irish Academy

    Dirac, P.A.M.: Hamiltonian methods and quantum mechanics. Proceedings of the Royal Irish Academy. Section A: Mathematical and Physical Sciences63, 49–59 (1963). URL http://www. jstor.org/stable/20488626

  14. [14]

    Dravins, I., Serra-Capizzano, S., Neytcheva, M.: Spectral analysis of preconditioned matrices arising from stage-parallel implicit Runge-Kutta methods of arbitrarily high order. SIAM J. Matrix Anal. Appl.45(2), 1007–1034 (2024). DOI 10.1137/23M1552498. URL https://doi.org/10. 1137/23M1552498

  15. [15]

    URLhttps://arxiv.org/abs/2505.17719

    Durastante, F., Mazza, M.: Stage-parallel implicit runge–kutta methods via low-rank matrix equation corrections (2025). URLhttps://arxiv.org/abs/2505.17719

  16. [16]

    Gander, M.J., Wu, S.L.: A diagonalization-based parareal algorithm for dissipative and wave propagation problems. SIAM J. Numer. Anal.58(5), 2981–3009 (2020). DOI 10.1137/19M1271683. URLhttps://doi.org/10.1137/19M1271683

  17. [17]

    Oxford Univer- sity Press (2018).https://doi.org/10.1093/oso/9780198814788.001.0001

    Gautschi, W.: Orthogonal polynomials: computation and approximation. Numerical Mathematics and Scientific Computation. Oxford University Press, New York (2004). DOI https://doi.org/10. 1093/oso/9780198506720.001.0001

  18. [18]

    II,Springer Series in Computational Mathematics, vol

    Hairer, E., Wanner, G.: Solving ordinary differential equations. II,Springer Series in Computational Mathematics, vol. 14, second edn. Springer-Verlag, Berlin (1996). DOI 10.1007/978-3-642-05221-7

  19. [19]

    Monthly Notices of the Royal Astronomical Society468(3), 2614– 2636 (2017)

    Hernandez, D.M., Dehnen, W.: A study of symplectic integrators for planetary system problems: error analysis and comparisons. Monthly Notices of the Royal Astronomical Society468(3), 2614– 2636 (2017). DOI 10.1093/mnras/stx547. URLhttps://doi.org/10.1093/mnras/stx547

  20. [20]

    van der Houwen, P.J., Sommeijer, B.P.: Iterated Runge-Kutta methods on parallel computers. SIAM J. Sci. Statist. Comput.12(5), 1000–1028 (1991). DOI 10.1137/0912054

  21. [21]

    van der Houwen, P.J., Sommeijer, B.P.: Preconditioning in parallel Runge-Kutta methods for stiff initial value problems. pp. 17–31 (1994). DOI 10.1016/0898-1221(94)00183-9

  22. [22]

    van der Houwen, P.J., Sommeijer, B.P., Couzy, W.: Embedded diagonally implicit Runge-Kutta algorithms on parallel computers. Math. Comp.58(197), 135–159 (1992). DOI 10.2307/2153025. URLhttps://doi.org/10.2307/2153025

  23. [23]

    Iserles, A., Nørsett, S.P.: On the theory of parallel Runge-Kutta methods. IMA J. Numer. Anal. 10(4), 463–488 (1990). DOI 10.1093/imanum/10.4.463. URL https://doi.org/10.1093/ imanum/10.4.463

  24. [24]

    Jay, L.O.: Inexact simplified Newton iterations for implicit Runge-Kutta methods. SIAM J. Numer. Anal.38(4), 1369–1388 (2000). DOI 10.1137/S0036142999360573. URL https: //doi.org/10.1137/S0036142999360573

  25. [25]

    Kelley, C.T.: Iterative methods for linear and nonlinear equations,Frontiers in Applied Mathematics, vol. 16. Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA (1995). DOI 10.1137/1.9781611970944. URLhttps://doi.org/10.1137/1.9781611970944

  26. [26]

    Cambridge Monographs on Applied and Computational Mathematics

    Leimkuhler, B., Reich, S.: Simulating Hamiltonian Dynamics. Cambridge Monographs on Applied and Computational Mathematics. Cambridge University Press (2005)

  27. [27]

    Leimkuhler, B.J., Reich, S., Skeel, R.D.: Integration Methods for Molecular Dynamics, pp. 161–

  28. [28]

    DOI 10.1007/978-1-4612-4066-2 10

    Springer New York, New York, NY (1996). DOI 10.1007/978-1-4612-4066-2 10. URL https://doi.org/10.1007/978-1-4612-4066-2_10

  29. [29]

    Leveque, S., Bergamaschi, L., Mart ´ınez, A., Pearson, J.W.: Parallel-in-time solver for the all- at-once Runge-Kutta discretization. SIAM J. Matrix Anal. Appl.45(4), 1902–1928 (2024). DOI 10.1137/23M1567862. URLhttps://doi.org/10.1137/23M1567862

  30. [30]

    Physics of Plasmas12(5), 058102 (2005)

    Morrison, P.J.: Hamiltonian and action principle formulations of plasma physicsa). Physics of Plasmas12(5), 058102 (2005). DOI 10.1063/1.1882353. URL https://doi.org/10.1063/1. 1882353

  31. [31]

    Munch, P., Dravins, I., Kronbichler, M., Neytcheva, M.: Stage-parallel fully implicit Runge- Kutta implementations with optimal multilevel preconditioners at the scaling limit. SIAM J. Sci. Comput.46(2), S71–S96 (2024). DOI 10.1137/22M1503270. URL https://doi.org/10. 1137/22M1503270

  32. [32]

    Saad, Y.: A flexible inner-outer preconditioned GMRES algorithm. SIAM J. Sci. Comput.14(2), 461–469 (1993). DOI 10.1137/0914028. URLhttps://doi.org/10.1137/0914028 22 F. Durastante, M. Mazza

  33. [33]

    Saad, Iterative Methods for Sparse Linear Systems, Society for Industrial and Applied Mathematics, 2003

    Saad, Y.: Iterative methods for sparse linear systems, second edn. Society for Industrial and Applied Mathematics, Philadelphia, PA (2003). DOI 10.1137/1.9780898718003

  34. [34]

    Saad, Y., Schultz, M.H.: GMRES: a generalized minimal residual algorithm for solving nonsym- metric linear systems. SIAM J. Sci. Statist. Comput.7(3), 856–869 (1986). DOI 10.1137/0907058. URLhttps://doi.org/10.1137/0907058

  35. [35]

    Simoncini, V.: A new iterative method for solving large-scale Lyapunov matrix equations. SIAM J. Sci. Comput.29(3), 1268–1288 (2007). DOI 10.1137/06066120X. URL https://doi.org/10. 1137/06066120X

  36. [36]

    Simoncini , Computational methods for linear matrix equations , SIAM Rev., 58 (2016), pp

    Simoncini, V.: Computational methods for linear matrix equations. SIAM Rev.58(3), 377–441 (2016). DOI 10.1137/130912839. URLhttps://doi.org/10.1137/130912839

  37. [37]

    Sleijpen, G.L.G., van der Vorst, H.A., Fokkema, D.R.: BiCGstab(𝑙) and other hybrid Bi-CG methods. Numer. Algorithms7(1), 75–109 (1994). DOI 10.1007/BF02141261. URL https: //doi.org/10.1007/BF02141261

  38. [38]

    Southworth, B.S., Krzysik, O., Pazner, W.: Fast solution of fully implicit Runge-Kutta and discontinuous Galerkin in time for numerical PDEs, Part II: Nonlinearities and DAEs. SIAM J. Sci. Comput.44(2), A636–A663 (2022). DOI 10.1137/21M1390438. URL https://doi.org/10. 1137/21M1390438

  39. [39]

    Southworth, B.S., Krzysik, O., Pazner, W., De Sterck, H.: Fast solution of fully implicit Runge-Kutta and discontinuous Galerkin in time for numerical PDEs, Part I: The linear setting. SIAM J. Sci. Comput.44(1), A416–A443 (2022). DOI 10.1137/21M1389742. URL https://doi.org/10. 1137/21M1389742