An Energy Stable Approach for Learning Derivative Operators from Noisy Data for Maxwells Equations
Pith reviewed 2026-05-16 18:23 UTC · model grok-4.3
The pith
Reduced parameterization lets SP-ADMM learn energy-conserving derivative operators for Maxwell equations directly from noisy data.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
SP-ADMM enforces skew-adjointness of the learned derivative operator by construction through a reduced parameterization of the stencil coefficients, enabling it to learn accurate operators from noisy data while preserving energy conservation to roundoff accuracy in the underlying Maxwell system, outperforming standard constrained ADMM in error metrics across multiple regimes.
What carries the argument
The reduced parameterization using only independent positive-side stencil coefficients in the compact periodic convolution representation, which enforces the skew-adjoint property required for energy stability by construction.
If this is right
- SP-ADMM achieves the smallest final-time electric-field error while preserving energy to roundoff accuracy across clean data, noisy derivative data, multiple initial conditions, different hidden operators, training-set sizes, and long-time simulations.
- The learned stencil remains competitive with classical finite differences in physical reflection and transmission settings.
- Performance holds under constraint ablations and changes in regularization parameters.
- The method works effectively in hidden-operator and noisy-data regimes without post-hoc projection steps.
Where Pith is reading between the lines
- The same reduced-parameterization idea could apply to learning structure-preserving operators for other wave or transport equations where energy or other invariants must be respected.
- Lowering the number of free coefficients may reduce the volume of training data needed to reach a given accuracy level.
- The technique suggests a route to embed discrete conservation laws directly into neural or stencil-based PDE solvers without separate penalty terms.
- Testing the approach on two- or three-dimensional Maxwell systems would reveal whether the one-dimensional construction scales when skew-adjointness involves more coupled components.
Load-bearing premise
Enforcing skew-adjointness solely through the reduced parameterization of positive-side stencil coefficients is sufficient to guarantee energy stability for the learned operator in the Maxwell system without introducing approximation errors or limiting expressivity.
What would settle it
A long-time simulation in which the learned operator produces energy drift larger than roundoff or final-time electric-field errors exceeding those of classical finite differences in the layered-medium test.
Figures
read the original abstract
We develop a structure-preserving ADMM method, denoted SP-ADMM, for learning energy-stable spatial derivative stencils for Maxwell equations from noisy data. Starting from the source-free Maxwell system, we focus on a one-dimensional reduction whose energy conservation depends on the skew-adjointness of the spatial derivative operator. The learned derivative is represented by a compact periodic convolution stencil. Unlike standard constrained ADMM, which learns the full stencil and imposes skew-adjointness through equality constraints, SP-ADMM enforces skew-adjointness by construction through a reduced parameterization using only the independent positive-side stencil coefficients. Numerical experiments show that SP-ADMM is especially effective in hidden operator and noisy-data regimes. Across clean data, noisy derivative data, multiple initial conditions, different hidden skew-adjoint operators, training-set sizes, regularization parameters, constraint ablations, and long-time simulations, SP-ADMM achieves the smallest final-time electric-field error while preserving energy to roundoff accuracy. A layered-medium Maxwell propagation test further shows that the learned structure-preserving stencil remains competitive with classical finite differences in a physical reflection/transmission setting. Overall, SP-ADMM provides a data-driven way to learn accurate Maxwell stencils while retaining the energy-conserving structure of the underlying equations.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes SP-ADMM, a structure-preserving ADMM algorithm for learning energy-stable derivative stencils for the 1D Maxwell equations from noisy data. The key innovation is a reduced parameterization of the compact periodic convolution stencil that enforces skew-adjointness by construction using only the positive-side coefficients, thereby guaranteeing discrete energy conservation. Experiments across clean and noisy data, various initial conditions, hidden operators, and long-time simulations show SP-ADMM yielding the smallest final-time electric field errors while maintaining energy to roundoff accuracy, and performing competitively in a layered-medium propagation test.
Significance. If the results hold, this work provides a principled data-driven approach to learning numerical operators that preserve the energy stability essential for long-time accuracy in wave propagation problems like Maxwell's equations. The exact enforcement of skew-adjointness without loss of expressivity (the reduced parameterization spans the full space of skew-adjoint stencils of given width) is a notable strength, as is the demonstration of robustness to noise and the competitiveness with classical finite differences in a physical setting. This could impact fields requiring stable learned discretizations from data.
minor comments (3)
- §2.2: The notation for the positive-side coefficients p_k could be introduced with an explicit low-order example (e.g., m=1) to make the reparameterization immediately clear to readers.
- Table 2: The caption should state the precise stencil width and regularization parameter values used for each row so that the ablation results can be reproduced without consulting the text.
- Figure 4: The long-time energy plot would be more informative if the vertical axis were scaled to show the roundoff-level deviation explicitly (e.g., 10^{-14} to 10^{-16}) rather than a generic log scale.
Simulated Author's Rebuttal
We thank the referee for the positive assessment of our manuscript on SP-ADMM for learning energy-stable derivative stencils from noisy data. The recommendation for minor revision is appreciated, and we note that no specific major comments were raised in the report.
Circularity Check
No significant circularity identified
full rationale
The paper's derivation enforces skew-adjointness directly via reduced parameterization of positive-side stencil coefficients, which by the 1D Maxwell energy identity guarantees discrete conservation to machine precision for any choice of free coefficients. This is a valid structural enforcement rather than a self-definitional loop or fitted input renamed as prediction. The performance claims (smallest final-time error across regimes) rest on numerical experiments, not on tautological reduction of outputs to inputs. No self-citations, uniqueness theorems from prior author work, or ansatzes smuggled via citation appear in the load-bearing steps; the construction is self-contained against the stated energy identity.
Axiom & Free-Parameter Ledger
free parameters (1)
- regularization parameters
axioms (1)
- domain assumption Energy conservation in the source-free Maxwell system depends on the skew-adjointness of the spatial derivative operator.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel (J-cost uniqueness) unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Theorem 2 (Characterization of skew-symmetric stencils)... w0=0, w_{-ℓ}=-w_{+ℓ}... equivalent to skew-adjointness... guarantees exact conservation of discrete electromagnetic energy
-
IndisputableMonolith/Foundation/AlexanderDuality.leanalexander_duality_circle_linking (D=3 forcing) unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
SP-ADMM enforces skew-adjointness by construction through a reduced parameterization using only the independent positive-side stencil coefficients
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
J. Bar-Sinai, S. Hoyer, J. Hickey, and M. Levine. Learning data-driven discretizations for partial differential equations.Proceedings of the National Academy of Sciences, 116(31):15344– 15349, 2019
work page 2019
-
[2]
Vrushali A. Bokil and Nathan L. Gibson. Analysis of spatial high-order finite difference methods for maxwell’s equations in dispersive media.IMA Journal of Numerical Analysis, 31(3):1322–1350, 2011
work page 2011
-
[3]
Vrushali A. Bokil and Nathan L. Gibson. High spatial order energy stable fdtd methods for maxwell’s equations in nonlinear optical media in one dimension.Journal of Scientific Computing, 77(1):330–371, 2018
work page 2018
-
[4]
Vrushali A. Bokil, Olivia A. Keefer, and Amy C.-Y. Leung. Operator splitting methods for maxwell’s equations in dispersive media with orientational polarization.Journal of Computa- tional and Applied Mathematics, 263:160–188, 2014
work page 2014
-
[5]
Stephen Boyd, Neal Parikh, Eric Chu, Borja Peleato, and Jonathan Eckstein. Distributed optimization and statistical learning via the alternating direction method of multipliers.Foun- dations and Trends in Machine Learning, 3(1):1–122, 2011
work page 2011
-
[6]
Message passing neural pde solvers.arXiv preprint arXiv:2202.03376, 2022
Johannes Brandstetter, Daniel E. Worrall, and Max Welling. Message passing neural pde solvers.arXiv preprint arXiv:2202.03376, 2022
-
[7]
Steven L. Brunton, Joshua L. Proctor, and J. Nathan Kutz. Discovering governing equations from data by sparse identification of nonlinear dynamical systems.Proceedings of the National Academy of Sciences, 113(15):3932–3937, 2016
work page 2016
-
[8]
Ernst Hairer, Christian Lubich, and Gerhard Wanner.Geometric Numerical Integration: Structure-Preserving Algorithms for Ordinary Differential Equations. Springer, 2nd edition, 2006. 18
work page 2006
-
[9]
Ernst Hairer and Gerhard Wanner.Solving Ordinary Differential Equations II: Stiff and Differential-Algebraic Problems, volume 14 ofSpringer Series in Computational Mathematics. Springer, 2 edition, 2010
work page 2010
-
[10]
Alexander Heinlein, Yuwei Chen, and George E. Karniadakis. Multi-fidelity domain- decomposition physics-informed neural operators for time-dependent partial differential equa- tions.arXiv preprint arXiv:2508.03774, 2025
-
[11]
Smith, Ayya Alieva, Qing Wang, Michael P
Dmitrii Kochkov, Jamie A. Smith, Ayya Alieva, Qing Wang, Michael P. Brenner, and Stephan Hoyer. Machine learning–accelerated computational fluid dynamics.Proceedings of the Na- tional Academy of Sciences, 118(21):e2101784118, 2021
work page 2021
-
[12]
LeVeque.Finite Difference Methods for Ordinary and Partial Differential Equations
Randall J. LeVeque.Finite Difference Methods for Ordinary and Partial Differential Equations. SIAM, 2007
work page 2007
-
[13]
Fourier Neural Operator for Parametric Partial Differential Equations
Zongyi Li, Nikola Borislavov Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhat- tacharya, Andrew M. Stuart, and Anima Anandkumar. Fourier neural operator for parametric partial differential equations. InProceedings of the International Conference on Learning Rep- resentations (ICLR), 2021. arXiv:2010.08895
work page internal anchor Pith review Pith/arXiv arXiv 2021
-
[14]
Julia Ling, Andrew Kurzawski, and Jeremy Templeton. Reynolds averaged turbulence mod- elling using deep neural networks with embedded invariance.Journal of Fluid Mechanics, 807:155–166, 2016
work page 2016
-
[15]
Cheng Liu, Johannes Brandstetter, Nils Thuerey, and Kiwon Um. clawno: Conservation law–encoded neural operators.Advances in Neural Information Processing Systems, 37, 2024
work page 2024
-
[16]
Yuxuan Liu, Harsh Patel, Ana Renganathan, and Lu Lu. Conservation-preserved fourier neural operator through adaptive correction.Computer Methods in Applied Mechanics and Engineering, 2025. To appear
work page 2025
-
[17]
Yunfan Ma, Mingfei Wu, Zhihao Peng, Baile Zhang, and Zhenglong Xu. Pic 2o-sim: A physics- inspired causality-aware dynamic convolutional neural operator for ultra-fast photonic device fdtd simulation.arXiv preprint arXiv:2406.17810, 2024
-
[18]
Yunan Mao, Yiming Li, Luca Magri, and George E. Karniadakis. Towards general neural surrogate solvers with specialized neural accelerators: Domain-decomposition neural operators for large-scale pdes.arXiv preprint arXiv:2405.02351, 2024
-
[19]
Peter Monk.Finite Element Methods for Maxwell’s Equations. Oxford University Press, 2003
work page 2003
-
[20]
K. W. Morton and D. F. Mayers.Numerical Solution of Partial Differential Equations: An Introduction. Cambridge University Press, 1994
work page 1994
-
[21]
Eric J. Parish and Karthik Duraisamy. A paradigm for data-driven predictive modeling using field inversion and machine learning.Journal of Computational Physics, 305:758–774, 2016
work page 2016
-
[22]
Maziar Raissi, Paris Perdikaris, and George E. Karniadakis. Hidden physics models: Machine learning of nonlinear partial differential equations.Journal of Computational Physics, 357:125– 141, 2018
work page 2018
-
[23]
R. D. Richtmyer and K. W. Morton.Difference Methods for Initial-Value Problems. Wiley- Interscience, 2 edition, 1967. 19
work page 1967
-
[24]
Samuel H. Rudy, Steven L. Brunton, Joshua L. Proctor, and J. Nathan Kutz. Data-driven discovery of partial differential equations.Science Advances, 3(4):e1602614, 2017
work page 2017
-
[25]
Hagness.Computational Electrodynamics: The Finite-Difference Time-Domain Method
Allen Taflove and Susan C. Hagness.Computational Electrodynamics: The Finite-Difference Time-Domain Method. Artech House, 3rd edition, 2005
work page 2005
- [26]
-
[27]
Solver-in-the- loop: Learning from differentiable physics to interact with iterative pde-solvers
Kiwon Um, Robert Brand, Yun (Raymond) Fei, Philipp Holl, and Nils Thuerey. Solver-in-the- loop: Learning from differentiable physics to interact with iterative pde-solvers. InAdvances in Neural Information Processing Systems (NeurIPS), volume 33, 2020
work page 2020
-
[28]
Kane S. Yee. Numerical solution of initial boundary value problems involving maxwell’s equa- tions in isotropic media.IEEE Transactions on Antennas and Propagation, 14(3):302–307, 1966. 9 Appendix Appendix: Spectral structure of the learned stencil Theorem 3(Spectral structure and modal energy conservation for the learned stencil).Letw ⋆ ∈ R2R+1 be any so...
work page 1966
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.