Equilibrium Propagation and Hamiltonian Inference in the Diffusive Fitzhugh-Nagumo Model
Pith reviewed 2026-05-22 09:30 UTC · model grok-4.3
The pith
Stationary solutions of the diffusively coupled Fitzhugh-Nagumo model are described by self-adjoint operators, allowing direct application of equilibrium propagation for credit assignment.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We show that since stationary solutions of the Fitzhugh-Nagumo model are described by self-adjoint operators, the methods of equilibrium propagation for performing credit assignment can be applied. Furthermore, for Fitzhugh-Nagumo networks with the topology of a deep residual network, we show that the steady state solutions admit a (spatial) Hamiltonian, and thus the methods of Hamiltonian Echo Backpropagation can be applied. We end by deriving an explicit layer-wise Hamiltonian recurrence relation governing inference for stationary solutions of both deep Fitzhugh-Nagumo networks and deep Energy-Based Models.
What carries the argument
The self-adjoint operator that describes the stationary solutions of the Fitzhugh-Nagumo model, which directly supports equilibrium propagation credit assignment, together with the spatial Hamiltonian admitted by residual topologies that supports Hamiltonian Echo Backpropagation.
If this is right
- Credit assignment in Fitzhugh-Nagumo networks can be performed using equilibrium propagation without explicit gradient computation.
- Hamiltonian Echo Backpropagation becomes applicable for inference in residual-topology Fitzhugh-Nagumo networks.
- A layer-wise recurrence relation provides an efficient way to compute the Hamiltonian governing inference in both Fitzhugh-Nagumo networks and deep energy-based models.
- Deep energy-based models become equivalent to Hamiltonian neural networks under the stationary-solution mapping shown here.
Where Pith is reading between the lines
- The same self-adjoint property might allow equilibrium propagation to be applied to other skew-gradient neuron models with diffusive coupling.
- The derived Hamiltonian recurrence could simplify local inference rules in hardware implementations that physically realize diffusive Fitzhugh-Nagumo dynamics.
- If the equivalence holds, training algorithms for energy-based models might transfer directly to networks of biological or neuromorphic neurons.
- Residual topologies may be a general route for introducing Hamiltonian structure into otherwise non-conservative dynamical systems.
Load-bearing premise
Stationary solutions of the Fitzhugh-Nagumo model are described by self-adjoint operators.
What would settle it
A direct computation or simulation showing that the linear operator governing a stationary solution of a diffusively coupled Fitzhugh-Nagumo network is not self-adjoint, or that equilibrium propagation fails to assign credit correctly in such a network.
Figures
read the original abstract
In this work, we extend the Equilibrium Propagation framework to skew-gradient systems and show an equivalence between deep Energy-Based Models and Hamiltonian neural networks. We focus on networks of diffusively coupled Fitzhugh-Nagumo neurons as a prototypical example. We show that since stationary solutions of the Fitzhugh-Nagumo model are described by self-adjoint operators, the methods of equilibrium propagation for performing credit assignment can be applied. Furthermore, for Fitzhugh-Nagumo networks with the topology of a deep residual network, we show that the steady state solutions admit a (spatial) Hamiltonian, and thus the methods of Hamiltonian Echo Backpropagation can be applied. We end by deriving an explicit layer-wise Hamiltonian recurrence relation governing inference for stationary solutions of both deep Fitzhugh-Nagumo networks and deep Energy-Based Models.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript extends Equilibrium Propagation (EP) to skew-gradient systems by analyzing networks of diffusively coupled Fitzhugh-Nagumo (FN) neurons. It asserts that stationary solutions of the FN model are governed by self-adjoint operators, permitting direct use of EP for credit assignment. For FN networks arranged in a deep residual topology, it further claims that steady-state solutions admit a spatial Hamiltonian, enabling Hamiltonian Echo Backpropagation (HEBP). The paper derives an explicit layer-wise Hamiltonian recurrence relation for inference in both deep FN networks and deep Energy-Based Models (EBMs) and establishes an equivalence between deep EBMs and Hamiltonian neural networks.
Significance. If the self-adjointness and Hamiltonian claims are rigorously substantiated, the work would provide a useful theoretical link between continuous dynamical systems from neuroscience and credit-assignment techniques in machine learning. The recurrence relation could serve as a practical tool for inference, and the equivalence result would strengthen connections between energy-based and Hamiltonian modeling approaches.
major comments (2)
- [§3.2] §3.2 (linearization around stationary profiles): The assertion that the stationary operator is self-adjoint rests on an unshown symmetry. The reaction Jacobian [[1-v², -1],[ε, -ε b]] is nonsymmetric for generic parameters; adding the symmetric diffusion term D Δ yields J_reaction(x) + D Δ, which is not automatically self-adjoint on the relevant Sobolev space. No explicit inner-product identity or change of variables is supplied to restore self-adjointness, undermining the direct applicability of EP.
- [§4.1] §4.1 (residual topology and Hamiltonian): The claim that steady states admit a spatial Hamiltonian for deep residual FN topologies is load-bearing for invoking HEBP, yet the manuscript provides neither the explicit Hamiltonian functional nor a variational derivation showing that the steady-state equations follow from a gradient of this Hamiltonian.
minor comments (2)
- [Abstract and §1] The term 'skew-gradient systems' is introduced in the abstract and introduction without a concise definition or pointer to the precise extension of the EP framework being used.
- [§5] The layer-wise Hamiltonian recurrence in §5 would be clearer with an accompanying algorithmic pseudocode or explicit update rules for the inference procedure.
Simulated Author's Rebuttal
We thank the referee for their careful reading of the manuscript and for identifying points that require additional clarification. We address each major comment below and indicate the revisions that will be incorporated.
read point-by-point responses
-
Referee: [§3.2] §3.2 (linearization around stationary profiles): The assertion that the stationary operator is self-adjoint rests on an unshown symmetry. The reaction Jacobian [[1-v², -1],[ε, -ε b]] is nonsymmetric for generic parameters; adding the symmetric diffusion term D Δ yields J_reaction(x) + D Δ, which is not automatically self-adjoint on the relevant Sobolev space. No explicit inner-product identity or change of variables is supplied to restore self-adjointness, undermining the direct applicability of EP.
Authors: The referee is correct that the manuscript does not supply an explicit inner-product identity or change of variables demonstrating self-adjointness of the linearized stationary operator. While the skew-gradient structure of the Fitzhugh-Nagumo system together with diffusive coupling permits self-adjointness under a suitably weighted inner product that incorporates the separation of time scales, this identity is not written out in the current text. We will add a dedicated paragraph (or short appendix) that defines the inner product and verifies symmetry of the operator with respect to it, thereby justifying direct application of equilibrium propagation. revision: yes
-
Referee: [§4.1] §4.1 (residual topology and Hamiltonian): The claim that steady states admit a spatial Hamiltonian for deep residual FN topologies is load-bearing for invoking HEBP, yet the manuscript provides neither the explicit Hamiltonian functional nor a variational derivation showing that the steady-state equations follow from a gradient of this Hamiltonian.
Authors: We agree that the current version does not exhibit the explicit spatial Hamiltonian functional nor the variational derivation establishing that the steady-state equations are its critical points. The layer-wise recurrence relation is derived from the residual topology, but the underlying Hamiltonian is only implicitly present. In the revision we will state the explicit Hamiltonian functional for the deep residual Fitzhugh-Nagumo network and show that its variational derivative recovers the steady-state equations, thereby rigorously supporting the invocation of Hamiltonian Echo Backpropagation. revision: yes
Circularity Check
No circularity: claims rest on asserted mathematical properties rather than definitional reduction or self-referential fits
full rationale
The paper asserts that stationary solutions of the Fitzhugh-Nagumo model are described by self-adjoint operators (allowing direct application of equilibrium propagation) and that residual topologies admit a spatial Hamiltonian (allowing Hamiltonian Echo Backpropagation). These are presented as results to be shown, followed by derivation of a layer-wise recurrence. No equations in the provided text define the self-adjointness or Hamiltonian in terms of the credit-assignment methods themselves, nor rename a fitted quantity as a prediction, nor reduce the central equivalence to a self-citation chain. The extension to skew-gradient systems and claimed equivalence between deep EBMs and Hamiltonian networks is framed as an independent argument. The derivation chain therefore remains self-contained against external mathematical verification of the operator properties.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Stationary solutions of the Fitzhugh-Nagumo model are described by self-adjoint operators
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/AlexanderDuality.leanalexander_duality_circle_linking echoes?
echoesECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.
Since L1 and L2 are symmetric by construction, and f'(u) and γ are diagonal matrices, and since the inverse of a symmetric matrix is again symmetric, the matrix M^{-1} must therefore be symmetric... Thus, the methods of Equilibrium Propagation can be applied.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Frontiers in computational neuroscience , volume=
Equilibrium propagation: Bridging the gap between energy-based models and backpropagation , author=. Frontiers in computational neuroscience , volume=. 2017 , publisher=
work page 2017
-
[2]
Early Inference in Energy-Based Models Approximates Back-Propagation
Early inference in energy-based models approximates back-propagation , author=. arXiv preprint arXiv:1510.02777 , year=
work page internal anchor Pith review Pith/arXiv arXiv
-
[3]
Journal of Differential Equations , volume=
Mini-maximizers for reaction-diffusion systems with skew-gradient structure , author=. Journal of Differential Equations , volume=. 2002 , publisher=
work page 2002
-
[4]
arXiv preprint arXiv:2206.02629 , year=
Backpropagation at the infinitesimal inference limit of energy-based models: Unifying predictive coding, equilibrium propagation, and contrastive hebbian learning , author=. arXiv preprint arXiv:2206.02629 , year=
-
[5]
Learning and inference in the brain , author=. Neural networks , volume=. 2003 , publisher=
work page 2003
-
[6]
Journal of mathematical psychology , volume=
A tutorial on the free-energy framework for modelling perception and learning , author=. Journal of mathematical psychology , volume=. 2017 , publisher=
work page 2017
-
[7]
Journal of Dynamics and Differential Equations , volume=
Standing pulse solutions in reaction-diffusion systems with skew-gradient structure , author=. Journal of Dynamics and Differential Equations , volume=. 2002 , publisher=
work page 2002
-
[8]
arXiv preprint arXiv:2006.01981 , year=
Training end-to-end analog neural networks with equilibrium propagation , author=. arXiv preprint arXiv:2006.01981 , year=
-
[9]
Nature Communications , volume=
Training an ising machine with equilibrium propagation , author=. Nature Communications , volume=. 2024 , publisher=
work page 2024
-
[10]
Physical Review Applied , volume=
Experimental demonstration of coupled learning in elastic networks , author=. Physical Review Applied , volume=. 2024 , publisher=
work page 2024
-
[11]
Training of photonic neural networks through in situ backpropagation and gradient measurement , author=. Optica , volume=. 2018 , publisher=
work page 2018
-
[12]
Nature Communications , volume=
Quantum equilibrium propagation for efficient training of quantum systems based on Onsager reciprocity , author=. Nature Communications , volume=. 2025 , publisher=
work page 2025
-
[13]
arXiv preprint arXiv:2406.00879 , year=
Quantum equilibrium propagation: Gradient-descent training of quantum systems , author=. arXiv preprint arXiv:2406.00879 , year=
-
[14]
Annual Review of Condensed Matter Physics , volume=
Learning without neurons in physical systems , author=. Annual Review of Condensed Matter Physics , volume=. 2023 , publisher=
work page 2023
-
[15]
Self-learning machines based on Hamiltonian echo backpropagation , author=. Physical Review X , volume=. 2023 , publisher=
work page 2023
-
[16]
Lagrangian-based Equilibrium Propagation: generalisation to arbitrary boundary conditions & equivalence with Hamiltonian Echo Learning , author=. arXiv preprint arXiv:2506.06248 , year=
work page internal anchor Pith review Pith/arXiv arXiv
-
[17]
Activity-difference training of deep neural networks using memristor crossbars , author=. Nature Electronics , volume=. 2023 , publisher=
work page 2023
-
[18]
Eqspike: spike-driven equilibrium propagation for neuromorphic implementations , author=. Iscience , volume=. 2021 , publisher=
work page 2021
-
[19]
Impulses and physiological states in theoretical models of nerve membrane , author=. Biophysical journal , volume=. 1961 , publisher=
work page 1961
-
[20]
Proceedings of the IRE , volume=
An active pulse transmission line simulating nerve axon , author=. Proceedings of the IRE , volume=. 1962 , publisher=
work page 1962
-
[21]
Six decades of the FitzHugh--Nagumo model: A guide through its spatio-temporal dynamics and influence across disciplines , author=. Physics Reports , volume=. 2024 , publisher=
work page 2024
-
[22]
Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences , volume=
Turing patterns on discrete topologies: from networks to higher-order structures , author=. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences , volume=. 2024 , publisher=
work page 2024
-
[23]
Nature Reviews Neuroscience , volume=
Backpropagation and the brain , author=. Nature Reviews Neuroscience , volume=. 2020 , publisher=
work page 2020
-
[24]
arXiv preprint arXiv:2501.10271 , year=
Spatial localization in the FitzHugh-Nagumo model , author=. arXiv preprint arXiv:2501.10271 , year=
-
[25]
SIAM Journal on Applied Mathematics , volume=
On the Turing patterns in one-dimensional gradient/skew-gradient dissipative systems , author=. SIAM Journal on Applied Mathematics , volume=. 2004 , publisher=
work page 2004
-
[26]
Annals of Operations Research , volume=
Structure preserving integration and model order reduction of skew-gradient reaction--diffusion systems , author=. Annals of Operations Research , volume=. 2017 , publisher=
work page 2017
-
[27]
The Collected Papers of Stephen Smale: Volume 2 , pages=
On the mathematical foundations of electrical circuit theory , author=. The Collected Papers of Stephen Smale: Volume 2 , pages=. 1971 , publisher=
work page 1971
-
[28]
arXiv preprint arXiv:2506.05259 , year=
Learning long range dependencies through time reversal symmetry breaking , author=. arXiv preprint arXiv:2506.05259 , year=
-
[29]
Event-based backpropagation can compute exact gradients for spiking neural networks , author=. Scientific Reports , volume=. 2021 , publisher=
work page 2021
-
[30]
Frontiers in neuroscience , volume=
Training deep spiking neural networks using backpropagation , author=. Frontiers in neuroscience , volume=. 2016 , publisher=
work page 2016
-
[31]
Competitive learning: From interactive activation to adaptive resonance , author=. Cognitive science , volume=. 1987 , publisher=
work page 1987
-
[32]
New Frontiers in Associative Memories-Workshop at ICLR 2026 , year=
Training a Convergent Energy Transformer with Equilibrium Propagation , author=. New Frontiers in Associative Memories-Workshop at ICLR 2026 , year=
work page 2026
-
[33]
Frontiers in neuroscience , volume=
Scaling equilibrium propagation to deep convnets by drastically reducing its gradient estimator bias , author=. Frontiers in neuroscience , volume=. 2021 , publisher=
work page 2021
-
[34]
Advances in neural information processing systems , volume=
Energy-based learning algorithms for analog computing: a comparative study , author=. Advances in neural information processing systems , volume=
-
[35]
Advances in Neural Information Processing Systems , volume=
Compositional visual generation with energy based models , author=. Advances in Neural Information Processing Systems , volume=
-
[36]
Advances in Neural Information Processing Systems , volume=
-PC: Scaling Predictive Coding to 100+ Layer Networks , author=. Advances in Neural Information Processing Systems , volume=
-
[37]
arXiv preprint arXiv:2107.12979 , year=
Predictive coding: a theoretical and experimental review , author=. arXiv preprint arXiv:2107.12979 , year=
-
[38]
Advances in neural information processing systems , volume=
Dense associative memory for pattern recognition , author=. Advances in neural information processing systems , volume=
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.