pith. sign in

arxiv: 2301.06217 · v1 · submitted 2023-01-15 · 🪐 quant-ph · cs.AI· cs.LG

Analogy between Boltzmann machines and Feynman path integrals

Pith reviewed 2026-05-24 10:06 UTC · model grok-4.3

classification 🪐 quant-ph cs.AIcs.LG
keywords Boltzmann machinesFeynman path integralsquantum circuitsmachine learninghidden layersquantum statistical mechanicsinverse quantum scattering
0
0 comments X

The pith

Hidden layers in Boltzmann machines correspond to discrete versions of Feynman path elements.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes an equivalence between Boltzmann machines used in machine learning and the Feynman path-integral formalism from quantum statistical mechanics. This equivalence supports the view that hidden layers function as discrete path elements, so that machine learning amounts to selecting combinations of such paths together with their accumulated weights to produce the desired input-to-output map. A sympathetic reader would care because the same structure then supplies quantum circuit models that apply uniformly to both Boltzmann machines and path integrals. The analysis further connects the hidden layers to inverse quantum scattering problems as a route to interpretability.

Core claim

The formal connections between Boltzmann machines and Feynman's description of quantum statistical mechanics allow the interpretation that the hidden layers in Boltzmann machines and other neural network formalisms are discrete versions of path elements present within the Feynman path-integral formalism. Feynman paths thereby supply a natural depiction of interference phenomena, implying that machine learning seeks an appropriate combination of paths along with accumulated path-weights through a network that cumulatively capture the correct mapping. As a direct consequence, general quantum circuit models applicable to both Boltzmann machines and Feynman path integral descriptions can be set,

What carries the argument

The path-element interpretation that equates hidden layers in Boltzmann machines with discrete Feynman path elements and thereby yields quantum circuit models.

Load-bearing premise

The formal structures of Boltzmann machines and Feynman path integrals are sufficiently isomorphic that the path-element interpretation and derived quantum-circuit models follow directly.

What would settle it

A concrete calculation on a small Boltzmann machine showing that its probability distribution over configurations does not match the distribution obtained by summing the corresponding discrete Feynman paths with the same weights.

Figures

Figures reproduced from arXiv: 2301.06217 by Sabre Kais, Srinivasan S. Iyengar.

Figure 1
Figure 1. Figure 1: FIG. 1. RBM network G=(V,E) showing the biases [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 4
Figure 4. Figure 4: FIG. 4. Explicit version of Figure [PITH_FULL_IMAGE:figures/full_fig_p003_4.png] view at source ↗
Figure 2
Figure 2. Figure 2: FIG. 2. Neural network depiction of Eq. ( [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗
Figure 5
Figure 5. Figure 5: FIG. 5. Illustration of the approach in Section [PITH_FULL_IMAGE:figures/full_fig_p005_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: FIG. 6. 3-local Neural network depiction with entropy in Eq. [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: FIG. 7. Same as Figure [PITH_FULL_IMAGE:figures/full_fig_p007_7.png] view at source ↗
read the original abstract

We provide a detailed exposition of the connections between Boltzmann machines commonly utilized in machine learning problems and the ideas already well known in quantum statistical mechanics through Feynman's description of the same. We find that this equivalence allows the interpretation that the hidden layers in Boltzmann machines and other neural network formalisms are in fact discrete versions of path elements that are present within the Feynman path-integral formalism. Since Feynman paths are the natural and elegant depiction of interference phenomena germane to quantum mechanics, it appears that in machine learning, the goal is to find an appropriate combination of ``paths'', along with accumulated path-weights, through a network that cumulatively capture the correct $x \rightarrow y$ map for a given mathematical problem. As a direct consequence of this analysis, we are able to provide general quantum circuit models that are applicable to both Boltzmann machines and to Feynman path integral descriptions. Connections are also made to inverse quantum scattering problems which allow a robust way to define ``interpretable'' hidden layers.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript provides an expository analogy between Boltzmann machines (and related neural networks) and Feynman path integrals in quantum statistical mechanics. It interprets hidden layers as discrete versions of path elements, argues that machine learning seeks appropriate combinations of such paths with accumulated weights to realize x→y maps, and derives general quantum circuit models applicable to both formalisms, with additional links to inverse quantum scattering for interpretability of hidden units.

Significance. If the structural mapping can be made rigorous, the work could usefully bridge classical machine-learning architectures with quantum path-integral techniques, potentially motivating quantum-circuit realizations of probabilistic models and new interpretive tools for neural-network layers. The paper is credited for explicitly connecting the two domains and for highlighting the inverse-scattering route to interpretability; however, because the central step remains interpretive rather than a derivation with controlled approximations or numerical checks, the immediate technical impact is modest.

major comments (2)
  1. [Abstract] Abstract and the paragraph introducing the path-element interpretation: the claim that hidden layers 'are in fact discrete versions of path elements' is asserted to follow directly from the equivalence, yet the manuscript does not supply an explicit isomorphism that maps the real Boltzmann factor exp(−E) onto the complex Feynman amplitude exp(iS/ℏ) while preserving interference phases; without this step the subsequent quantum-circuit constructions risk capturing only classical summation.
  2. [Quantum circuit models] Section presenting the general quantum circuit models: these models are presented as immediate consequences of the analogy, but the text does not demonstrate how real-valued RBM weights are promoted to complex path weights or how the discrete-path discretization avoids uncontrolled errors in the dynamics; the skeptic concern that the mapping may omit phases therefore remains unaddressed and is load-bearing for the claimed equivalence.
minor comments (2)
  1. Notation for the energy/action functionals is introduced without a consolidated table comparing the Boltzmann and Feynman conventions side-by-side; this would improve readability.
  2. [Abstract] The abstract states that the analysis 'allows' the path-element interpretation but does not distinguish this interpretive step from any new theorem or derivation; a single clarifying sentence would help.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their careful review and for noting the expository nature of the work. The manuscript draws a structural analogy between the two formalisms rather than claiming a rigorous isomorphism. We address each major comment below and will revise the manuscript to clarify the interpretive character of the connection.

read point-by-point responses
  1. Referee: [Abstract] Abstract and the paragraph introducing the path-element interpretation: the claim that hidden layers 'are in fact discrete versions of path elements' is asserted to follow directly from the equivalence, yet the manuscript does not supply an explicit isomorphism that maps the real Boltzmann factor exp(−E) onto the complex Feynman amplitude exp(iS/ℏ) while preserving interference phases; without this step the subsequent quantum-circuit constructions risk capturing only classical summation.

    Authors: We agree that no explicit isomorphism is supplied that maps real Boltzmann factors onto complex amplitudes while preserving phases. The manuscript presents the link as an expository analogy that permits interpreting hidden units as discrete path elements; it does not assert a direct mathematical equivalence that incorporates quantum interference. The quantum-circuit constructions follow from this structural view but remain illustrative. We will revise the abstract and the relevant introductory paragraph to state explicitly that the connection is interpretive rather than a phase-preserving isomorphism, thereby removing any implication of a stronger equivalence. revision: yes

  2. Referee: [Quantum circuit models] Section presenting the general quantum circuit models: these models are presented as immediate consequences of the analogy, but the text does not demonstrate how real-valued RBM weights are promoted to complex path weights or how the discrete-path discretization avoids uncontrolled errors in the dynamics; the skeptic concern that the mapping may omit phases therefore remains unaddressed and is load-bearing for the claimed equivalence.

    Authors: The referee correctly observes that the manuscript does not demonstrate promotion of real RBM weights to complex path weights, nor does it analyze discretization errors or phase factors. Because the paper focuses on the conceptual analogy rather than a quantitative derivation, these technical steps are absent. We will add a clarifying paragraph to the quantum-circuit section that acknowledges these limitations, states that the models are motivated by the analogy but do not constitute a controlled mapping, and notes that phases are not automatically retained. revision: yes

Circularity Check

0 steps flagged

No circularity: interpretive analogy between independent formalisms

full rationale

The paper advances an interpretive mapping: hidden layers in Boltzmann machines are viewed as discrete versions of Feynman path elements. The abstract and reader's summary indicate this follows from structural equivalence of the two formalisms without introducing fitted parameters, self-referential equations, or load-bearing self-citations that reduce the central claim to its own inputs. No derivation step equates a prediction to a fitted quantity by construction, and the real-vs-complex distinction raised by the skeptic is a correctness concern rather than a circularity reduction. The derivation chain remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the structural similarity between the two partition-function formalisms; no new free parameters, ad-hoc axioms, or invented entities are introduced.

axioms (1)
  • domain assumption The partition function of a Boltzmann machine can be identified with a discretized Feynman path integral without loss of the essential probabilistic interpretation.
    Invoked in the abstract when the hidden-layer interpretation is asserted.

pith-pipeline@v0.9.0 · 5698 in / 1193 out tokens · 24681 ms · 2026-05-24T10:06:03.607366+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

44 extracted references · 44 canonical work pages

  1. [1]

    interpretability

    from machine learning is palpable. These connections will be further explored in the following sections. |x⟩ |h1⟩ · · · |hP ⟩ FIG. 2. Neural network depiction of Eq. (8). |x⟩ |h1⟩ · · · |hP ⟩ |y⟩ FIG. 3. Neural network depiction of Eq. (8). Similar to Figure 2, but now |x′⟩ is assumed to be a different space from |x⟩. This figure is elaborated in Figure 4 t...

  2. [2]

    Sajjan, J

    M. Sajjan, J. Li, R. Selvarajan, S. H. Sureshbabu, S. S. Kale, R. Gupta, V. Singh, and S. Kais, Chemical Society Reviews (2022)

  3. [3]

    Wiebe, A

    N. Wiebe, A. Kapoor, and K. M. Svore, Quantum Infor- mation and Computation 16, 0541 (2015)

  4. [4]

    R. G. Melko, G. Carleo, J. Carrasquilla, and J. I. Cirac, Nature Physics 15, 887 (2019), ISSN 17452481, URL http://dx.doi.org/10.1038/s41567-019-0545-1

  5. [5]

    Le Roux and Y

    N. Le Roux and Y. Bengio, Neural Computation 20, 1631 (2008)

  6. [6]

    Nomura, A

    Y. Nomura, A. S. Darmawan, Y. Yamaji, and M. Imada, Phys. Rev. B 96, 205152 (2017), URL https://link. aps.org/doi/10.1103/PhysRevB.96.205152

  7. [7]

    C. K. Lee, P. Patil, S. Zhang, and C. Y. Hsieh, Phys. Rev. Research 3, 023095 (2021), URL https://link. aps.org/doi/10.1103/PhysRevResearch.3.023095

  8. [8]

    Sajjan, S

    M. Sajjan, S. H. Sureshbabu, and S. Kais, J. Am. Chem. Soc (2021), URL https://pubs.acs.org/doi/10.1021/ jacs.1c06246

  9. [9]

    Xia and S

    R. Xia and S. Kais, Nature communications 9, 1 (2018)

  10. [10]

    Kumar and S

    A. Kumar and S. S. Iyengar, J. Chem. Theory Comput. 15, 5769 (2019)

  11. [11]

    Kumar, N

    A. Kumar, N. DeGregorio, and S. S. Iyengar, J. Chem. Theory Comput. 17, 6671 (2021)

  12. [12]

    Ciliberto, M

    C. Ciliberto, M. Herbster, A. D. Ialongo, M. Pon- til, A. Rocchetto, S. Severini, and L. Wossnig, Pro- ceedings of the Royal Society A: Mathematical, Phys- ical and Engineering Sciences 474, 20170551 (2018), URL https://royalsocietypublishing.org/doi/abs/ 10.1098/rspa.2017.0551

  13. [13]

    D.-L. Deng, X. Li, and S. Das Sarma, Phys. Rev. X 7, 021021 (2017), URL https://link.aps.org/doi/10. 1103/PhysRevX.7.021021

  14. [14]

    P. M. Long and R. A. Servedio, ICML 2010 - Proceed- ings, 27th International Conference on Machine Learning pp. 703–710 (2010)

  15. [15]

    Kumar, N

    A. Kumar, N. DeGregorio, T. Ricard, and S. S. Iyengar, J. Chem. Theory Comput. In Press (2022)

  16. [16]

    J. H. Zhang and S. S. Iyengar, J. Chem. Theory Comput. 18, 2885 (2022)

  17. [17]

    Zhu and S

    X. Zhu and S. S. Iyengar, J. Chem. Theory Comput. 18, 5125 (2022)

  18. [18]

    Y. Che, C. Gneiting, and F. Nori, arXiv preprint arXiv:2202.02750 (2022)

  19. [19]

    R. P. Feynman and A. R. Hibbs, Quantum Mechanics and Path Integrals (McGraw-Hill Book Company, New York, 1965)

  20. [20]

    R. P. Feynman, Statistical Mechanics (Addison-Wesley, MA, 1972), chap. 3

  21. [21]

    J. R. Klauder, in A Garden of Quanta (World Scien- tific, 2003), pp. 55–76, URL https://doi.org/10.1142% 2F9789812795106_0005

  22. [22]

    Kleinert, Path Integrals in Quantum Mechanics, Statistics, Polymer Physics, and Financial Markets 9 (World Scientific, 2009), 5th ed

    H. Kleinert, Path Integrals in Quantum Mechanics, Statistics, Polymer Physics, and Financial Markets 9 (World Scientific, 2009), 5th ed

  23. [23]

    Makri, Comp

    N. Makri, Comp. Phys. Comm. 63, 389 (1991)

  24. [24]

    D. J. Kouri, W. Zhu, X. Ma, B. M. Pettitt, and D. K. Hoffman, J. Phys. Chem. 96, 9622 (1992)

  25. [25]

    B. J. Berne and D. Thirumalai, Annu. Rev. Phys. Chem. 37, 401 (1986)

  26. [26]

    Chandler and P

    D. Chandler and P. G. Wolynes, J. Chem. Phys.74, 4078 (1981)

  27. [27]

    M. E. Tuckerman, D. Marx, M. L. Klein, and M. Par- rinello, J. Chem. Phys. 104, 5579 (1996)

  28. [28]

    M. E. Tuckerman and D. Marx, Phys. Rev. Lett. 86, 4946 (2001)

  29. [29]

    Cao and G

    J. Cao and G. A. Voth, J. Chem. Phys.99, 10070 (1993)

  30. [30]

    Jang and G

    S. Jang and G. A. Voth, J. Chem. Phys. 111, 2357 (1999)

  31. [31]

    Jang and G

    S. Jang and G. A. Voth, J. Chem. Phys. 111, 2371 (1999)

  32. [32]

    Habershon, D

    S. Habershon, D. E. Manolopoulos, T. E. Markland, and T. F. Miller, Ann. Rev. Phys. Chem. 64, 387 (2013)

  33. [33]

    Monroe, W

    C. Monroe, W. C. Campbell, L.-M. Duan, Z.-X. Gong, A. V. Gorshkov, P. W. Hess, R. Islam, K. Kim, N. M. Linke, G. Pagano, et al., Rev. Mod. Phys. 93, 025001 (2021)

  34. [34]

    Labuhn, D

    H. Labuhn, D. Barredo, S. Ravets, S. De L´ es´ eleuc, T. Macr` ı, T. Lahaye, and A. Browaeys, Nature534, 667 (2016)

  35. [35]

    B. Yan, S. A. Moses, B. Gadway, J. P. Covey, K. R. Hazzard, A. M. Rey, D. S. Jin, and J. Ye, Nature 501, 521 (2013)

  36. [36]

    Bloch, J

    I. Bloch, J. Dalibard, and S. Nascimbene, Nature Physics 8, 267 (2012)

  37. [37]

    Barends, A

    R. Barends, A. Shabani, L. Lamata, J. Kelly, A. Mezza- capo, U. L. Heras, R. Babbush, A. G. Fowler, B. Camp- bell, Y. Chen, et al., Nature 534, 222 (2016)

  38. [38]

    R. G. Newton, Scattering Theory of Waves and Particles (Springer-Verlag, New York, 1982)

  39. [39]

    Bj¨ orklund, T

    A. Bj¨ orklund, T. Husfeldt, and M. Koivisto, SIAM J. Comput. 39, 546 (2009)

  40. [40]

    J. S. Yedidia, W. T. Freeman, and Y. Weiss,Understand- ing Belief Propagation and Its Generalizations (Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 2003), p. 239–269, ISBN 1558608117

  41. [41]

    Sajjan, V

    M. Sajjan, V. Singh, R. Selvarajan, and S. Kais, Imagi- nary components of out-of-time correlators and informa- tion scrambling for navigating the learning landscape of a quantum machine learning model (2022), URL https: //arxiv.org/abs/2208.13384

  42. [42]

    T. K. Dey and N. R. Shah, Comput. Geom. 8, 267 (1997)

  43. [43]

    M. A. Armstrong, Basic topology (Springer Science & Business Media, 2013)

  44. [44]

    T. C. Ricard and S. S. Iyengar, J. Chem. Theory Com- put. 16, 4790 (2020)