pith. sign in

arxiv: 2605.15899 · v1 · pith:AKZVM2KTnew · submitted 2026-05-15 · ❄️ cond-mat.dis-nn · quant-ph

Solving Classical and Quantum Spin Glasses with Deep Boltzmann Quantum States

Pith reviewed 2026-05-19 17:52 UTC · model grok-4.3

classification ❄️ cond-mat.dis-nn quant-ph
keywords spin glassesneural quantum statesdeep Boltzmann machinesIsing modelscombinatorial optimizationground state searchquantum many-body systemsscheduling problems
0
0 comments X

The pith

Deep Boltzmann Quantum States solve large classical and quantum spin glasses by matching exact or best-known ground states.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper introduces Deep Boltzmann Quantum States as a variational neural approach to find ground states in spin glass models that are hard due to disorder and frustration. These states draw on deep Boltzmann machines to enable efficient block Gibbs sampling for global updates that help escape local minima. Training combines natural gradient methods with a schedule that gradually increases problem hardness from easy to hard regimes without needing to track the full adiabatic path at each step. A sympathetic reader would care because the approach solves instances with hundreds of spins in both classical and quantum Ising models and extends to NP-hard scheduling tasks at scales beyond current quantum annealing hardware.

Core claim

Deep Boltzmann Quantum States, inspired by deep Boltzmann machines, inherit efficient block Gibbs sampling. When trained using natural-gradient updates together with a hardness-interpolation schedule, these states match the exact solution or the best available estimate for several instances of classical and quantum Ising spin-glass models with infinite-range interactions and hundreds of spins. They also solve instances of the NP-hard Job Shop Scheduling Problem that exceed the current limitations of quantum annealing hardware.

What carries the argument

Deep Boltzmann Quantum States, a neural quantum state ansatz that supports efficient block Gibbs sampling, paired with a hardness-interpolation training schedule.

If this is right

  • Classical and quantum infinite-range Ising spin-glass models with hundreds of spins can be solved to exact or best-known accuracy.
  • NP-hard Job Shop Scheduling problems can be addressed at scales exceeding current quantum annealing hardware.
  • The approach supplies a framework for solving real-world hard combinatorial optimization tasks and for investigating disordered quantum many-body systems.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The method could extend to spin glasses with short-range or finite-dimensional interactions where exact benchmarks are unavailable.
  • Hybrid schemes that combine these states with quantum resources might handle still larger frustrated systems.
  • Annealing-style schedules in neural training may prove useful for other optimization problems that involve many competing minima.

Load-bearing premise

The neural network ansatz with block Gibbs sampling and gradual hardness tuning can reach the global ground state without becoming trapped in the many local minima of spin glasses.

What would settle it

A concrete counterexample would be an infinite-range Ising spin-glass instance with 100 or more spins where the computed energy lies above the known exact ground-state energy.

Figures

Figures reproduced from arXiv: 2605.15899 by Arka Dutta, Enrico Prati, Luca Leone, Markus Heyl, Pietro Torta.

Figure 1
Figure 1. Figure 1: Visualization of the NQA trajectory for a 4-site Ising toy model with all-to-all random couplings. Each subplot [PITH_FULL_IMAGE:figures/full_fig_p009_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Comparison of the typical residual energies on the [PITH_FULL_IMAGE:figures/full_fig_p011_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Residual energy histograms for the SK model with [PITH_FULL_IMAGE:figures/full_fig_p012_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: (a) Interaction matrix and longitudinal field vector of the Ising problem encoding a typical JSSP square instance. The represented instance has N = 5 and is displayed after variable pruning. (b) The Gantt diagram of the corresponding optimal schedule. Colors represent operations pertaining to a specific job, whereas the index of each operation indicate their ordering within that job. These results demonstr… view at source ↗
Figure 5
Figure 5. Figure 5: Convergence analysis for two representative [PITH_FULL_IMAGE:figures/full_fig_p015_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Aggregated results for the N = 100 instances of the transverse-field SK model at g = 0.1. We plot a histogram of the final energies obtained via the HPO, subtracting the cor￾responding classical energy. For each instance, we rank the HPO runs by final energy, with rank 1 corresponding to the best result. Colors indicate the average rank of the trials in the corresponding bin. The best runs for each instanc… view at source ↗
Figure 7
Figure 7. Figure 7: Convergence analysis for a representative [PITH_FULL_IMAGE:figures/full_fig_p016_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Comparison of the magnetization IAT between the [PITH_FULL_IMAGE:figures/full_fig_p023_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Residual energy error histograms for each combination of variational ansatz and optimization strategy on the [PITH_FULL_IMAGE:figures/full_fig_p028_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Residual energies histogram for each combination of variational ansatz and optimization strategy for the 10 real [PITH_FULL_IMAGE:figures/full_fig_p028_10.png] view at source ↗
read the original abstract

Variational neural network models have achieved remarkable success in solving ground-state problems of quantum many-body systems. However, addressing classical and quantum spin glasses remains challenging, as disorder and energy frustration give rise to an exponentially large number of local energy minima separated by high-energy barriers, hindering the efficiency of conventional Metropolis-based Monte Carlo methods. To bridge this gap, we introduce Deep Boltzmann Quantum States, a class of neural quantum states inspired by deep Boltzmann machines that inherit efficient block Gibbs sampling. We also propose two key advances in the training algorithm. Firstly, we combine natural-gradient updates with state-of-the-art stochastic optimizers. Secondly, we gradually tune the hardness of the problem Hamiltonian by interpolating from an easy to a hard regime, without the need to closely approximate the instantaneous adiabatic state at intermediate times. We match the exact solution or the best available estimate for several instances of classical and quantum Ising spin-glass models with infinite-range interactions and hundreds of spins. We also solve instances of the NP-hard Job Shop Scheduling Problem exceeding the current limitations of quantum annealing hardware. To summarize, deep neural architectures with efficient global update rules and trained within an annealing-like scheme, provide a powerful framework for solving real-world hard combinatorial optimization and for investigating disordered quantum many-body systems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces Deep Boltzmann Quantum States (DBQS), a variational neural ansatz inspired by deep Boltzmann machines that supports efficient block Gibbs sampling. Combined with natural-gradient stochastic optimization and a hardness-interpolation schedule that ramps the problem Hamiltonian from an easy (paramagnetic) regime to the target infinite-range Ising spin-glass Hamiltonian, the method is reported to recover exact or best-known ground-state energies for classical and quantum instances with hundreds of spins. The authors further apply the framework to NP-hard Job Shop Scheduling problems that exceed the size limits of current quantum annealing hardware.

Significance. If the reported matches to exact or best-known solutions are robust, the work demonstrates that deep neural variational states with global update rules and an annealing-like training protocol can address the exponential number of local minima in spin glasses at scales relevant to both condensed-matter physics and combinatorial optimization. The explicit use of block Gibbs sampling and the avoidance of strict adiabatic tracking are technically interesting strengths that could extend the reach of neural quantum states beyond translationally invariant systems.

major comments (2)
  1. [§4.1] §4.1 and the hardness-interpolation procedure: the central claim that the schedule reliably reaches the global ground state without trapping in local minima for N≈200 infinite-range instances rests on the assumption that the DBQS manifold remains connected to the target minimum at intermediate hardness values. No quantitative diagnostic (e.g., overlap with the instantaneous ground state or barrier-height estimates) is provided to substantiate this for instances known to possess exponentially many metastable states; a single counter-example run that fails to match the exact energy would falsify the performance claim.
  2. [Table 2] Results section, Table 2 (quantum SK model, N=100): the reported energy matches the best-known estimate to 0.001, yet the manuscript supplies neither the number of independent optimization runs nor the standard deviation across runs. Without these statistics it is impossible to determine whether the match reflects systematic success or a fortunate initialization that happened to avoid the dominant local minima.
minor comments (2)
  1. [§2.2] The definition of the DBQS wavefunction in §2.2 would be clearer if the mapping from visible units to physical spins and the precise form of the block-Gibbs conditional probabilities were written explicitly rather than left to the supplementary material.
  2. [Figure 4] Figure 4 (convergence curves) lacks a horizontal reference line at the exact or best-known energy; adding this line would make the visual assessment of convergence to the global minimum immediate.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their careful reading of the manuscript and for the constructive comments. We address each major comment below and indicate the revisions we intend to make.

read point-by-point responses
  1. Referee: [§4.1] §4.1 and the hardness-interpolation procedure: the central claim that the schedule reliably reaches the global ground state without trapping in local minima for N≈200 infinite-range instances rests on the assumption that the DBQS manifold remains connected to the target minimum at intermediate hardness values. No quantitative diagnostic (e.g., overlap with the instantaneous ground state or barrier-height estimates) is provided to substantiate this for instances known to possess exponentially many metastable states; a single counter-example run that fails to match the exact energy would falsify the performance claim.

    Authors: We appreciate the referee highlighting the need for stronger evidence supporting the hardness-interpolation schedule. The procedure is designed to gradually ramp the problem Hamiltonian while performing variational optimization at each stage, enabling the DBQS to adapt without strict adiabatic following. Our empirical results show consistent recovery of exact or best-known energies across multiple N≈200 instances, which we view as practical evidence that the variational manifold permits effective navigation of the landscape. Nevertheless, we agree that quantitative diagnostics would improve the manuscript. In the revision we will add overlap measurements between the optimized DBQS and the instantaneous ground state at selected intermediate hardness values for representative instances. We note that a single unsuccessful run would not necessarily falsify the overall performance claim, which is based on systematic success over an ensemble of instances rather than a guarantee for every possible realization. revision: yes

  2. Referee: [Table 2] Results section, Table 2 (quantum SK model, N=100): the reported energy matches the best-known estimate to 0.001, yet the manuscript supplies neither the number of independent optimization runs nor the standard deviation across runs. Without these statistics it is impossible to determine whether the match reflects systematic success or a fortunate initialization that happened to avoid the dominant local minima.

    Authors: We agree that the absence of run statistics makes it difficult to assess the robustness of the reported energies. In the revised manuscript we will explicitly state the number of independent optimization runs performed for the quantum SK instances shown in Table 2 and include the corresponding standard deviations (or ranges) of the final energies. revision: yes

Circularity Check

0 steps flagged

No circularity: computational method with independent empirical validation

full rationale

The paper introduces a variational neural ansatz (Deep Boltzmann Quantum States) trained via natural-gradient stochastic optimization and a hardness-interpolation schedule. Reported matches to exact or best-estimate ground states for infinite-range Ising instances are presented as numerical outcomes of this procedure, not as algebraic identities or self-referential fits. No equations reduce a claimed prediction to a quantity defined in terms of the target result itself, and no load-bearing uniqueness theorem is imported via self-citation. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available, so the ledger is necessarily incomplete. No explicit free parameters, axioms, or invented entities are stated in the provided text.

pith-pipeline@v0.9.0 · 5759 in / 1251 out tokens · 43211 ms · 2026-05-19T17:52:00.568346+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

145 extracted references · 145 canonical work pages · 7 internal anchors

  1. [1]

    A noteworthy observation is that imaginary-time TDVP is formally equivalent to the optimization of the cost func- tion in Eq

    Natural gradients and advanced optimizers In Section II A, we introduced the real and imaginary- time TDVP and defined the Fisher information matrix. A noteworthy observation is that imaginary-time TDVP is formally equivalent to the optimization of the cost func- tion in Eq. (8) via natural gradient descent, a seminal al- gorithm known as Stochastic Recon...

  2. [2]

    The resulting high-level im- plementation of VQA for optimizing a variational ansatz (typically an NQS) is illustrated in Algorithm 2

    Variational Quantum Annealing pseudo-code As explained in Section II C, an alternative approach to ground-state optimization with NQS relies on inter- polating from aneasyto ahardregime by tuning a pa- rameter in the Hamiltonian. The resulting high-level im- plementation of VQA for optimizing a variational ansatz (typically an NQS) is illustrated in Algor...

  3. [3]

    [123] and known as pmRBM uses two independent RBMs to model the phase and the modulus of the wave- function

    pmRBM and cRBM A conceptually straightforward extension proposed in Ref. [123] and known as pmRBM uses two independent RBMs to model the phase and the modulus of the wave- function. The wavefunction is expressed in the spin eigenbasis, which is commonly associated with the eigen- states of the Pauli-Z operators: ψ(x;θ) = p pm RBM(x;θ m) exp{i pp RBM(x;θ p...

  4. [4]

    DBM wavefunctions A possible quantum extension of DBMs based on the same principle of the cRBM was introduced in Ref. [32] and reads ψ(x;θ)= X x(l) l>0 e PNL −1 l=0 x(l)T W (l)x(l+1)+PNL l=0 b(l)T x(l) .(C3) The original reference also provides an interesting con- structive approach that does not require a variational optimization of the DBM parameters. H...

  5. [5]

    Quantum Boltzmann Machines Another possible extension, dubbed Quantum Boltz- mann Machine (QBM), was proposed in Ref. [122]. The units of the BM are promoted to quantum spins, and the state of the system is described as the exponential of a transverse-field Ising Hamiltonian HQBM =− X a Γaσx a − X a baσz a − X ab wabσz aσz b (C4) yielding the density matr...

  6. [6]

    Although the two HamiltoniansH v and 21 Hvh =H v ⊗I h share the same spectrum, the ground- state of the extended system can have a non-trivial en- tanglement structure

    Ground-state structure of the extended Hilbert space and expectation values The BQS formalism describes both visible and hid- den spins as quantum spins in an extended Hilbert space Hvh =H v ⊗ Hh. Although the two HamiltoniansH v and 21 Hvh =H v ⊗I h share the same spectrum, the ground- state of the extended system can have a non-trivial en- tanglement st...

  7. [7]

    X h′ ⟨χh|h′⟩ ⟨v,h ′|ψvh⟩ # |v⟩ ⊗ X h ⟨h|χh⟩ |h⟩ = X v

    Relation to the cRBM architecture and universality of DBQS Here, we show that applying a projection operator to the hidden partition collapses the visible spins into a pure state. Then, we demonstrate that specific projections reduce the RBQS and DBQS ansatzes to the cRBM and DBM wavefunctions of Refs. [25, 32], establishing that the DBQS framework genera...

  8. [8]

    Dataset–free initialization of the DBQS parameters Typical uses of DBMs in machine learning rely on greedy layer-wise pretraining [27]. In the NQA setting such pretraining yields no benefit: the first few opti- mization steps rapidly overwrite any pre-trained weights, effectively undoing the layer-wise initialization long be- fore the algorithm approaches...

  9. [9]

    This autocorrelation reduces the efficiency of the sampling procedure by decreasing the number of effectively inde- pendent samples obtained from the chain

    Comparison of block Gibbs chains and Metropolis-Hastings chains In Markov Chain Monte Carlo (MCMC) sampling, suc- cessive samples are typically correlated because each new sample is generated from the previous one, rather than drawn independently from the target distribution. This autocorrelation reduces the efficiency of the sampling procedure by decreas...

  10. [10]

    The update step is shown in Algorithm 3, and recursively ex- ecuted throughout the optimization

    Algorithmic overview As discussed in Section IV, NQA combines standard SR with momentum acceleration and persistent Gibbs chains enabled by our novel DBQS architecture. The update step is shown in Algorithm 3, and recursively ex- ecuted throughout the optimization. It comprises three actions: 1.Sampler update: Advance the persistent chains with a few bloc...

  11. [11]

    NQA Hyperparameters In the following, we enumerate the tunable hyperpa- rameters of NQA, organizing them into five groups based on their role in the algorithm. a. Annealing Hyperparameters These parameters characterize the annealing schedule and the parametric HamiltonianH(s) in Eq. (32). •Number of annealing stepsN A: it fixes the num- ber of discretized...

  12. [12]

    Computational complexity The computational complexity of NQA is linear in the number of calls of theNQA UpdateStepfunction, which amounts toN W +N F +N ANU. The first two terms in the sum are usually negligible, as is the computational cost of the rest of the code; hence, the overall compu- tational cost is essentially proportional toN ANU times the compu...

  13. [13]

    Here, no maximum time limit per run is set, as each run requires the same runtime within this simplified HPO

    HPO for SK benchmarks The HPO for the SK benchmark instances with cRBM and RBQS ansatzes are performed by only optimizing the learning rate and momentum, as shown in Table I, whereas all other hyperparameters are fixed as reported in Table II. Here, no maximum time limit per run is set, as each run requires the same runtime within this simplified HPO. The...

  14. [14]

    New trial settings are sampled via CMA-ES; here, we implement a 4-hour wall-clock time limit per 27 Table VI

    HPO for JSSP benchmarks The ranges and settings for the HPO studies on the JSSP benchmark instances are reported in Table V and Table VI. New trial settings are sampled via CMA-ES; here, we implement a 4-hour wall-clock time limit per 27 Table VI. Fixed settings for the JSSP benchmark. Setting Value Number of hidden layers 2 Number of warm-up update steps...

  15. [15]

    The first 100 trials are sampled via CMA-ES, and the last 50 via TPE

    HPO for transverse-field SK benchmarks The ranges and settings for the HPO studies on the transverse-field SK benchmark instances withN= 16 are reported in Table VII and Table VIII, and those for theN= 100 benchmarks are reported in Table IX and Table X. The first 100 trials are sampled via CMA-ES, and the last 50 via TPE. Here, we implement a 1-hour wall...

  16. [16]

    Table XIII shows the re- sults for our benchmark instances withN= 100 spins

    Additional results on the benchmark The energies obtained for theN= 16 benchmark are reported in Table XII. Table XIII shows the re- sults for our benchmark instances withN= 100 spins. Additional figures showing the data for all the ex- amined instances are available at [117] or [51] in the Table VIII. Fixed settings for the N = 16 transverse-field SK ben...

  17. [17]

    Sherrington and S

    D. Sherrington and S. Kirkpatrick, Solvable model of a spin-glass, Phys. Rev. Lett.35, 1792 (1975)

  18. [18]

    Barahona, On the computational complexity of ising spin glass models, Journal of Physics A: Mathematical and General15, 3241 (1982)

    F. Barahona, On the computational complexity of ising spin glass models, Journal of Physics A: Mathematical and General15, 3241 (1982)

  19. [19]

    Lucas, Ising formulations of many np problems, Fron- tiers in Physics2, 5 (2014)

    A. Lucas, Ising formulations of many np problems, Fron- tiers in Physics2, 5 (2014)

  20. [20]

    Ben Arous and A

    G. Ben Arous and A. Jagannath, Spectral Gap Esti- mates in Mean Field Spin Glasses, Communications in Mathematical Physics361, 1–52 (2018)

  21. [21]

    A. P. Young, ed.,Spin Glasses and Random Fields (World Scientific, Singapore, 1998)

  22. [22]

    M´ ezard, G

    M. M´ ezard, G. Parisi, and M. A. Virasoro,Spin Glass Theory and Beyond: An Introduction to the Replica Method and Its Applications(World Scientific, Singa- pore, 1987)

  23. [23]

    Sachdev, Quantum spin glasses, inQuantum Phase Transitions(Cambridge University Press, 2011) p

    S. Sachdev, Quantum spin glasses, inQuantum Phase Transitions(Cambridge University Press, 2011) p. 28 0 2 4 6 8 10Counts SR RBQS cRBM NQA RBQS cRBM 10 9 10 7 10 5 10 3 10 1 εB 0 2 4 6 8 10Counts RBQS cRBM 10 9 10 7 10 5 10 3 10 1 εB RBQS cRBM Figure 9. Residual energy error histograms for each combination of variational ansatz and optimization strategy on...

  24. [24]

    Rieger and A

    H. Rieger and A. P. Young, Griffiths singularities in the disordered phase of a quantum ising spin glass, Phys. Rev. B54, 3328 (1996)

  25. [25]

    L. F. Cugliandolo and M. Mueller, Quantum glasses – a review (2022), arXiv:2208.05417 [cond-mat.dis-nn]

  26. [26]

    Schultzen, T

    P. Schultzen, T. Franz, S. Geier, A. Salzinger, A. Tebben, C. Hainaut, G. Z¨ urn, M. Weidem¨ uller, and M. G¨ arttner, Glassy quantum dynamics of disordered ising spins, Phys. Rev. B105, L020201 (2022)

  27. [27]

    The complexity of quantum spin systems on a two-dimensional square lattice

    R. Oliveira and B. M. Terhal, The complexity of quan- tum spin systems on a two-dimensional square lattice (2008), arXiv:quant-ph/0504050 [quant-ph]

  28. [28]

    Padberg and G

    M. Padberg and G. Rinaldi, A branch-and-cut algo- rithm for the resolution of large-scale symmetric trav- eling salesman problems, SIAM Review33, 60 (1991), 29 Table XI. Typical energy errors on theN= 200 SK bench- mark. Method [εB]typ [εQ]typ cRBM-SR 1.353×10 −1 1.360×10 −1 cRBM-SR-cata 9.661×10 −2 9.685×10 −2 cRBM-NQA 1.653×10 −1 1.687×10 −1 cRBM-NQA-ca...

  29. [29]

    R. H. Swendsen and J.-S. Wang, Nonuniversal critical dynamics in monte carlo simulations, Phys. Rev. Lett. 58, 86 (1987)

  30. [30]

    Wolff, Collective monte carlo updating for spin sys- tems, Phys

    U. Wolff, Collective monte carlo updating for spin sys- tems, Phys. Rev. Lett.62, 361 (1989)

  31. [31]

    Marinari and G

    E. Marinari and G. Parisi, Simulated tempering: A new monte carlo scheme, Europhysics Letters19, 451 (1992)

  32. [32]

    Hukushima, K

    K. Hukushima and K. Nemoto, Exchange monte carlo method and applications to spin glass simulations, Jour- nal of the Physical Society of Japan65, 1604 (1996), https://doi.org/10.1143/JPSJ.65.1604

  33. [33]

    Bernaschi, I

    M. Bernaschi, I. Gonz´ alez-Adalid Pemart´ ın, V. Mart´ ın- Mayor, and G. Parisi, The quantum transition of the two-dimensional ising spin glass, Nature631, 749–754 (2024)

  34. [34]

    J.-G. Liu, L. Wang, and P. Zhang, Tropical tensor net- work for ground states of spin glasses, Phys. Rev. Lett. 126, 090506 (2021)

  35. [35]

    Ishii and T

    H. Ishii and T. Yamamoto, Monte carlo study of the sherrington-kirkpatrick spin glass model in a transverse field, inQuantum Monte Carlo Methods in Equilib- rium and Nonequilibrium Systems, edited by M. Suzuki (Springer Berlin Heidelberg, Berlin, Heidelberg, 1987) pp. 176–185

  36. [36]

    L. L. Viteritti, R. Rende, G. B. Testasecca, J. Niedda, R. Moessner, G. Carleo, and A. Scardicchio, Quantum spin glass in the two-dimensional disordered heisenberg Table XIII. Comparison between ED and NQA for theN= 16 benchmark. Instance ED NQA 0 -16.8595 -16.8589±0.0002 1 -17.0621 -17.0600±0.0002 2 -19.8560 -19.8557±0.0001 3 -18.45559 -18.4555±0.0001 4 ...

  37. [37]

    Neural Combinatorial Optimization with Reinforcement Learning

    I. Bello, H. Pham, Q. V. Le, M. Norouzi, and S. Bengio, Neural combinatorial optimization with reinforcement learning (2017), arXiv:1611.09940 [cs.AI]

  38. [38]

    Khalil, H

    E. Khalil, H. Dai, Y. Zhang, B. Dilkina, and L. Song, Learning combinatorial optimization algorithms over graphs, inAdvances in Neural Information Processing Systems, Vol. 30, edited by I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (2017)

  39. [39]

    Gabor, S

    T. Gabor, S. Feld, H. Safi, T. Phan, and C. Linnhoff- Popien, Insights on training neural networks for qubo tasks, inProceedings of the IEEE/ACM 42nd Interna- tional Conference on Software Engineering Workshops (Association for Computing Machinery, New York, NY, USA, 2020) pp. 436–441

  40. [40]

    He, Quantum annealing and gnn for solving tsp with qubo, inAlgorithmic Aspects in Information and Man- agement, edited by S

    H. He, Quantum annealing and gnn for solving tsp with qubo, inAlgorithmic Aspects in Information and Man- agement, edited by S. Ghosh and Z. Zhang (Springer Nature Singapore, Singapore, 2024) pp. 134–145

  41. [41]

    Carleo and M

    G. Carleo and M. Troyer, Solving the quantum many- body problem with artificial neural networks, Science 355, 602 (2017)

  42. [42]

    J. J. Hopfield, Neural networks and physical systems with emergent collective computational abilities, Pro- ceedings of the National Academy of Sciences79, 2554 (1982)

  43. [43]

    Goodfellow, Y

    I. Goodfellow, Y. Bengio, A. Courville, and Y. Bengio, Deep learning(MIT Press, Cambridge, 2016)

  44. [44]

    Sorella, Green function monte carlo with stochastic reconfiguration, Phys

    S. Sorella, Green function monte carlo with stochastic reconfiguration, Phys. Rev. Lett.80, 4558 (1998)

  45. [45]

    Chen and M

    A. Chen and M. Heyl, Empowering deep neural quantum states through efficient optimization, Nature Physics20, 1476 (2024)

  46. [46]

    D.-L. Deng, X. Li, and S. Das Sarma, Quantum en- tanglement in neural network states, Phys. Rev. X7, 021021 (2017)

  47. [47]

    C. Roth, A. Szab´ o, and A. H. MacDonald, High- accuracy variational monte carlo for frustrated magnets with deep neural networks, Phys. Rev. B108, 054410 (2023)

  48. [48]

    Carleo, Y

    G. Carleo, Y. Nomura, and M. Imada, Constructing exact representations of quantum many-body systems with deep neural networks, Nature Communications9, 30 5322 (2018)

  49. [49]

    Schmitt and M

    M. Schmitt and M. Heyl, Quantum many-body dynam- ics in two dimensions with artificial neural networks, Phys. Rev. Lett.125, 100503 (2020)

  50. [50]

    Kadowaki and H

    T. Kadowaki and H. Nishimori, Quantum annealing in the transverse Ising model, Phys. Rev. E58, 5355 (1998)

  51. [51]

    G. E. Santoro, R. Martoˇ n´ ak, E. Tosatti, and R. Car, Theory of quantum annealing of an ising spin glass, Sci- ence295, 2427 (2002)

  52. [52]

    Zanca and G

    T. Zanca and G. E. Santoro, Quantum annealing speedup over simulated annealing on random ising chains, Phys. Rev. B93, 224431 (2016)

  53. [53]

    Albash and D

    T. Albash and D. A. Lidar, Adiabatic quantum compu- tation, Rev. Mod. Phys.90, 015002 (2018)

  54. [54]

    M. W. Johnson, M. H. S. Amin, S. Gildert, T. Lant- ing, F. Hamze, N. Dickson, R. Harris, A. J. Berkley, J. Johansson, P. Bunyk, E. M. Chapple, C. Enderud, J. P. Hilton, K. Karimi, E. Ladizinsky, N. Ladizinsky, T. Oh, I. Perminov, C. Rich, M. C. Thom, E. Tolka- cheva, C. J. S. Truncik, S. Uchaikin, J. Wang, B. Wil- son, and G. Rose, Quantum annealing with ...

  55. [55]

    Yarkoni, E

    S. Yarkoni, E. Raponi, T. B¨ ack, and S. Schmitt, Quan- tum annealing for industry applications: introduction and review, Reports on Progress in Physics85, 104001 (2022)

  56. [56]

    J. Cai, W. G. Macready, and A. Roy, A practical heuris- tic for finding graph minors (2014), arXiv:1406.2741 [quant-ph]

  57. [57]

    Marton´ ak, G

    R. Marton´ ak, G. E. Santoro, and E. Tosatti, Quantum annealing by the path-integral monte carlo method: The two-dimensional random ising model, Physical Review B66, 094203 (2002)

  58. [58]

    B. Heim, T. F. Rønnow, S. V. Isakov, and M. Troyer, Quantum versus classical annealing of ising spin glasses, Science348, 215 (2015)

  59. [59]

    Crosson and A

    E. Crosson and A. W. Harrow, Simulated quantum an- nealing can be exponentially faster than classical simu- lated annealing, Physical Review A93, 042307 (2016)

  60. [60]

    Baldassi and R

    C. Baldassi and R. Zecchina, Efficiency of quantum vs. classical annealing in nonconvex learning problems, Pro- ceedings of the National Academy of Sciences115, 1457 (2018)

  61. [61]

    Hibat-Allah, E

    M. Hibat-Allah, E. M. Inack, R. Wiersema, R. G. Melko, and J. Carrasquilla, Variational neural annealing, Na- ture Machine Intelligence3, 952 (2021)

  62. [62]

    P. M. Long and R. A. Servedio, Restricted boltzmann machines are hard to approximately evaluate or simu- late, inProceedings of the 27th International Confer- ence on Machine Learning (ICML), ICML’10 (Omni- press, Madison, WI, USA, 2010) pp. 703–710

  63. [63]

    Jastrow, Many-body problem with strong forces, Phys

    R. Jastrow, Many-body problem with strong forces, Phys. Rev.98, 1479 (1955)

  64. [64]

    Becca and S

    F. Becca and S. Sorella,Quantum Monte Carlo Ap- proaches for Correlated Systems(Cambridge University Press, 2017)

  65. [65]

    Akiba, S

    T. Akiba, S. Sano, T. Yanase, T. Ohta, and M. Koyama, Optuna: A next-generation hyperparameter optimiza- tion framework, inProceedings of the 25th ACM SIGKDD International Conference on Knowledge Dis- covery and Data Mining(2019)

  66. [66]

    Bradbury, R

    J. Bradbury, R. Frostig, P. Hawkins, M. J. Johnson, C. Leary, D. Maclaurin, G. Necula, A. Paszke, J. Van- derPlas, S. Wanderman-Milne, and Q. Zhang, JAX: composable transformations of Python+NumPy pro- grams (2018). [51]https://github.com/lucaleonect/nqa_server

  67. [67]

    A. Chen, V. D. Naik, and M. Heyl, Convolutional trans- former wave functions (2025), arXiv:2503.10462 [cond- mat.dis-nn]

  68. [68]

    Schmitt, M

    M. Schmitt, M. M. Rams, J. Dziarmaga, M. Heyl, and W. H. Zurek, Quantum phase transition dynamics in the two-dimensional transverse-field ising model, Sci- ence Advances8, eabl6850 (2022)

  69. [69]

    Mendes-Santos, M

    T. Mendes-Santos, M. Schmitt, A. Angelone, A. Ro- driguez, P. Scholl, H. J. Williams, D. Barredo, T. La- haye, A. Browaeys, M. Heyl, and M. Dalmonte, Wave- function network description and kolmogorov complex- ity of quantum many-body systems, Phys. Rev. X14, 021029 (2024)

  70. [70]

    Lange, A

    H. Lange, A. Van de Walle, A. Abedinnia, and A. Bohrdt, From architectures to applications: A re- view of neural quantum states (2024), arXiv:2402.09402 [cond-mat.dis-nn]

  71. [71]

    Schmitt and M

    M. Schmitt and M. Heyl, Simulating dynamics of cor- related matter with neural quantum states (2025), arXiv:2506.03124 [quant-ph]

  72. [72]

    (n.a.), we denote the Pauli-Z eigenvalues withx i =±1 for notational consistency with the following sections

  73. [73]

    Rattray, D

    M. Rattray, D. Saad, and S.-i. Amari, Natural gradient descent for on-line learning, Phys. Rev. Lett.81, 5461 (1998)

  74. [74]

    Revisiting Natural Gradient for Deep Networks

    R. Pascanu and Y. Bengio, Revisiting natural gradient for deep networks (2014), arXiv:1301.3584 [cs.LG]

  75. [75]

    Ben-Israel and T

    A. Ben-Israel and T. N. E. Greville,Generalized in- verses: theory and applications(Springer Science & Business Media, 2006)

  76. [76]

    B. T. Polyak, Some methods of speeding up the conver- gence of iteration methods, USSR Computational Math- ematics and Mathematical Physics4, 1 (1964)

  77. [77]

    D. E. Rumelhart, G. E. Hinton, and R. J. Williams, Learning representations by back-propagating errors, Nature323, 533 (1986)

  78. [78]

    Sutskever, J

    I. Sutskever, J. Martens, G. Dahl, and G. Hinton, On the importance of initialization and momentum in deep learning, inProceedings of the 30th International Conference on Machine Learning (ICML), ICML’13 (JMLR.org, 2013) pp. 1139–1147

  79. [79]

    Das and B

    A. Das and B. K. Chakrabarti, Colloquium: Quantum annealing and analog quantum computation, Rev. Mod. Phys.80, 1061 (2008)

  80. [80]

    A. B. Finnila, M. A. Gomez, C. Sebenik, C. Stenson, and J. D. Doll, Quantum annealing: A new method for minimizing multidimensional functions, Chemical Physics Letters219, 343 (1994)

Showing first 80 references.