Solving Classical and Quantum Spin Glasses with Deep Boltzmann Quantum States

Arka Dutta; Enrico Prati; Luca Leone; Markus Heyl; Pietro Torta

arxiv: 2605.15899 · v1 · pith:AKZVM2KTnew · submitted 2026-05-15 · ❄️ cond-mat.dis-nn · quant-ph

Solving Classical and Quantum Spin Glasses with Deep Boltzmann Quantum States

Luca Leone , Arka Dutta , Markus Heyl , Enrico Prati , Pietro Torta This is my paper

Pith reviewed 2026-05-19 17:52 UTC · model grok-4.3

classification ❄️ cond-mat.dis-nn quant-ph

keywords spin glassesneural quantum statesdeep Boltzmann machinesIsing modelscombinatorial optimizationground state searchquantum many-body systemsscheduling problems

0 comments

The pith

Deep Boltzmann Quantum States solve large classical and quantum spin glasses by matching exact or best-known ground states.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper introduces Deep Boltzmann Quantum States as a variational neural approach to find ground states in spin glass models that are hard due to disorder and frustration. These states draw on deep Boltzmann machines to enable efficient block Gibbs sampling for global updates that help escape local minima. Training combines natural gradient methods with a schedule that gradually increases problem hardness from easy to hard regimes without needing to track the full adiabatic path at each step. A sympathetic reader would care because the approach solves instances with hundreds of spins in both classical and quantum Ising models and extends to NP-hard scheduling tasks at scales beyond current quantum annealing hardware.

Core claim

Deep Boltzmann Quantum States, inspired by deep Boltzmann machines, inherit efficient block Gibbs sampling. When trained using natural-gradient updates together with a hardness-interpolation schedule, these states match the exact solution or the best available estimate for several instances of classical and quantum Ising spin-glass models with infinite-range interactions and hundreds of spins. They also solve instances of the NP-hard Job Shop Scheduling Problem that exceed the current limitations of quantum annealing hardware.

What carries the argument

Deep Boltzmann Quantum States, a neural quantum state ansatz that supports efficient block Gibbs sampling, paired with a hardness-interpolation training schedule.

If this is right

Classical and quantum infinite-range Ising spin-glass models with hundreds of spins can be solved to exact or best-known accuracy.
NP-hard Job Shop Scheduling problems can be addressed at scales exceeding current quantum annealing hardware.
The approach supplies a framework for solving real-world hard combinatorial optimization tasks and for investigating disordered quantum many-body systems.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The method could extend to spin glasses with short-range or finite-dimensional interactions where exact benchmarks are unavailable.
Hybrid schemes that combine these states with quantum resources might handle still larger frustrated systems.
Annealing-style schedules in neural training may prove useful for other optimization problems that involve many competing minima.

Load-bearing premise

The neural network ansatz with block Gibbs sampling and gradual hardness tuning can reach the global ground state without becoming trapped in the many local minima of spin glasses.

What would settle it

A concrete counterexample would be an infinite-range Ising spin-glass instance with 100 or more spins where the computed energy lies above the known exact ground-state energy.

Figures

Figures reproduced from arXiv: 2605.15899 by Arka Dutta, Enrico Prati, Luca Leone, Markus Heyl, Pietro Torta.

**Figure 1.** Figure 1: Visualization of the NQA trajectory for a 4-site Ising toy model with all-to-all random couplings. Each subplot [PITH_FULL_IMAGE:figures/full_fig_p009_1.png] view at source ↗

**Figure 2.** Figure 2: Comparison of the typical residual energies on the [PITH_FULL_IMAGE:figures/full_fig_p011_2.png] view at source ↗

**Figure 3.** Figure 3: Residual energy histograms for the SK model with [PITH_FULL_IMAGE:figures/full_fig_p012_3.png] view at source ↗

**Figure 4.** Figure 4: (a) Interaction matrix and longitudinal field vector of the Ising problem encoding a typical JSSP square instance. The represented instance has N = 5 and is displayed after variable pruning. (b) The Gantt diagram of the corresponding optimal schedule. Colors represent operations pertaining to a specific job, whereas the index of each operation indicate their ordering within that job. These results demonstr… view at source ↗

**Figure 5.** Figure 5: Convergence analysis for two representative [PITH_FULL_IMAGE:figures/full_fig_p015_5.png] view at source ↗

**Figure 6.** Figure 6: Aggregated results for the N = 100 instances of the transverse-field SK model at g = 0.1. We plot a histogram of the final energies obtained via the HPO, subtracting the corresponding classical energy. For each instance, we rank the HPO runs by final energy, with rank 1 corresponding to the best result. Colors indicate the average rank of the trials in the corresponding bin. The best runs for each instanc… view at source ↗

**Figure 7.** Figure 7: Convergence analysis for a representative [PITH_FULL_IMAGE:figures/full_fig_p016_7.png] view at source ↗

**Figure 8.** Figure 8: Comparison of the magnetization IAT between the [PITH_FULL_IMAGE:figures/full_fig_p023_8.png] view at source ↗

**Figure 9.** Figure 9: Residual energy error histograms for each combination of variational ansatz and optimization strategy on the [PITH_FULL_IMAGE:figures/full_fig_p028_9.png] view at source ↗

**Figure 10.** Figure 10: Residual energies histogram for each combination of variational ansatz and optimization strategy for the 10 real [PITH_FULL_IMAGE:figures/full_fig_p028_10.png] view at source ↗

read the original abstract

Variational neural network models have achieved remarkable success in solving ground-state problems of quantum many-body systems. However, addressing classical and quantum spin glasses remains challenging, as disorder and energy frustration give rise to an exponentially large number of local energy minima separated by high-energy barriers, hindering the efficiency of conventional Metropolis-based Monte Carlo methods. To bridge this gap, we introduce Deep Boltzmann Quantum States, a class of neural quantum states inspired by deep Boltzmann machines that inherit efficient block Gibbs sampling. We also propose two key advances in the training algorithm. Firstly, we combine natural-gradient updates with state-of-the-art stochastic optimizers. Secondly, we gradually tune the hardness of the problem Hamiltonian by interpolating from an easy to a hard regime, without the need to closely approximate the instantaneous adiabatic state at intermediate times. We match the exact solution or the best available estimate for several instances of classical and quantum Ising spin-glass models with infinite-range interactions and hundreds of spins. We also solve instances of the NP-hard Job Shop Scheduling Problem exceeding the current limitations of quantum annealing hardware. To summarize, deep neural architectures with efficient global update rules and trained within an annealing-like scheme, provide a powerful framework for solving real-world hard combinatorial optimization and for investigating disordered quantum many-body systems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper's Deep Boltzmann Quantum States with hardness interpolation and natural gradient training claim to solve large spin glass instances and some scheduling problems where other methods struggle.

read the letter

The main takeaway is that this paper shows a deep neural variational ansatz can find ground states for classical and quantum spin glasses with hundreds of spins, plus some scheduling problems, using block Gibbs sampling and a hardness ramp. The new part is the Deep Boltzmann Quantum State, which takes the structure from deep Boltzmann machines to enable efficient global sampling updates. They add natural gradient descent with modern stochastic optimizers and introduce this hardness interpolation that starts from an easy regime and tunes up to the target Hamiltonian without requiring the state to stay close to the instantaneous ground state during the process. This combination appears to handle the frustration and disorder better than standard approaches. Reporting exact matches or best estimates on multiple instances is a good sign that the method scales practically where quantum hardware falls short. A soft spot could be the assumption that the optimization doesn't get stuck despite the many local minima. The interpolation helps smooth the path, but for infinite-range models with hundreds of spins, one would want to see evidence that the natural gradient steps consistently reach the global minimum across runs, not just selected successes. Minor details like the exact form of the interpolation and hyperparameter choices would also clarify reproducibility. The work is aimed at physicists and computer scientists interested in variational methods for hard optimization and many-body problems. Anyone studying neural quantum states or spin glass solvers would pick up useful techniques here. It should go to peer review. The approach is distinct enough from prior neural state methods to merit checking the implementation and results in detail.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces Deep Boltzmann Quantum States (DBQS), a variational neural ansatz inspired by deep Boltzmann machines that supports efficient block Gibbs sampling. Combined with natural-gradient stochastic optimization and a hardness-interpolation schedule that ramps the problem Hamiltonian from an easy (paramagnetic) regime to the target infinite-range Ising spin-glass Hamiltonian, the method is reported to recover exact or best-known ground-state energies for classical and quantum instances with hundreds of spins. The authors further apply the framework to NP-hard Job Shop Scheduling problems that exceed the size limits of current quantum annealing hardware.

Significance. If the reported matches to exact or best-known solutions are robust, the work demonstrates that deep neural variational states with global update rules and an annealing-like training protocol can address the exponential number of local minima in spin glasses at scales relevant to both condensed-matter physics and combinatorial optimization. The explicit use of block Gibbs sampling and the avoidance of strict adiabatic tracking are technically interesting strengths that could extend the reach of neural quantum states beyond translationally invariant systems.

major comments (2)

[§4.1] §4.1 and the hardness-interpolation procedure: the central claim that the schedule reliably reaches the global ground state without trapping in local minima for N≈200 infinite-range instances rests on the assumption that the DBQS manifold remains connected to the target minimum at intermediate hardness values. No quantitative diagnostic (e.g., overlap with the instantaneous ground state or barrier-height estimates) is provided to substantiate this for instances known to possess exponentially many metastable states; a single counter-example run that fails to match the exact energy would falsify the performance claim.
[Table 2] Results section, Table 2 (quantum SK model, N=100): the reported energy matches the best-known estimate to 0.001, yet the manuscript supplies neither the number of independent optimization runs nor the standard deviation across runs. Without these statistics it is impossible to determine whether the match reflects systematic success or a fortunate initialization that happened to avoid the dominant local minima.

minor comments (2)

[§2.2] The definition of the DBQS wavefunction in §2.2 would be clearer if the mapping from visible units to physical spins and the precise form of the block-Gibbs conditional probabilities were written explicitly rather than left to the supplementary material.
[Figure 4] Figure 4 (convergence curves) lacks a horizontal reference line at the exact or best-known energy; adding this line would make the visual assessment of convergence to the global minimum immediate.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their careful reading of the manuscript and for the constructive comments. We address each major comment below and indicate the revisions we intend to make.

read point-by-point responses

Referee: [§4.1] §4.1 and the hardness-interpolation procedure: the central claim that the schedule reliably reaches the global ground state without trapping in local minima for N≈200 infinite-range instances rests on the assumption that the DBQS manifold remains connected to the target minimum at intermediate hardness values. No quantitative diagnostic (e.g., overlap with the instantaneous ground state or barrier-height estimates) is provided to substantiate this for instances known to possess exponentially many metastable states; a single counter-example run that fails to match the exact energy would falsify the performance claim.

Authors: We appreciate the referee highlighting the need for stronger evidence supporting the hardness-interpolation schedule. The procedure is designed to gradually ramp the problem Hamiltonian while performing variational optimization at each stage, enabling the DBQS to adapt without strict adiabatic following. Our empirical results show consistent recovery of exact or best-known energies across multiple N≈200 instances, which we view as practical evidence that the variational manifold permits effective navigation of the landscape. Nevertheless, we agree that quantitative diagnostics would improve the manuscript. In the revision we will add overlap measurements between the optimized DBQS and the instantaneous ground state at selected intermediate hardness values for representative instances. We note that a single unsuccessful run would not necessarily falsify the overall performance claim, which is based on systematic success over an ensemble of instances rather than a guarantee for every possible realization. revision: yes
Referee: [Table 2] Results section, Table 2 (quantum SK model, N=100): the reported energy matches the best-known estimate to 0.001, yet the manuscript supplies neither the number of independent optimization runs nor the standard deviation across runs. Without these statistics it is impossible to determine whether the match reflects systematic success or a fortunate initialization that happened to avoid the dominant local minima.

Authors: We agree that the absence of run statistics makes it difficult to assess the robustness of the reported energies. In the revised manuscript we will explicitly state the number of independent optimization runs performed for the quantum SK instances shown in Table 2 and include the corresponding standard deviations (or ranges) of the final energies. revision: yes

Circularity Check

0 steps flagged

No circularity: computational method with independent empirical validation

full rationale

The paper introduces a variational neural ansatz (Deep Boltzmann Quantum States) trained via natural-gradient stochastic optimization and a hardness-interpolation schedule. Reported matches to exact or best-estimate ground states for infinite-range Ising instances are presented as numerical outcomes of this procedure, not as algebraic identities or self-referential fits. No equations reduce a claimed prediction to a quantity defined in terms of the target result itself, and no load-bearing uniqueness theorem is imported via self-citation. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available, so the ledger is necessarily incomplete. No explicit free parameters, axioms, or invented entities are stated in the provided text.

pith-pipeline@v0.9.0 · 5759 in / 1251 out tokens · 43211 ms · 2026-05-19T17:52:00.568346+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/ArrowOfTime.lean arrow_from_z unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

we gradually tune the hardness of the problem Hamiltonian by interpolating from an easy to a hard regime, without the need to closely approximate the instantaneous adiabatic state at intermediate times
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Deep Boltzmann Quantum States... inherit efficient block Gibbs sampling

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

145 extracted references · 145 canonical work pages · 7 internal anchors

[1]

A noteworthy observation is that imaginary-time TDVP is formally equivalent to the optimization of the cost func- tion in Eq

Natural gradients and advanced optimizers In Section II A, we introduced the real and imaginary- time TDVP and defined the Fisher information matrix. A noteworthy observation is that imaginary-time TDVP is formally equivalent to the optimization of the cost func- tion in Eq. (8) via natural gradient descent, a seminal al- gorithm known as Stochastic Recon...

work page
[2]

The resulting high-level im- plementation of VQA for optimizing a variational ansatz (typically an NQS) is illustrated in Algorithm 2

Variational Quantum Annealing pseudo-code As explained in Section II C, an alternative approach to ground-state optimization with NQS relies on inter- polating from aneasyto ahardregime by tuning a pa- rameter in the Hamiltonian. The resulting high-level im- plementation of VQA for optimizing a variational ansatz (typically an NQS) is illustrated in Algor...

work page
[3]

[123] and known as pmRBM uses two independent RBMs to model the phase and the modulus of the wave- function

pmRBM and cRBM A conceptually straightforward extension proposed in Ref. [123] and known as pmRBM uses two independent RBMs to model the phase and the modulus of the wave- function. The wavefunction is expressed in the spin eigenbasis, which is commonly associated with the eigen- states of the Pauli-Z operators: ψ(x;θ) = p pm RBM(x;θ m) exp{i pp RBM(x;θ p...

work page
[4]

DBM wavefunctions A possible quantum extension of DBMs based on the same principle of the cRBM was introduced in Ref. [32] and reads ψ(x;θ)= X x(l) l>0 e PNL −1 l=0 x(l)T W (l)x(l+1)+PNL l=0 b(l)T x(l) .(C3) The original reference also provides an interesting con- structive approach that does not require a variational optimization of the DBM parameters. H...

work page
[5]

Quantum Boltzmann Machines Another possible extension, dubbed Quantum Boltz- mann Machine (QBM), was proposed in Ref. [122]. The units of the BM are promoted to quantum spins, and the state of the system is described as the exponential of a transverse-field Ising Hamiltonian HQBM =− X a Γaσx a − X a baσz a − X ab wabσz aσz b (C4) yielding the density matr...

work page
[6]

Although the two HamiltoniansH v and 21 Hvh =H v ⊗I h share the same spectrum, the ground- state of the extended system can have a non-trivial en- tanglement structure

Ground-state structure of the extended Hilbert space and expectation values The BQS formalism describes both visible and hid- den spins as quantum spins in an extended Hilbert space Hvh =H v ⊗ Hh. Although the two HamiltoniansH v and 21 Hvh =H v ⊗I h share the same spectrum, the ground- state of the extended system can have a non-trivial en- tanglement st...

work page
[7]

X h′ ⟨χh|h′⟩ ⟨v,h ′|ψvh⟩ # |v⟩ ⊗ X h ⟨h|χh⟩ |h⟩ = X v

Relation to the cRBM architecture and universality of DBQS Here, we show that applying a projection operator to the hidden partition collapses the visible spins into a pure state. Then, we demonstrate that specific projections reduce the RBQS and DBQS ansatzes to the cRBM and DBM wavefunctions of Refs. [25, 32], establishing that the DBQS framework genera...

work page
[8]

Dataset–free initialization of the DBQS parameters Typical uses of DBMs in machine learning rely on greedy layer-wise pretraining [27]. In the NQA setting such pretraining yields no benefit: the first few opti- mization steps rapidly overwrite any pre-trained weights, effectively undoing the layer-wise initialization long be- fore the algorithm approaches...

work page
[9]

This autocorrelation reduces the efficiency of the sampling procedure by decreasing the number of effectively inde- pendent samples obtained from the chain

Comparison of block Gibbs chains and Metropolis-Hastings chains In Markov Chain Monte Carlo (MCMC) sampling, suc- cessive samples are typically correlated because each new sample is generated from the previous one, rather than drawn independently from the target distribution. This autocorrelation reduces the efficiency of the sampling procedure by decreas...

work page
[10]

The update step is shown in Algorithm 3, and recursively ex- ecuted throughout the optimization

Algorithmic overview As discussed in Section IV, NQA combines standard SR with momentum acceleration and persistent Gibbs chains enabled by our novel DBQS architecture. The update step is shown in Algorithm 3, and recursively ex- ecuted throughout the optimization. It comprises three actions: 1.Sampler update: Advance the persistent chains with a few bloc...

work page
[11]

NQA Hyperparameters In the following, we enumerate the tunable hyperpa- rameters of NQA, organizing them into five groups based on their role in the algorithm. a. Annealing Hyperparameters These parameters characterize the annealing schedule and the parametric HamiltonianH(s) in Eq. (32). •Number of annealing stepsN A: it fixes the num- ber of discretized...

work page
[12]

Computational complexity The computational complexity of NQA is linear in the number of calls of theNQA UpdateStepfunction, which amounts toN W +N F +N ANU. The first two terms in the sum are usually negligible, as is the computational cost of the rest of the code; hence, the overall compu- tational cost is essentially proportional toN ANU times the compu...

work page
[13]

Here, no maximum time limit per run is set, as each run requires the same runtime within this simplified HPO

HPO for SK benchmarks The HPO for the SK benchmark instances with cRBM and RBQS ansatzes are performed by only optimizing the learning rate and momentum, as shown in Table I, whereas all other hyperparameters are fixed as reported in Table II. Here, no maximum time limit per run is set, as each run requires the same runtime within this simplified HPO. The...

work page
[14]

New trial settings are sampled via CMA-ES; here, we implement a 4-hour wall-clock time limit per 27 Table VI

HPO for JSSP benchmarks The ranges and settings for the HPO studies on the JSSP benchmark instances are reported in Table V and Table VI. New trial settings are sampled via CMA-ES; here, we implement a 4-hour wall-clock time limit per 27 Table VI. Fixed settings for the JSSP benchmark. Setting Value Number of hidden layers 2 Number of warm-up update steps...

work page
[15]

The first 100 trials are sampled via CMA-ES, and the last 50 via TPE

HPO for transverse-field SK benchmarks The ranges and settings for the HPO studies on the transverse-field SK benchmark instances withN= 16 are reported in Table VII and Table VIII, and those for theN= 100 benchmarks are reported in Table IX and Table X. The first 100 trials are sampled via CMA-ES, and the last 50 via TPE. Here, we implement a 1-hour wall...

work page
[16]

Table XIII shows the re- sults for our benchmark instances withN= 100 spins

Additional results on the benchmark The energies obtained for theN= 16 benchmark are reported in Table XII. Table XIII shows the re- sults for our benchmark instances withN= 100 spins. Additional figures showing the data for all the ex- amined instances are available at [117] or [51] in the Table VIII. Fixed settings for the N = 16 transverse-field SK ben...

work page
[17]

Sherrington and S

D. Sherrington and S. Kirkpatrick, Solvable model of a spin-glass, Phys. Rev. Lett.35, 1792 (1975)

work page 1975
[18]

Barahona, On the computational complexity of ising spin glass models, Journal of Physics A: Mathematical and General15, 3241 (1982)

F. Barahona, On the computational complexity of ising spin glass models, Journal of Physics A: Mathematical and General15, 3241 (1982)

work page 1982
[19]

Lucas, Ising formulations of many np problems, Fron- tiers in Physics2, 5 (2014)

A. Lucas, Ising formulations of many np problems, Fron- tiers in Physics2, 5 (2014)

work page 2014
[20]

Ben Arous and A

G. Ben Arous and A. Jagannath, Spectral Gap Esti- mates in Mean Field Spin Glasses, Communications in Mathematical Physics361, 1–52 (2018)

work page 2018
[21]

A. P. Young, ed.,Spin Glasses and Random Fields (World Scientific, Singapore, 1998)

work page 1998
[22]

M´ ezard, G

M. M´ ezard, G. Parisi, and M. A. Virasoro,Spin Glass Theory and Beyond: An Introduction to the Replica Method and Its Applications(World Scientific, Singa- pore, 1987)

work page 1987
[23]

Sachdev, Quantum spin glasses, inQuantum Phase Transitions(Cambridge University Press, 2011) p

S. Sachdev, Quantum spin glasses, inQuantum Phase Transitions(Cambridge University Press, 2011) p. 28 0 2 4 6 8 10Counts SR RBQS cRBM NQA RBQS cRBM 10 9 10 7 10 5 10 3 10 1 εB 0 2 4 6 8 10Counts RBQS cRBM 10 9 10 7 10 5 10 3 10 1 εB RBQS cRBM Figure 9. Residual energy error histograms for each combination of variational ansatz and optimization strategy on...

work page 2011
[24]

Rieger and A

H. Rieger and A. P. Young, Griffiths singularities in the disordered phase of a quantum ising spin glass, Phys. Rev. B54, 3328 (1996)

work page 1996
[25]

L. F. Cugliandolo and M. Mueller, Quantum glasses – a review (2022), arXiv:2208.05417 [cond-mat.dis-nn]

work page arXiv 2022
[26]

Schultzen, T

P. Schultzen, T. Franz, S. Geier, A. Salzinger, A. Tebben, C. Hainaut, G. Z¨ urn, M. Weidem¨ uller, and M. G¨ arttner, Glassy quantum dynamics of disordered ising spins, Phys. Rev. B105, L020201 (2022)

work page 2022
[27]

The complexity of quantum spin systems on a two-dimensional square lattice

R. Oliveira and B. M. Terhal, The complexity of quan- tum spin systems on a two-dimensional square lattice (2008), arXiv:quant-ph/0504050 [quant-ph]

work page internal anchor Pith review Pith/arXiv arXiv 2008
[28]

Padberg and G

M. Padberg and G. Rinaldi, A branch-and-cut algo- rithm for the resolution of large-scale symmetric trav- eling salesman problems, SIAM Review33, 60 (1991), 29 Table XI. Typical energy errors on theN= 200 SK bench- mark. Method [εB]typ [εQ]typ cRBM-SR 1.353×10 −1 1.360×10 −1 cRBM-SR-cata 9.661×10 −2 9.685×10 −2 cRBM-NQA 1.653×10 −1 1.687×10 −1 cRBM-NQA-ca...

work page doi:10.1137/1033004 1991
[29]

R. H. Swendsen and J.-S. Wang, Nonuniversal critical dynamics in monte carlo simulations, Phys. Rev. Lett. 58, 86 (1987)

work page 1987
[30]

Wolff, Collective monte carlo updating for spin sys- tems, Phys

U. Wolff, Collective monte carlo updating for spin sys- tems, Phys. Rev. Lett.62, 361 (1989)

work page 1989
[31]

Marinari and G

E. Marinari and G. Parisi, Simulated tempering: A new monte carlo scheme, Europhysics Letters19, 451 (1992)

work page 1992
[32]

Hukushima, K

K. Hukushima and K. Nemoto, Exchange monte carlo method and applications to spin glass simulations, Jour- nal of the Physical Society of Japan65, 1604 (1996), https://doi.org/10.1143/JPSJ.65.1604

work page doi:10.1143/jpsj.65.1604 1996
[33]

Bernaschi, I

M. Bernaschi, I. Gonz´ alez-Adalid Pemart´ ın, V. Mart´ ın- Mayor, and G. Parisi, The quantum transition of the two-dimensional ising spin glass, Nature631, 749–754 (2024)

work page 2024
[34]

J.-G. Liu, L. Wang, and P. Zhang, Tropical tensor net- work for ground states of spin glasses, Phys. Rev. Lett. 126, 090506 (2021)

work page 2021
[35]

Ishii and T

H. Ishii and T. Yamamoto, Monte carlo study of the sherrington-kirkpatrick spin glass model in a transverse field, inQuantum Monte Carlo Methods in Equilib- rium and Nonequilibrium Systems, edited by M. Suzuki (Springer Berlin Heidelberg, Berlin, Heidelberg, 1987) pp. 176–185

work page 1987
[36]

L. L. Viteritti, R. Rende, G. B. Testasecca, J. Niedda, R. Moessner, G. Carleo, and A. Scardicchio, Quantum spin glass in the two-dimensional disordered heisenberg Table XIII. Comparison between ED and NQA for theN= 16 benchmark. Instance ED NQA 0 -16.8595 -16.8589±0.0002 1 -17.0621 -17.0600±0.0002 2 -19.8560 -19.8557±0.0001 3 -18.45559 -18.4555±0.0001 4 ...

work page arXiv 2025
[37]

Neural Combinatorial Optimization with Reinforcement Learning

I. Bello, H. Pham, Q. V. Le, M. Norouzi, and S. Bengio, Neural combinatorial optimization with reinforcement learning (2017), arXiv:1611.09940 [cs.AI]

work page internal anchor Pith review Pith/arXiv arXiv 2017
[38]

Khalil, H

E. Khalil, H. Dai, Y. Zhang, B. Dilkina, and L. Song, Learning combinatorial optimization algorithms over graphs, inAdvances in Neural Information Processing Systems, Vol. 30, edited by I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (2017)

work page 2017
[39]

Gabor, S

T. Gabor, S. Feld, H. Safi, T. Phan, and C. Linnhoff- Popien, Insights on training neural networks for qubo tasks, inProceedings of the IEEE/ACM 42nd Interna- tional Conference on Software Engineering Workshops (Association for Computing Machinery, New York, NY, USA, 2020) pp. 436–441

work page 2020
[40]

He, Quantum annealing and gnn for solving tsp with qubo, inAlgorithmic Aspects in Information and Man- agement, edited by S

H. He, Quantum annealing and gnn for solving tsp with qubo, inAlgorithmic Aspects in Information and Man- agement, edited by S. Ghosh and Z. Zhang (Springer Nature Singapore, Singapore, 2024) pp. 134–145

work page 2024
[41]

Carleo and M

G. Carleo and M. Troyer, Solving the quantum many- body problem with artificial neural networks, Science 355, 602 (2017)

work page 2017
[42]

J. J. Hopfield, Neural networks and physical systems with emergent collective computational abilities, Pro- ceedings of the National Academy of Sciences79, 2554 (1982)

work page 1982
[43]

Goodfellow, Y

I. Goodfellow, Y. Bengio, A. Courville, and Y. Bengio, Deep learning(MIT Press, Cambridge, 2016)

work page 2016
[44]

Sorella, Green function monte carlo with stochastic reconfiguration, Phys

S. Sorella, Green function monte carlo with stochastic reconfiguration, Phys. Rev. Lett.80, 4558 (1998)

work page 1998
[45]

Chen and M

A. Chen and M. Heyl, Empowering deep neural quantum states through efficient optimization, Nature Physics20, 1476 (2024)

work page 2024
[46]

D.-L. Deng, X. Li, and S. Das Sarma, Quantum en- tanglement in neural network states, Phys. Rev. X7, 021021 (2017)

work page 2017
[47]

C. Roth, A. Szab´ o, and A. H. MacDonald, High- accuracy variational monte carlo for frustrated magnets with deep neural networks, Phys. Rev. B108, 054410 (2023)

work page 2023
[48]

Carleo, Y

G. Carleo, Y. Nomura, and M. Imada, Constructing exact representations of quantum many-body systems with deep neural networks, Nature Communications9, 30 5322 (2018)

work page 2018
[49]

Schmitt and M

M. Schmitt and M. Heyl, Quantum many-body dynam- ics in two dimensions with artificial neural networks, Phys. Rev. Lett.125, 100503 (2020)

work page 2020
[50]

Kadowaki and H

T. Kadowaki and H. Nishimori, Quantum annealing in the transverse Ising model, Phys. Rev. E58, 5355 (1998)

work page 1998
[51]

G. E. Santoro, R. Martoˇ n´ ak, E. Tosatti, and R. Car, Theory of quantum annealing of an ising spin glass, Sci- ence295, 2427 (2002)

work page 2002
[52]

Zanca and G

T. Zanca and G. E. Santoro, Quantum annealing speedup over simulated annealing on random ising chains, Phys. Rev. B93, 224431 (2016)

work page 2016
[53]

Albash and D

T. Albash and D. A. Lidar, Adiabatic quantum compu- tation, Rev. Mod. Phys.90, 015002 (2018)

work page 2018
[54]

M. W. Johnson, M. H. S. Amin, S. Gildert, T. Lant- ing, F. Hamze, N. Dickson, R. Harris, A. J. Berkley, J. Johansson, P. Bunyk, E. M. Chapple, C. Enderud, J. P. Hilton, K. Karimi, E. Ladizinsky, N. Ladizinsky, T. Oh, I. Perminov, C. Rich, M. C. Thom, E. Tolka- cheva, C. J. S. Truncik, S. Uchaikin, J. Wang, B. Wil- son, and G. Rose, Quantum annealing with ...

work page 2011
[55]

Yarkoni, E

S. Yarkoni, E. Raponi, T. B¨ ack, and S. Schmitt, Quan- tum annealing for industry applications: introduction and review, Reports on Progress in Physics85, 104001 (2022)

work page 2022
[56]

J. Cai, W. G. Macready, and A. Roy, A practical heuris- tic for finding graph minors (2014), arXiv:1406.2741 [quant-ph]

work page internal anchor Pith review Pith/arXiv arXiv 2014
[57]

Marton´ ak, G

R. Marton´ ak, G. E. Santoro, and E. Tosatti, Quantum annealing by the path-integral monte carlo method: The two-dimensional random ising model, Physical Review B66, 094203 (2002)

work page 2002
[58]

B. Heim, T. F. Rønnow, S. V. Isakov, and M. Troyer, Quantum versus classical annealing of ising spin glasses, Science348, 215 (2015)

work page 2015
[59]

Crosson and A

E. Crosson and A. W. Harrow, Simulated quantum an- nealing can be exponentially faster than classical simu- lated annealing, Physical Review A93, 042307 (2016)

work page 2016
[60]

Baldassi and R

C. Baldassi and R. Zecchina, Efficiency of quantum vs. classical annealing in nonconvex learning problems, Pro- ceedings of the National Academy of Sciences115, 1457 (2018)

work page 2018
[61]

Hibat-Allah, E

M. Hibat-Allah, E. M. Inack, R. Wiersema, R. G. Melko, and J. Carrasquilla, Variational neural annealing, Na- ture Machine Intelligence3, 952 (2021)

work page 2021
[62]

P. M. Long and R. A. Servedio, Restricted boltzmann machines are hard to approximately evaluate or simu- late, inProceedings of the 27th International Confer- ence on Machine Learning (ICML), ICML’10 (Omni- press, Madison, WI, USA, 2010) pp. 703–710

work page 2010
[63]

Jastrow, Many-body problem with strong forces, Phys

R. Jastrow, Many-body problem with strong forces, Phys. Rev.98, 1479 (1955)

work page 1955
[64]

Becca and S

F. Becca and S. Sorella,Quantum Monte Carlo Ap- proaches for Correlated Systems(Cambridge University Press, 2017)

work page 2017
[65]

Akiba, S

T. Akiba, S. Sano, T. Yanase, T. Ohta, and M. Koyama, Optuna: A next-generation hyperparameter optimiza- tion framework, inProceedings of the 25th ACM SIGKDD International Conference on Knowledge Dis- covery and Data Mining(2019)

work page 2019
[66]

Bradbury, R

J. Bradbury, R. Frostig, P. Hawkins, M. J. Johnson, C. Leary, D. Maclaurin, G. Necula, A. Paszke, J. Van- derPlas, S. Wanderman-Milne, and Q. Zhang, JAX: composable transformations of Python+NumPy pro- grams (2018). [51]https://github.com/lucaleonect/nqa_server

work page 2018
[67]

A. Chen, V. D. Naik, and M. Heyl, Convolutional trans- former wave functions (2025), arXiv:2503.10462 [cond- mat.dis-nn]

work page arXiv 2025
[68]

Schmitt, M

M. Schmitt, M. M. Rams, J. Dziarmaga, M. Heyl, and W. H. Zurek, Quantum phase transition dynamics in the two-dimensional transverse-field ising model, Sci- ence Advances8, eabl6850 (2022)

work page 2022
[69]

Mendes-Santos, M

T. Mendes-Santos, M. Schmitt, A. Angelone, A. Ro- driguez, P. Scholl, H. J. Williams, D. Barredo, T. La- haye, A. Browaeys, M. Heyl, and M. Dalmonte, Wave- function network description and kolmogorov complex- ity of quantum many-body systems, Phys. Rev. X14, 021029 (2024)

work page 2024
[70]

Lange, A

H. Lange, A. Van de Walle, A. Abedinnia, and A. Bohrdt, From architectures to applications: A re- view of neural quantum states (2024), arXiv:2402.09402 [cond-mat.dis-nn]

work page arXiv 2024
[71]

Schmitt and M

M. Schmitt and M. Heyl, Simulating dynamics of cor- related matter with neural quantum states (2025), arXiv:2506.03124 [quant-ph]

work page arXiv 2025
[72]

(n.a.), we denote the Pauli-Z eigenvalues withx i =±1 for notational consistency with the following sections

work page
[73]

Rattray, D

M. Rattray, D. Saad, and S.-i. Amari, Natural gradient descent for on-line learning, Phys. Rev. Lett.81, 5461 (1998)

work page 1998
[74]

Revisiting Natural Gradient for Deep Networks

R. Pascanu and Y. Bengio, Revisiting natural gradient for deep networks (2014), arXiv:1301.3584 [cs.LG]

work page internal anchor Pith review Pith/arXiv arXiv 2014
[75]

Ben-Israel and T

A. Ben-Israel and T. N. E. Greville,Generalized in- verses: theory and applications(Springer Science & Business Media, 2006)

work page 2006
[76]

B. T. Polyak, Some methods of speeding up the conver- gence of iteration methods, USSR Computational Math- ematics and Mathematical Physics4, 1 (1964)

work page 1964
[77]

D. E. Rumelhart, G. E. Hinton, and R. J. Williams, Learning representations by back-propagating errors, Nature323, 533 (1986)

work page 1986
[78]

Sutskever, J

I. Sutskever, J. Martens, G. Dahl, and G. Hinton, On the importance of initialization and momentum in deep learning, inProceedings of the 30th International Conference on Machine Learning (ICML), ICML’13 (JMLR.org, 2013) pp. 1139–1147

work page 2013
[79]

Das and B

A. Das and B. K. Chakrabarti, Colloquium: Quantum annealing and analog quantum computation, Rev. Mod. Phys.80, 1061 (2008)

work page 2008
[80]

A. B. Finnila, M. A. Gomez, C. Sebenik, C. Stenson, and J. D. Doll, Quantum annealing: A new method for minimizing multidimensional functions, Chemical Physics Letters219, 343 (1994)

work page 1994

Showing first 80 references.

[1] [1]

A noteworthy observation is that imaginary-time TDVP is formally equivalent to the optimization of the cost func- tion in Eq

Natural gradients and advanced optimizers In Section II A, we introduced the real and imaginary- time TDVP and defined the Fisher information matrix. A noteworthy observation is that imaginary-time TDVP is formally equivalent to the optimization of the cost func- tion in Eq. (8) via natural gradient descent, a seminal al- gorithm known as Stochastic Recon...

work page

[2] [2]

The resulting high-level im- plementation of VQA for optimizing a variational ansatz (typically an NQS) is illustrated in Algorithm 2

Variational Quantum Annealing pseudo-code As explained in Section II C, an alternative approach to ground-state optimization with NQS relies on inter- polating from aneasyto ahardregime by tuning a pa- rameter in the Hamiltonian. The resulting high-level im- plementation of VQA for optimizing a variational ansatz (typically an NQS) is illustrated in Algor...

work page

[3] [3]

[123] and known as pmRBM uses two independent RBMs to model the phase and the modulus of the wave- function

pmRBM and cRBM A conceptually straightforward extension proposed in Ref. [123] and known as pmRBM uses two independent RBMs to model the phase and the modulus of the wave- function. The wavefunction is expressed in the spin eigenbasis, which is commonly associated with the eigen- states of the Pauli-Z operators: ψ(x;θ) = p pm RBM(x;θ m) exp{i pp RBM(x;θ p...

work page

[4] [4]

DBM wavefunctions A possible quantum extension of DBMs based on the same principle of the cRBM was introduced in Ref. [32] and reads ψ(x;θ)= X x(l) l>0 e PNL −1 l=0 x(l)T W (l)x(l+1)+PNL l=0 b(l)T x(l) .(C3) The original reference also provides an interesting con- structive approach that does not require a variational optimization of the DBM parameters. H...

work page

[5] [5]

Quantum Boltzmann Machines Another possible extension, dubbed Quantum Boltz- mann Machine (QBM), was proposed in Ref. [122]. The units of the BM are promoted to quantum spins, and the state of the system is described as the exponential of a transverse-field Ising Hamiltonian HQBM =− X a Γaσx a − X a baσz a − X ab wabσz aσz b (C4) yielding the density matr...

work page

[6] [6]

Although the two HamiltoniansH v and 21 Hvh =H v ⊗I h share the same spectrum, the ground- state of the extended system can have a non-trivial en- tanglement structure

Ground-state structure of the extended Hilbert space and expectation values The BQS formalism describes both visible and hid- den spins as quantum spins in an extended Hilbert space Hvh =H v ⊗ Hh. Although the two HamiltoniansH v and 21 Hvh =H v ⊗I h share the same spectrum, the ground- state of the extended system can have a non-trivial en- tanglement st...

work page

[7] [7]

X h′ ⟨χh|h′⟩ ⟨v,h ′|ψvh⟩ # |v⟩ ⊗ X h ⟨h|χh⟩ |h⟩ = X v

Relation to the cRBM architecture and universality of DBQS Here, we show that applying a projection operator to the hidden partition collapses the visible spins into a pure state. Then, we demonstrate that specific projections reduce the RBQS and DBQS ansatzes to the cRBM and DBM wavefunctions of Refs. [25, 32], establishing that the DBQS framework genera...

work page

[8] [8]

Dataset–free initialization of the DBQS parameters Typical uses of DBMs in machine learning rely on greedy layer-wise pretraining [27]. In the NQA setting such pretraining yields no benefit: the first few opti- mization steps rapidly overwrite any pre-trained weights, effectively undoing the layer-wise initialization long be- fore the algorithm approaches...

work page

[9] [9]

This autocorrelation reduces the efficiency of the sampling procedure by decreasing the number of effectively inde- pendent samples obtained from the chain

Comparison of block Gibbs chains and Metropolis-Hastings chains In Markov Chain Monte Carlo (MCMC) sampling, suc- cessive samples are typically correlated because each new sample is generated from the previous one, rather than drawn independently from the target distribution. This autocorrelation reduces the efficiency of the sampling procedure by decreas...

work page

[10] [10]

The update step is shown in Algorithm 3, and recursively ex- ecuted throughout the optimization

Algorithmic overview As discussed in Section IV, NQA combines standard SR with momentum acceleration and persistent Gibbs chains enabled by our novel DBQS architecture. The update step is shown in Algorithm 3, and recursively ex- ecuted throughout the optimization. It comprises three actions: 1.Sampler update: Advance the persistent chains with a few bloc...

work page

[11] [11]

NQA Hyperparameters In the following, we enumerate the tunable hyperpa- rameters of NQA, organizing them into five groups based on their role in the algorithm. a. Annealing Hyperparameters These parameters characterize the annealing schedule and the parametric HamiltonianH(s) in Eq. (32). •Number of annealing stepsN A: it fixes the num- ber of discretized...

work page

[12] [12]

Computational complexity The computational complexity of NQA is linear in the number of calls of theNQA UpdateStepfunction, which amounts toN W +N F +N ANU. The first two terms in the sum are usually negligible, as is the computational cost of the rest of the code; hence, the overall compu- tational cost is essentially proportional toN ANU times the compu...

work page

[13] [13]

Here, no maximum time limit per run is set, as each run requires the same runtime within this simplified HPO

HPO for SK benchmarks The HPO for the SK benchmark instances with cRBM and RBQS ansatzes are performed by only optimizing the learning rate and momentum, as shown in Table I, whereas all other hyperparameters are fixed as reported in Table II. Here, no maximum time limit per run is set, as each run requires the same runtime within this simplified HPO. The...

work page

[14] [14]

New trial settings are sampled via CMA-ES; here, we implement a 4-hour wall-clock time limit per 27 Table VI

HPO for JSSP benchmarks The ranges and settings for the HPO studies on the JSSP benchmark instances are reported in Table V and Table VI. New trial settings are sampled via CMA-ES; here, we implement a 4-hour wall-clock time limit per 27 Table VI. Fixed settings for the JSSP benchmark. Setting Value Number of hidden layers 2 Number of warm-up update steps...

work page

[15] [15]

The first 100 trials are sampled via CMA-ES, and the last 50 via TPE

HPO for transverse-field SK benchmarks The ranges and settings for the HPO studies on the transverse-field SK benchmark instances withN= 16 are reported in Table VII and Table VIII, and those for theN= 100 benchmarks are reported in Table IX and Table X. The first 100 trials are sampled via CMA-ES, and the last 50 via TPE. Here, we implement a 1-hour wall...

work page

[16] [16]

Table XIII shows the re- sults for our benchmark instances withN= 100 spins

Additional results on the benchmark The energies obtained for theN= 16 benchmark are reported in Table XII. Table XIII shows the re- sults for our benchmark instances withN= 100 spins. Additional figures showing the data for all the ex- amined instances are available at [117] or [51] in the Table VIII. Fixed settings for the N = 16 transverse-field SK ben...

work page

[17] [17]

Sherrington and S

D. Sherrington and S. Kirkpatrick, Solvable model of a spin-glass, Phys. Rev. Lett.35, 1792 (1975)

work page 1975

[18] [18]

Barahona, On the computational complexity of ising spin glass models, Journal of Physics A: Mathematical and General15, 3241 (1982)

F. Barahona, On the computational complexity of ising spin glass models, Journal of Physics A: Mathematical and General15, 3241 (1982)

work page 1982

[19] [19]

Lucas, Ising formulations of many np problems, Fron- tiers in Physics2, 5 (2014)

A. Lucas, Ising formulations of many np problems, Fron- tiers in Physics2, 5 (2014)

work page 2014

[20] [20]

Ben Arous and A

G. Ben Arous and A. Jagannath, Spectral Gap Esti- mates in Mean Field Spin Glasses, Communications in Mathematical Physics361, 1–52 (2018)

work page 2018

[21] [21]

A. P. Young, ed.,Spin Glasses and Random Fields (World Scientific, Singapore, 1998)

work page 1998

[22] [22]

M´ ezard, G

M. M´ ezard, G. Parisi, and M. A. Virasoro,Spin Glass Theory and Beyond: An Introduction to the Replica Method and Its Applications(World Scientific, Singa- pore, 1987)

work page 1987

[23] [23]

Sachdev, Quantum spin glasses, inQuantum Phase Transitions(Cambridge University Press, 2011) p

S. Sachdev, Quantum spin glasses, inQuantum Phase Transitions(Cambridge University Press, 2011) p. 28 0 2 4 6 8 10Counts SR RBQS cRBM NQA RBQS cRBM 10 9 10 7 10 5 10 3 10 1 εB 0 2 4 6 8 10Counts RBQS cRBM 10 9 10 7 10 5 10 3 10 1 εB RBQS cRBM Figure 9. Residual energy error histograms for each combination of variational ansatz and optimization strategy on...

work page 2011

[24] [24]

Rieger and A

H. Rieger and A. P. Young, Griffiths singularities in the disordered phase of a quantum ising spin glass, Phys. Rev. B54, 3328 (1996)

work page 1996

[25] [25]

L. F. Cugliandolo and M. Mueller, Quantum glasses – a review (2022), arXiv:2208.05417 [cond-mat.dis-nn]

work page arXiv 2022

[26] [26]

Schultzen, T

P. Schultzen, T. Franz, S. Geier, A. Salzinger, A. Tebben, C. Hainaut, G. Z¨ urn, M. Weidem¨ uller, and M. G¨ arttner, Glassy quantum dynamics of disordered ising spins, Phys. Rev. B105, L020201 (2022)

work page 2022

[27] [27]

The complexity of quantum spin systems on a two-dimensional square lattice

R. Oliveira and B. M. Terhal, The complexity of quan- tum spin systems on a two-dimensional square lattice (2008), arXiv:quant-ph/0504050 [quant-ph]

work page internal anchor Pith review Pith/arXiv arXiv 2008

[28] [28]

Padberg and G

M. Padberg and G. Rinaldi, A branch-and-cut algo- rithm for the resolution of large-scale symmetric trav- eling salesman problems, SIAM Review33, 60 (1991), 29 Table XI. Typical energy errors on theN= 200 SK bench- mark. Method [εB]typ [εQ]typ cRBM-SR 1.353×10 −1 1.360×10 −1 cRBM-SR-cata 9.661×10 −2 9.685×10 −2 cRBM-NQA 1.653×10 −1 1.687×10 −1 cRBM-NQA-ca...

work page doi:10.1137/1033004 1991

[29] [29]

R. H. Swendsen and J.-S. Wang, Nonuniversal critical dynamics in monte carlo simulations, Phys. Rev. Lett. 58, 86 (1987)

work page 1987

[30] [30]

Wolff, Collective monte carlo updating for spin sys- tems, Phys

U. Wolff, Collective monte carlo updating for spin sys- tems, Phys. Rev. Lett.62, 361 (1989)

work page 1989

[31] [31]

Marinari and G

E. Marinari and G. Parisi, Simulated tempering: A new monte carlo scheme, Europhysics Letters19, 451 (1992)

work page 1992

[32] [32]

Hukushima, K

K. Hukushima and K. Nemoto, Exchange monte carlo method and applications to spin glass simulations, Jour- nal of the Physical Society of Japan65, 1604 (1996), https://doi.org/10.1143/JPSJ.65.1604

work page doi:10.1143/jpsj.65.1604 1996

[33] [33]

Bernaschi, I

M. Bernaschi, I. Gonz´ alez-Adalid Pemart´ ın, V. Mart´ ın- Mayor, and G. Parisi, The quantum transition of the two-dimensional ising spin glass, Nature631, 749–754 (2024)

work page 2024

[34] [34]

J.-G. Liu, L. Wang, and P. Zhang, Tropical tensor net- work for ground states of spin glasses, Phys. Rev. Lett. 126, 090506 (2021)

work page 2021

[35] [35]

Ishii and T

H. Ishii and T. Yamamoto, Monte carlo study of the sherrington-kirkpatrick spin glass model in a transverse field, inQuantum Monte Carlo Methods in Equilib- rium and Nonequilibrium Systems, edited by M. Suzuki (Springer Berlin Heidelberg, Berlin, Heidelberg, 1987) pp. 176–185

work page 1987

[36] [36]

L. L. Viteritti, R. Rende, G. B. Testasecca, J. Niedda, R. Moessner, G. Carleo, and A. Scardicchio, Quantum spin glass in the two-dimensional disordered heisenberg Table XIII. Comparison between ED and NQA for theN= 16 benchmark. Instance ED NQA 0 -16.8595 -16.8589±0.0002 1 -17.0621 -17.0600±0.0002 2 -19.8560 -19.8557±0.0001 3 -18.45559 -18.4555±0.0001 4 ...

work page arXiv 2025

[37] [37]

Neural Combinatorial Optimization with Reinforcement Learning

I. Bello, H. Pham, Q. V. Le, M. Norouzi, and S. Bengio, Neural combinatorial optimization with reinforcement learning (2017), arXiv:1611.09940 [cs.AI]

work page internal anchor Pith review Pith/arXiv arXiv 2017

[38] [38]

Khalil, H

E. Khalil, H. Dai, Y. Zhang, B. Dilkina, and L. Song, Learning combinatorial optimization algorithms over graphs, inAdvances in Neural Information Processing Systems, Vol. 30, edited by I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (2017)

work page 2017

[39] [39]

Gabor, S

T. Gabor, S. Feld, H. Safi, T. Phan, and C. Linnhoff- Popien, Insights on training neural networks for qubo tasks, inProceedings of the IEEE/ACM 42nd Interna- tional Conference on Software Engineering Workshops (Association for Computing Machinery, New York, NY, USA, 2020) pp. 436–441

work page 2020

[40] [40]

He, Quantum annealing and gnn for solving tsp with qubo, inAlgorithmic Aspects in Information and Man- agement, edited by S

H. He, Quantum annealing and gnn for solving tsp with qubo, inAlgorithmic Aspects in Information and Man- agement, edited by S. Ghosh and Z. Zhang (Springer Nature Singapore, Singapore, 2024) pp. 134–145

work page 2024

[41] [41]

Carleo and M

G. Carleo and M. Troyer, Solving the quantum many- body problem with artificial neural networks, Science 355, 602 (2017)

work page 2017

[42] [42]

J. J. Hopfield, Neural networks and physical systems with emergent collective computational abilities, Pro- ceedings of the National Academy of Sciences79, 2554 (1982)

work page 1982

[43] [43]

Goodfellow, Y

I. Goodfellow, Y. Bengio, A. Courville, and Y. Bengio, Deep learning(MIT Press, Cambridge, 2016)

work page 2016

[44] [44]

Sorella, Green function monte carlo with stochastic reconfiguration, Phys

S. Sorella, Green function monte carlo with stochastic reconfiguration, Phys. Rev. Lett.80, 4558 (1998)

work page 1998

[45] [45]

Chen and M

A. Chen and M. Heyl, Empowering deep neural quantum states through efficient optimization, Nature Physics20, 1476 (2024)

work page 2024

[46] [46]

D.-L. Deng, X. Li, and S. Das Sarma, Quantum en- tanglement in neural network states, Phys. Rev. X7, 021021 (2017)

work page 2017

[47] [47]

C. Roth, A. Szab´ o, and A. H. MacDonald, High- accuracy variational monte carlo for frustrated magnets with deep neural networks, Phys. Rev. B108, 054410 (2023)

work page 2023

[48] [48]

Carleo, Y

G. Carleo, Y. Nomura, and M. Imada, Constructing exact representations of quantum many-body systems with deep neural networks, Nature Communications9, 30 5322 (2018)

work page 2018

[49] [49]

Schmitt and M

M. Schmitt and M. Heyl, Quantum many-body dynam- ics in two dimensions with artificial neural networks, Phys. Rev. Lett.125, 100503 (2020)

work page 2020

[50] [50]

Kadowaki and H

T. Kadowaki and H. Nishimori, Quantum annealing in the transverse Ising model, Phys. Rev. E58, 5355 (1998)

work page 1998

[51] [51]

G. E. Santoro, R. Martoˇ n´ ak, E. Tosatti, and R. Car, Theory of quantum annealing of an ising spin glass, Sci- ence295, 2427 (2002)

work page 2002

[52] [52]

Zanca and G

T. Zanca and G. E. Santoro, Quantum annealing speedup over simulated annealing on random ising chains, Phys. Rev. B93, 224431 (2016)

work page 2016

[53] [53]

Albash and D

T. Albash and D. A. Lidar, Adiabatic quantum compu- tation, Rev. Mod. Phys.90, 015002 (2018)

work page 2018

[54] [54]

M. W. Johnson, M. H. S. Amin, S. Gildert, T. Lant- ing, F. Hamze, N. Dickson, R. Harris, A. J. Berkley, J. Johansson, P. Bunyk, E. M. Chapple, C. Enderud, J. P. Hilton, K. Karimi, E. Ladizinsky, N. Ladizinsky, T. Oh, I. Perminov, C. Rich, M. C. Thom, E. Tolka- cheva, C. J. S. Truncik, S. Uchaikin, J. Wang, B. Wil- son, and G. Rose, Quantum annealing with ...

work page 2011

[55] [55]

Yarkoni, E

S. Yarkoni, E. Raponi, T. B¨ ack, and S. Schmitt, Quan- tum annealing for industry applications: introduction and review, Reports on Progress in Physics85, 104001 (2022)

work page 2022

[56] [56]

J. Cai, W. G. Macready, and A. Roy, A practical heuris- tic for finding graph minors (2014), arXiv:1406.2741 [quant-ph]

work page internal anchor Pith review Pith/arXiv arXiv 2014

[57] [57]

Marton´ ak, G

R. Marton´ ak, G. E. Santoro, and E. Tosatti, Quantum annealing by the path-integral monte carlo method: The two-dimensional random ising model, Physical Review B66, 094203 (2002)

work page 2002

[58] [58]

B. Heim, T. F. Rønnow, S. V. Isakov, and M. Troyer, Quantum versus classical annealing of ising spin glasses, Science348, 215 (2015)

work page 2015

[59] [59]

Crosson and A

E. Crosson and A. W. Harrow, Simulated quantum an- nealing can be exponentially faster than classical simu- lated annealing, Physical Review A93, 042307 (2016)

work page 2016

[60] [60]

Baldassi and R

C. Baldassi and R. Zecchina, Efficiency of quantum vs. classical annealing in nonconvex learning problems, Pro- ceedings of the National Academy of Sciences115, 1457 (2018)

work page 2018

[61] [61]

Hibat-Allah, E

M. Hibat-Allah, E. M. Inack, R. Wiersema, R. G. Melko, and J. Carrasquilla, Variational neural annealing, Na- ture Machine Intelligence3, 952 (2021)

work page 2021

[62] [62]

P. M. Long and R. A. Servedio, Restricted boltzmann machines are hard to approximately evaluate or simu- late, inProceedings of the 27th International Confer- ence on Machine Learning (ICML), ICML’10 (Omni- press, Madison, WI, USA, 2010) pp. 703–710

work page 2010

[63] [63]

Jastrow, Many-body problem with strong forces, Phys

R. Jastrow, Many-body problem with strong forces, Phys. Rev.98, 1479 (1955)

work page 1955

[64] [64]

Becca and S

F. Becca and S. Sorella,Quantum Monte Carlo Ap- proaches for Correlated Systems(Cambridge University Press, 2017)

work page 2017

[65] [65]

Akiba, S

T. Akiba, S. Sano, T. Yanase, T. Ohta, and M. Koyama, Optuna: A next-generation hyperparameter optimiza- tion framework, inProceedings of the 25th ACM SIGKDD International Conference on Knowledge Dis- covery and Data Mining(2019)

work page 2019

[66] [66]

Bradbury, R

J. Bradbury, R. Frostig, P. Hawkins, M. J. Johnson, C. Leary, D. Maclaurin, G. Necula, A. Paszke, J. Van- derPlas, S. Wanderman-Milne, and Q. Zhang, JAX: composable transformations of Python+NumPy pro- grams (2018). [51]https://github.com/lucaleonect/nqa_server

work page 2018

[67] [67]

A. Chen, V. D. Naik, and M. Heyl, Convolutional trans- former wave functions (2025), arXiv:2503.10462 [cond- mat.dis-nn]

work page arXiv 2025

[68] [68]

Schmitt, M

M. Schmitt, M. M. Rams, J. Dziarmaga, M. Heyl, and W. H. Zurek, Quantum phase transition dynamics in the two-dimensional transverse-field ising model, Sci- ence Advances8, eabl6850 (2022)

work page 2022

[69] [69]

Mendes-Santos, M

T. Mendes-Santos, M. Schmitt, A. Angelone, A. Ro- driguez, P. Scholl, H. J. Williams, D. Barredo, T. La- haye, A. Browaeys, M. Heyl, and M. Dalmonte, Wave- function network description and kolmogorov complex- ity of quantum many-body systems, Phys. Rev. X14, 021029 (2024)

work page 2024

[70] [70]

Lange, A

H. Lange, A. Van de Walle, A. Abedinnia, and A. Bohrdt, From architectures to applications: A re- view of neural quantum states (2024), arXiv:2402.09402 [cond-mat.dis-nn]

work page arXiv 2024

[71] [71]

Schmitt and M

M. Schmitt and M. Heyl, Simulating dynamics of cor- related matter with neural quantum states (2025), arXiv:2506.03124 [quant-ph]

work page arXiv 2025

[72] [72]

(n.a.), we denote the Pauli-Z eigenvalues withx i =±1 for notational consistency with the following sections

work page

[73] [73]

Rattray, D

M. Rattray, D. Saad, and S.-i. Amari, Natural gradient descent for on-line learning, Phys. Rev. Lett.81, 5461 (1998)

work page 1998

[74] [74]

Revisiting Natural Gradient for Deep Networks

R. Pascanu and Y. Bengio, Revisiting natural gradient for deep networks (2014), arXiv:1301.3584 [cs.LG]

work page internal anchor Pith review Pith/arXiv arXiv 2014

[75] [75]

Ben-Israel and T

A. Ben-Israel and T. N. E. Greville,Generalized in- verses: theory and applications(Springer Science & Business Media, 2006)

work page 2006

[76] [76]

B. T. Polyak, Some methods of speeding up the conver- gence of iteration methods, USSR Computational Math- ematics and Mathematical Physics4, 1 (1964)

work page 1964

[77] [77]

D. E. Rumelhart, G. E. Hinton, and R. J. Williams, Learning representations by back-propagating errors, Nature323, 533 (1986)

work page 1986

[78] [78]

Sutskever, J

I. Sutskever, J. Martens, G. Dahl, and G. Hinton, On the importance of initialization and momentum in deep learning, inProceedings of the 30th International Conference on Machine Learning (ICML), ICML’13 (JMLR.org, 2013) pp. 1139–1147

work page 2013

[79] [79]

Das and B

A. Das and B. K. Chakrabarti, Colloquium: Quantum annealing and analog quantum computation, Rev. Mod. Phys.80, 1061 (2008)

work page 2008

[80] [80]

A. B. Finnila, M. A. Gomez, C. Sebenik, C. Stenson, and J. D. Doll, Quantum annealing: A new method for minimizing multidimensional functions, Chemical Physics Letters219, 343 (1994)

work page 1994