Diffusion-warm sampling of the XY model enables fast thermalization at scale

Pooya Ronagh; Roger Melko; Sehmimul Hoque

arxiv: 2606.30773 · v1 · pith:6IXI5PU5new · submitted 2026-06-29 · 🪐 quant-ph · cond-mat.dis-nn· cs.LG

Diffusion-warm sampling of the XY model enables fast thermalization at scale

Sehmimul Hoque , Roger Melko , Pooya Ronagh This is my paper

Pith reviewed 2026-07-01 01:44 UTC · model grok-4.3

classification 🪐 quant-ph cond-mat.dis-nncs.LG

keywords diffusion modelsXY modelMCMC samplingthermalizationgenerative modelsspin systemscontinuous symmetrieslattice models

0 comments

The pith

A temperature-conditioned diffusion model trained on small XY lattices generates accurate configurations for larger lattices that let MCMC thermalize an order of magnitude faster.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that training a temperature-conditioned diffusion model solely on smaller XY model lattices produces usable spin configurations for substantially larger lattices. When these diffusion outputs initialize Markov chain Monte Carlo runs, the number of steps needed to reach thermal equilibrium drops by a factor of roughly ten compared with random initialization. A sympathetic reader cares because standard MCMC thermalization times grow rapidly with lattice size for continuous-spin models, making direct simulation of large systems impractical. The experiments track spin correlations and other observables to show that the hybrid diffusion-plus-MCMC procedure yields results consistent with the target thermal distribution.

Core claim

Training a temperature-conditioned diffusion model on smaller-size XY model lattices enables the generation of accurate samples in larger lattice sizes. By tracking physically important observables of the model, such as spin correlations, our experiments demonstrate that diffusion sampling followed by a few MCMC steps reduces the thermalization time by an order of magnitude relative to the standard MCMC with random initialization. This supplies a route to scalable sampling of continuous-state spin systems.

What carries the argument

Temperature-conditioned diffusion model that maps smaller-lattice training data to warm-start configurations for MCMC on larger XY lattices.

If this is right

Sampling becomes feasible on lattice sizes too large for direct diffusion training or for slow random-start MCMC.
Only a small number of MCMC steps after diffusion generation suffice to reach equilibrium.
The technique extends MCMC to continuous-symmetry spin models where size generalization has been a bottleneck.
Generative models can now be used to initialize simulations of condensed-matter systems at previously inaccessible scales.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the size-independent accuracy holds, one diffusion model could serve as a reusable initializer across a wide range of lattice sizes.
The reduction in thermalization cost could make finite-size scaling studies near the XY model's Kosterlitz-Thouless transition more routine.
Analogous warm-start diffusion sampling might be tested on the Heisenberg model or other O(n) spin systems.
The approach could be combined with cluster algorithms or other advanced MCMC updates to push scales even farther.

Load-bearing premise

The diffusion model produces large-lattice configurations whose spin correlations and other observables match those of the true thermal ensemble at the target size, without systematic size-dependent errors.

What would settle it

On an intermediate lattice size where both long equilibrated MCMC and the hybrid method can be compared at high statistics, any statistically significant mismatch in measured two-point spin correlations or susceptibility would falsify the accuracy claim.

Figures

Figures reproduced from arXiv: 2606.30773 by Pooya Ronagh, Roger Melko, Sehmimul Hoque.

**Figure 1.** Figure 1: FIG. 1: The figure demonstrates samples generated from the diffusion model provide an efficient warm start for [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗

**Figure 2.** Figure 2: FIG. 2: The figure shows the whole pipeline of diffusion-warmed MCMC. Random configurations are passed through [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗

**Figure 3.** Figure 3: FIG. 3: The figure shows spin-spin correlation [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗

**Figure 4.** Figure 4: FIG. 4: The figure demonstrates that a small number of Wolff MCMC updates is sufficient to correct the spin-spin [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗

**Figure 5.** Figure 5: FIG. 5: The figure shows the expected energy and helicity modulus across a wide range of temperatures, [PITH_FULL_IMAGE:figures/full_fig_p009_5.png] view at source ↗

**Figure 6.** Figure 6: FIG. 6: The figure shows how [PITH_FULL_IMAGE:figures/full_fig_p014_6.png] view at source ↗

**Figure 7.** Figure 7: FIG. 7: The figure shows how heat capacity changes with different annealing strategies during inference. [PITH_FULL_IMAGE:figures/full_fig_p015_7.png] view at source ↗

**Figure 8.** Figure 8: FIG. 8: Training and validation loss curves. [PITH_FULL_IMAGE:figures/full_fig_p016_8.png] view at source ↗

**Figure 9.** Figure 9: FIG. 9: The figure shows samples produced after diffusion and then after [PITH_FULL_IMAGE:figures/full_fig_p016_9.png] view at source ↗

**Figure 10.** Figure 10: FIG. 10: The figure shows the architecture of the score network. The input spins are used to first get a sine and [PITH_FULL_IMAGE:figures/full_fig_p017_10.png] view at source ↗

read the original abstract

We introduce a novel technique for scalable sampling of spin-system states with continuous symmetries using diffusion models. By applying our approach to the XY model, a fundamental continuous-spin model in condensed matter physics, we show that our technique addresses the shortfalls of the Markov chain Monte Carlo (MCMC) in generalization to varying system sizes. More specifically, we show that training a temperature-conditioned diffusion model on smaller-size XY model lattices enables the generation of accurate samples in larger lattice sizes. By tracking physically important observables of the model, such as spin correlations, our experiments demonstrate that diffusion sampling followed by a few MCMC steps reduces the thermalization time by an order of magnitude relative to the standard MCMC with random initialization. Our study provides valuable insight as to how generative models can be used to study continuous-state condensed matter systems at scale.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Diffusion model trained on small XY lattices seeds faster MCMC on large ones, but the size-generalization evidence looks thin without more checks.

read the letter

The core result here is that a temperature-conditioned diffusion model trained only on small lattices can generate starting configurations for larger XY lattices; after a handful of MCMC sweeps those configurations thermalize an order of magnitude faster than random starts. That is the practical claim worth noticing.

What the work actually does is combine a generative model with standard MCMC in a way that targets the system-size bottleneck in continuous-spin simulations. The XY model is a reasonable test case because of the continuous symmetry and the Kosterlitz-Thouless physics. Tracking spin correlations after the hybrid procedure is a sensible first observable.

The soft spot is the extrapolation step itself. The abstract reports that correlations look good, but gives no quantitative error bars, no explicit finite-size scaling checks, and no discussion of whether vortex or higher-order angular statistics are preserved. If the diffusion output carries even modest size-dependent bias that survives a few sweeps, the claimed thermalization speedup shrinks or disappears. The stress-test note on undetected extrapolation artifacts is therefore on point; the paper needs to show that the learned distribution is close enough to the true Boltzmann measure on the larger lattices, not just that one or two observables match.

This is the kind of paper that would interest people running large-scale Monte Carlo on spin models who are already willing to try generative warm-starts. It is not yet a finished method, but the empirical demonstration is concrete enough that a serious referee should see it. I would send it out for review rather than desk-reject, with the expectation that the authors supply the missing validation numbers and checks on additional observables.

Referee Report

3 major / 1 minor

Summary. The paper introduces a diffusion-warm sampling technique for the XY model. It claims that a temperature-conditioned diffusion model trained only on smaller lattices generates accurate samples for larger lattices, and that diffusion sampling followed by a few MCMC steps reduces thermalization time by an order of magnitude relative to standard MCMC from random initialization, as verified by tracking spin correlations and other observables.

Significance. If the generalization to larger lattices holds without undetected biases, the approach would offer a practical route to faster equilibration in continuous-spin models, addressing a known limitation of MCMC at scale. The empirical demonstration of size extrapolation in a generative model for a physically relevant system is a concrete strength.

major comments (3)

[Abstract] Abstract: the claim that 'accuracy was demonstrated' by tracking spin correlations provides no quantitative error bars, training/validation split details, or finite-size scaling checks; this prevents verification of the central empirical claim that samples match the true thermal distribution on larger lattices.
[Experiments] The central claim requires that diffusion outputs on large lattices match the Boltzmann measure beyond the reported spin correlations (e.g., vortex configurations, higher-order angular correlations, or KT-transition finite-size corrections). No such checks are described, leaving open the possibility of mode collapse or size-dependent extrapolation artifacts that would invalidate the order-of-magnitude thermalization reduction.
[Results] The reported order-of-magnitude thermalization speedup after 'a few MCMC steps' is load-bearing for the practical utility claim, yet lacks explicit metrics (e.g., integrated autocorrelation times with uncertainties) or ablation against alternative warm-start methods.

minor comments (1)

[Abstract] Abstract: the phrase 'diffusion-warm sampling' is used without a concise definition; adding one sentence would improve immediate readability.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their careful reading and constructive comments. We address each major comment point by point below and will revise the manuscript accordingly to strengthen the presentation of our results.

read point-by-point responses

Referee: [Abstract] Abstract: the claim that 'accuracy was demonstrated' by tracking spin correlations provides no quantitative error bars, training/validation split details, or finite-size scaling checks; this prevents verification of the central empirical claim that samples match the true thermal distribution on larger lattices.

Authors: We agree that the abstract is brief and omits these specifics. The main text reports error bars on spin correlations, describes the training/validation procedure, and includes finite-size scaling analysis. We will revise the abstract to briefly reference these quantitative elements for improved clarity. revision: yes
Referee: [Experiments] The central claim requires that diffusion outputs on large lattices match the Boltzmann measure beyond the reported spin correlations (e.g., vortex configurations, higher-order angular correlations, or KT-transition finite-size corrections). No such checks are described, leaving open the possibility of mode collapse or size-dependent extrapolation artifacts that would invalidate the order-of-magnitude thermalization reduction.

Authors: We acknowledge that additional observables would provide stronger validation against mode collapse or artifacts. We will add analysis of vortex configurations and higher-order angular correlations in the revised experiments section. revision: yes
Referee: [Results] The reported order-of-magnitude thermalization speedup after 'a few MCMC steps' is load-bearing for the practical utility claim, yet lacks explicit metrics (e.g., integrated autocorrelation times with uncertainties) or ablation against alternative warm-start methods.

Authors: We will include integrated autocorrelation times with uncertainties and ablations against alternative warm-start methods in the updated results section to provide more rigorous support for the speedup. revision: yes

Circularity Check

0 steps flagged

No circularity in empirical sampling claims

full rationale

The paper reports an empirical technique: training a temperature-conditioned diffusion model on small XY lattices to generate samples for larger lattices, followed by limited MCMC steps that reduce observed thermalization time by an order of magnitude. All load-bearing claims are experimental measurements of observables (spin correlations) on generated configurations, which are independently falsifiable against exact or high-precision benchmarks and do not reduce to any fitted parameter, self-citation, or definitional identity within the paper's own equations. No derivation chain, uniqueness theorem, or ansatz is invoked that collapses to the inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the unstated premise that a diffusion model can be trained to approximate the Boltzmann distribution of the XY model sufficiently well that its outputs remain statistically faithful when the lattice size is increased; no explicit free parameters, axioms, or invented entities are named in the abstract.

axioms (1)

domain assumption A temperature-conditioned diffusion model can learn to sample from the equilibrium distribution of the XY model on finite lattices.
This assumption is required for the size-generalization claim to hold but is not derived or proven in the provided abstract.

pith-pipeline@v0.9.1-grok · 5675 in / 1245 out tokens · 36378 ms · 2026-07-01T01:44:53.403093+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

49 extracted references · 35 canonical work pages · 8 internal anchors

[1]

In practice, thermalization time is estimated using long MCMC runs by monitoring convergence of all observables

and different observables may converge at different rates [12, 30]. In practice, thermalization time is estimated using long MCMC runs by monitoring convergence of all observables. While convergence of observables does not imply that a system has reached equilibrium [11], this is the best that can be done for such physics systems as there is no clear metr...

2000
[2]

Anderson

Brian D.O. Anderson. Reverse-time diffusion equation models.Stochastic Processes and their Applications, 12(3):313–326, 1982. URL:https://www.sciencedirect.com/science/article/pii/ 0304414982900515,doi:10.1016/0304-4149(82)90051-5

work page doi:10.1016/0304-4149(82)90051-5 1982
[3]

V. L. Berezinskii. Destruction of Long-range Order in One-dimensional and Two-dimensional Systems having a Continuous Symmetry Group I. Classical Systems.Sov. Phys. JETP, 32:493–500, 1971

1971
[4]

The sample size required in importance sampling

Sourav Chatterjee and Persi Diaconis. The sample size required in importance sampling, 2017. URL: https://arxiv.org/abs/1511.01437,arXiv:1511.01437

work page internal anchor Pith review Pith/arXiv arXiv 2017
[5]

Dhruv Devulapalli, T. C. Mooney, and James D. Watson. The Complexity of Thermalization in Finite Quantum Systems, 2026. URL:https://arxiv.org/abs/2507.00405,arXiv:2507.00405

work page arXiv 2026
[6]

The Markov chain Monte Carlo revolution.Bulletin of the American Mathematical Society, 46:179–205, 2008

Persi Diaconis. The Markov chain Monte Carlo revolution.Bulletin of the American Mathematical Society, 46:179–205, 2008. URL:https://api.semanticscholar.org/CorpusID:14905050

2008
[7]

The Mixing Time Evolution of Glauber Dynamics for the Mean-Field Ising Model.Communications in Mathematical Physics, 289(2):725–764, Jul 2009

Jian Ding, Eyal Lubetzky, and Yuval Peres. The Mixing Time Evolution of Glauber Dynamics for the Mean-Field Ising Model.Communications in Mathematical Physics, 289(2):725–764, Jul 2009. doi:10.1007/s00220-009-0781-9

work page doi:10.1007/s00220-009-0781-9 2009
[8]

Edwards and Alan D

Robert G. Edwards and Alan D. Sokal. Dynamic critical behavior of Wolff’s collective-mode Monte Carlo algorithm for the two-dimensionalO(n)nonlinear σ model.Phys. Rev. D, 40:1374–1377, Aug 1989. URL:https://link.aps.org/doi/10.1103/PhysRevD.40.1374,doi:10.1103/PhysRevD.40.1374

work page doi:10.1103/physrevd.40.1374 1989
[9]

Fernández, Manuel F

Julio F. Fernández, Manuel F. Ferreira, and Jolanta Stankiewicz. Critical behavior of the two-dimensional XY model: A Monte Carlo simulation.Phys. Rev. B, 34:292–300, Jul 1986. URL:https://link.aps. org/doi/10.1103/PhysRevB.34.292,doi:10.1103/PhysRevB.34.292

work page doi:10.1103/physrevb.34.292 1986
[10]

RydbergGpt.Machine Learning: Science and Technology, 6(4):045057, Dec 2025.doi:10.1088/2632-2153/ae1d0b

David Fitzek, Yi Hong Teoh, H P Cyrus Fung, Gebremedhin A Dagnew, Ejaaz Merali, M Schuyler Moss, Benjamin MacLellan, and Roger G Melko. RydbergGpt.Machine Learning: Science and Technology, 6(4):045057, Dec 2025.doi:10.1088/2632-2153/ae1d0b

work page doi:10.1088/2632-2153/ae1d0b 2025
[11]

VegardFlovik, FerranMacià, andErikWahlström. Describingsynchronizationandtopologicalexcitations in arrays of magnetic spin torque oscillators through the Kuramoto model.Scientific Reports, 6(1):32528, Sep 2016.doi:10.1038/srep32528

work page doi:10.1038/srep32528 2016
[12]

Luis Pedro García-Pintos, Noah Linden, Artur S. L. Malabarba, Anthony J. Short, and Andreas Winter. Equilibration Time scales of Physically Relevant Observables.Phys. Rev. X, 7:031027, Aug 2017. URL: https://link.aps.org/doi/10.1103/PhysRevX.7.031027,doi:10.1103/PhysRevX.7.031027. 11

work page doi:10.1103/physrevx.7.031027 2017
[13]

Equilibration, thermalisation, and the emergence of statistical mechanics in closed quantum systems.Reports on Progress in Physics, 79(5):056001, Apr 2016

Christian Gogolin and Jens Eisert. Equilibration, thermalisation, and the emergence of statistical mechanics in closed quantum systems.Reports on Progress in Physics, 79(5):056001, Apr 2016. doi:10.1088/0034-4885/79/5/056001

work page doi:10.1088/0034-4885/79/5/056001 2016
[14]

PhD thesis, Waterloo U., 2017

Lauren Elizabeth Hayward Sierens.Simulating quantum matter through lattice field the- ories. PhD thesis, Waterloo U., 2017. URL: https://uwspace.uwaterloo.ca/items/ 14b662dd-a5dd-4ed1-b4f7-bcd5f1baad57

2017
[15]

Denoising Diffusion Probabilistic Models

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising Diffusion Probabilistic Models, 2020. URL: https://arxiv.org/abs/2006.11239,arXiv:2006.11239

work page internal anchor Pith review Pith/arXiv arXiv 2020
[16]

Classifier-Free Diffusion Guidance

Jonathan Ho and Tim Salimans. Classifier-Free Diffusion Guidance, 2022. URL:https://arxiv.org/ abs/2207.12598,arXiv:2207.12598

work page internal anchor Pith review Pith/arXiv arXiv 2022
[17]

Riemannian Diffusion Models, 2022

Chin-Wei Huang, Milad Aghajohari, Avishek Joey Bose, Prakash Panangaden, and Aaron Courville. Riemannian Diffusion Models, 2022. URL:https://arxiv.org/abs/2208.07949, arXiv:2208.07949

work page arXiv 2022
[18]

Phys., 1925,31, 253–258, doi:10.1007/BF02980577

Ernst Ising. Beitrag zur Theorie des Ferromagnetismus.Zeitschrift für Physik, 31(1):253–258, Feb 1925. doi:10.1007/BF02980577

work page doi:10.1007/bf02980577 1925
[19]

Torsional Diffusion for Molecular Conformer Generation.arXiv preprint arXiv:2206.01729, 2022

Bowen Jing, Gabriele Corso, Jeffrey Chang, Regina Barzilay, and Tommi Jaakkola. Torsional Diffusion for Molecular Conformer Generation.arXiv preprint arXiv:2206.01729, 2022. URL:https://arxiv. org/abs/2206.01729

work page arXiv 2022
[20]

José, Leo P

Jorge V. José, Leo P. Kadanoff, Scott Kirkpatrick, and David R. Nelson. Renormalization, vortices, and symmetry-breaking perturbations in the two-dimensional planar model.Phys. Rev. B, 16:1217–1241, Aug 1977. URL:https://link.aps.org/doi/10.1103/PhysRevB.16.1217, doi:10.1103/PhysRevB. 16.1217

work page doi:10.1103/physrevb.16.1217 1977
[21]

Adam: A Method for Stochastic Optimization

Diederik P. Kingma and Jimmy Ba. Adam: A Method for Stochastic Optimization, 2017. URL: https://arxiv.org/abs/1412.6980,arXiv:1412.6980

work page internal anchor Pith review Pith/arXiv arXiv 2017
[22]

Ordering, metastability and phase transitions in two-dimensional systems.Journal of Physics C: Solid State Physics, 6(7):1181, Apr 1973.doi:10.1088/0022-3719/6/ 7/010

J M Kosterlitz and D J Thouless. Ordering, metastability and phase transitions in two-dimensional systems.Journal of Physics C: Solid State Physics, 6(7):1181, Apr 1973.doi:10.1088/0022-3719/6/ 7/010

work page doi:10.1088/0022-3719/6/ 1973
[23]

Efficient Identification of Critical Transitions via flow Matching: A Scalable Generative Approach for Many-Body Systems, 2026

Qian-Rui Lee and Daw-Wei Wang. Efficient Identification of Critical Transitions via flow Matching: A Scalable Generative Approach for Many-Body Systems, 2026. URL:https://arxiv.org/abs/2508. 15318,arXiv:2508.15318

work page arXiv 2026
[24]

Curse-of-dimensionality revisited: Collapse of importance sampling in very large scale systems

Bo Li, Thomas Bengtsson, and Peter Bickel. Curse-of-dimensionality revisited: Collapse of importance sampling in very large scale systems. 01 2005. URL: https://statistics.berkeley.edu/sites/ default/files/tech-reports/696.pdf

2005
[25]

Uniformly Frustrated XY Model: Strengthening of the Vortex Lattice by Intrinsic Disorder.Condensed Matter, 6(4), 2021

Ilaria Maccari, Lara Benfatto, and Claudio Castellani. Uniformly Frustrated XY Model: Strengthening of the Vortex Lattice by Intrinsic Disorder.Condensed Matter, 6(4), 2021. URL:https://www.mdpi. com/2410-3896/6/4/42,doi:10.3390/condmat6040042

work page doi:10.3390/condmat6040042 2021
[26]

Enhancing deep neural networks through complex-valued representations and kuramoto synchronization dynamics.Transactions on Machine Learning Research, 2025

Sabine Muzellec, Andrea Alamia, Thomas Serre, and Rufin VanRullen. Enhancing deep neural networks through complex-valued representations and kuramoto synchronization dynamics.Transactions on Machine Learning Research, 2025. URL:https://openreview.net/forum?id=zx6QGmBL43

2025
[27]

Boltzmann Generators -- Sampling Equilibrium States of Many-Body Systems with Deep Learning

Frank Noé, Simon Olsson, Jonas Köhler, and Hao Wu. Boltzmann Generators – Sampling Equilibrium states of Many-Body Systems with Deep Learning, 2019. URL:https://arxiv.org/abs/1812.01729, arXiv:1812.01729

work page internal anchor Pith review Pith/arXiv arXiv 2019
[28]

Oliveira, Binquan Luan, Pierre M

Felipe L. Oliveira, Binquan Luan, Pierre M. Esteves, Mathias Steiner, and Rodrigo Neumann Bar- ros Ferreira. pymser-An Open-Source Library for Automatic Equilibration Detection in Molecular Simulations.Journal of Chemical Theory and Computation, 20(19):8559–8568, September 2024. URL: http://dx.doi.org/10.1021/acs.jctc.4c00417,doi:10.1021/acs.jctc.4c00417

work page doi:10.1021/acs.jctc.4c00417 2024
[29]

FiLM: Visual Reasoning with a General Conditioning Layer

Ethan Perez, Florian Strub, Harm de Vries, Vincent Dumoulin, and Aaron Courville. FiLM: Visual Reasoning with a General Conditioning Layer, 2017. URL: https://arxiv.org/abs/1709.07871, arXiv:1709.07871

work page internal anchor Pith review Pith/arXiv arXiv 2017
[30]

R. B. Potts. Some generalized order-disorder transformations.Mathematical Proceedings of the Cambridge Philosophical Society, 48(1):106–109, 1952.doi:10.1017/S0305004100027419

work page doi:10.1017/s0305004100027419 1952
[31]

Typical fast thermalization processes in closed many-body systems.Nature Communi- cations, 7(1):10821, Mar 2016.doi:10.1038/ncomms10821

Peter Reimann. Typical fast thermalization processes in closed many-body systems.Nature Communi- cations, 7(1):10821, Mar 2016.doi:10.1038/ncomms10821

work page doi:10.1038/ncomms10821 2016
[32]

CO-PUBLISHED WITH IMPERIAL COLLEGE PRESS,

David Ruelle.Statistical Mechanics. CO-PUBLISHED WITH IMPERIAL COLLEGE PRESS,
[33]

worldscientific.com/doi/pdf/10.1142/4090,doi:10.1142/4090

URL: https://www.worldscientific.com/doi/abs/10.1142/4090, arXiv:https://www. worldscientific.com/doi/pdf/10.1142/4090,doi:10.1142/4090. 12

work page doi:10.1142/4090
[34]

Anders W. Sandvik. Computational Studies of Quantum Spin Systems.AIP Conference Proceed- ings, 1297(1):135–338, 11 2010.arXiv:https://pubs.aip.org/aip/acp/article-pdf/1297/1/135/ 11407753/135_1_online.pdf,doi:10.1063/1.3518900

work page doi:10.1063/1.3518900 2010
[35]

Time Complexity in Deep Learning Models.Procedia Computer Science, 215:202–210, 2022

Bhoomi Shah and Hetal Bhavsar. Time Complexity in Deep Learning Models.Procedia Computer Science, 215:202–210, 2022. 4th International Conference on Innovative Data Communication Technology and Application. URL: https://www.sciencedirect.com/science/article/pii/S1877050922020944, doi:10.1016/j.procs.2022.12.023

work page doi:10.1016/j.procs.2022.12.023 2022
[36]

Generative Modeling by Estimating Gradients of the Data Distribution

Yang Song and Stefano Ermon. Generative Modeling by Estimating Gradients of the Data Distribution. InAdvances in Neural Information Processing Systems, pages 11895–11907, 2019. URL: https: //arxiv.org/abs/1907.05600

work page internal anchor Pith review Pith/arXiv arXiv 2019
[37]

Score-Based Generative Modeling through Stochastic Differential Equations

Yang Song, Jascha Sohl-Dickstein, Diederik P Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole. Score-Based Generative Modeling through Stochastic Differential Equations. InInternational Conference on Learning Representations, 2021. URL:https://openreview.net/forum?id=PxTIG12RRHS

2021
[38]

Yuntai Song. Monte Carlo simulation with Wolff algorithm for scaling behavior of two dimensional XY model with KT phase transition.Journal of Physics: Conference Series, 2649(1):012055, Nov 2023. doi:10.1088/1742-6596/2649/1/012055

work page doi:10.1088/1742-6596/2649/1/012055 2023
[39]

Numerical Studies of Vortices and Helicity Modulus in the Two-Dimensional Generalized XY Model.Frontiers in Physics, Volume10-2022, 2022

Yun-Zhou Sun, Qin Wu, Xiao-Li Yang, Yan Zhou, Lan-Yan Zhu, Quan Chen, and Qing An. Numerical Studies of Vortices and Helicity Modulus in the Two-Dimensional Generalized XY Model.Frontiers in Physics, Volume10-2022, 2022. URL: https://www.frontiersin.org/journals/physics/articles/ 10.3389/fphy.2022.851322,doi:10.3389/fphy.2022.851322

work page doi:10.3389/fphy.2022.851322 2022
[40]

Boltzmann sampling for an XY model using a non-degenerate optical parametric oscillator network.Quantum Science and Technology, 3(1):014004, Nov 2017.doi:10.1088/2058-9565/aa923b

Y Takeda, S Tamate, Y Yamamoto, H Takesue, T Inagaki, and S Utsunomiya. Boltzmann sampling for an XY model using a non-degenerate optical parametric oscillator network.Quantum Science and Technology, 3(1):014004, Nov 2017.doi:10.1088/2058-9565/aa923b

work page doi:10.1088/2058-9565/aa923b 2017
[41]

Simulating the classical XY model with a laser network

Shuhei Tamate, Yoshihisa Yamamoto, Alireza Marandi, Peter McMahon, and Shoko Utsunomiya. Simulating the classical XY model with a laser network, 2016. URL:https://arxiv.org/abs/1608. 00358,arXiv:1608.00358

work page internal anchor Pith review Pith/arXiv arXiv 2016
[42]

Collective Monte Carlo Updating for Spin Systems.Phys

Ulli Wolff. Collective Monte Carlo Updating for Spin Systems.Phys. Rev. Lett., 62:361–364, Jan
[43]

URL:https://link.aps.org/doi/10.1103/PhysRevLett.62.361, doi:10.1103/PhysRevLett. 62.361

work page doi:10.1103/physrevlett.62.361
[44]

Integrated spatial photonic XY Ising sampler based on a high-uniformity 1 x 8 multi-mode interferometer.Photon

Xin Ye, Wenjia Zhang, and Zuyuan He. Integrated spatial photonic XY Ising sampler based on a high-uniformity 1 x 8 multi-mode interferometer.Photon. Res., 13(5):1419–1427, May 2025. URL: https://opg.optica.org/prj/abstract.cfm?URI=prj-13-5-1419,doi:10.1364/PRJ.542991. Appendix A: Finding thermalization time In this section we describe how we computed the ...

work page doi:10.1364/prj.542991 2025
[45]

Initialize an empty clusterC, an empty queueQ and an empty listM containing sites which have been visited
[46]

Choose a random site(k, l)in the lattice and add the site(k, l)to the cluster C, the queueQ and listM
[47]

choose a random2dimensional unit vectorv
[48]

Now we reflect all the spins in the cluster

whileQis not empty (a) remove a spinsfromQ (b) for each neighbourn of s, if n is not present inM then add n to C, Q and M with probability padd = 1−exp (min (0,−2β(s·v)(n·v))) The cluster is formed when the queueQ is empty. Now we reflect all the spins in the cluster. Let’s say the set of all spins in the cluster is denoted bySc. These spins are reflected...
[49]

7 we compare the performance of the annealed guidance with a fixed guidance scale of1.5

In Fig. 7 we compare the performance of the annealed guidance with a fixed guidance scale of1.5. Note that no Wolff steps have been done on the diffusion samples here and we can see that the heat capacity of the diffusion samples produced by the annealed CFG method are clearly better than the samples produced by the fixed guidance scale. Appendix E: Exper...

[1] [1]

In practice, thermalization time is estimated using long MCMC runs by monitoring convergence of all observables

and different observables may converge at different rates [12, 30]. In practice, thermalization time is estimated using long MCMC runs by monitoring convergence of all observables. While convergence of observables does not imply that a system has reached equilibrium [11], this is the best that can be done for such physics systems as there is no clear metr...

2000

[2] [2]

Anderson

Brian D.O. Anderson. Reverse-time diffusion equation models.Stochastic Processes and their Applications, 12(3):313–326, 1982. URL:https://www.sciencedirect.com/science/article/pii/ 0304414982900515,doi:10.1016/0304-4149(82)90051-5

work page doi:10.1016/0304-4149(82)90051-5 1982

[3] [3]

V. L. Berezinskii. Destruction of Long-range Order in One-dimensional and Two-dimensional Systems having a Continuous Symmetry Group I. Classical Systems.Sov. Phys. JETP, 32:493–500, 1971

1971

[4] [4]

The sample size required in importance sampling

Sourav Chatterjee and Persi Diaconis. The sample size required in importance sampling, 2017. URL: https://arxiv.org/abs/1511.01437,arXiv:1511.01437

work page internal anchor Pith review Pith/arXiv arXiv 2017

[5] [5]

Dhruv Devulapalli, T. C. Mooney, and James D. Watson. The Complexity of Thermalization in Finite Quantum Systems, 2026. URL:https://arxiv.org/abs/2507.00405,arXiv:2507.00405

work page arXiv 2026

[6] [6]

The Markov chain Monte Carlo revolution.Bulletin of the American Mathematical Society, 46:179–205, 2008

Persi Diaconis. The Markov chain Monte Carlo revolution.Bulletin of the American Mathematical Society, 46:179–205, 2008. URL:https://api.semanticscholar.org/CorpusID:14905050

2008

[7] [7]

The Mixing Time Evolution of Glauber Dynamics for the Mean-Field Ising Model.Communications in Mathematical Physics, 289(2):725–764, Jul 2009

Jian Ding, Eyal Lubetzky, and Yuval Peres. The Mixing Time Evolution of Glauber Dynamics for the Mean-Field Ising Model.Communications in Mathematical Physics, 289(2):725–764, Jul 2009. doi:10.1007/s00220-009-0781-9

work page doi:10.1007/s00220-009-0781-9 2009

[8] [8]

Edwards and Alan D

Robert G. Edwards and Alan D. Sokal. Dynamic critical behavior of Wolff’s collective-mode Monte Carlo algorithm for the two-dimensionalO(n)nonlinear σ model.Phys. Rev. D, 40:1374–1377, Aug 1989. URL:https://link.aps.org/doi/10.1103/PhysRevD.40.1374,doi:10.1103/PhysRevD.40.1374

work page doi:10.1103/physrevd.40.1374 1989

[9] [9]

Fernández, Manuel F

Julio F. Fernández, Manuel F. Ferreira, and Jolanta Stankiewicz. Critical behavior of the two-dimensional XY model: A Monte Carlo simulation.Phys. Rev. B, 34:292–300, Jul 1986. URL:https://link.aps. org/doi/10.1103/PhysRevB.34.292,doi:10.1103/PhysRevB.34.292

work page doi:10.1103/physrevb.34.292 1986

[10] [10]

RydbergGpt.Machine Learning: Science and Technology, 6(4):045057, Dec 2025.doi:10.1088/2632-2153/ae1d0b

David Fitzek, Yi Hong Teoh, H P Cyrus Fung, Gebremedhin A Dagnew, Ejaaz Merali, M Schuyler Moss, Benjamin MacLellan, and Roger G Melko. RydbergGpt.Machine Learning: Science and Technology, 6(4):045057, Dec 2025.doi:10.1088/2632-2153/ae1d0b

work page doi:10.1088/2632-2153/ae1d0b 2025

[11] [11]

VegardFlovik, FerranMacià, andErikWahlström. Describingsynchronizationandtopologicalexcitations in arrays of magnetic spin torque oscillators through the Kuramoto model.Scientific Reports, 6(1):32528, Sep 2016.doi:10.1038/srep32528

work page doi:10.1038/srep32528 2016

[12] [12]

Luis Pedro García-Pintos, Noah Linden, Artur S. L. Malabarba, Anthony J. Short, and Andreas Winter. Equilibration Time scales of Physically Relevant Observables.Phys. Rev. X, 7:031027, Aug 2017. URL: https://link.aps.org/doi/10.1103/PhysRevX.7.031027,doi:10.1103/PhysRevX.7.031027. 11

work page doi:10.1103/physrevx.7.031027 2017

[13] [13]

Equilibration, thermalisation, and the emergence of statistical mechanics in closed quantum systems.Reports on Progress in Physics, 79(5):056001, Apr 2016

Christian Gogolin and Jens Eisert. Equilibration, thermalisation, and the emergence of statistical mechanics in closed quantum systems.Reports on Progress in Physics, 79(5):056001, Apr 2016. doi:10.1088/0034-4885/79/5/056001

work page doi:10.1088/0034-4885/79/5/056001 2016

[14] [14]

PhD thesis, Waterloo U., 2017

Lauren Elizabeth Hayward Sierens.Simulating quantum matter through lattice field the- ories. PhD thesis, Waterloo U., 2017. URL: https://uwspace.uwaterloo.ca/items/ 14b662dd-a5dd-4ed1-b4f7-bcd5f1baad57

2017

[15] [15]

Denoising Diffusion Probabilistic Models

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising Diffusion Probabilistic Models, 2020. URL: https://arxiv.org/abs/2006.11239,arXiv:2006.11239

work page internal anchor Pith review Pith/arXiv arXiv 2020

[16] [16]

Classifier-Free Diffusion Guidance

Jonathan Ho and Tim Salimans. Classifier-Free Diffusion Guidance, 2022. URL:https://arxiv.org/ abs/2207.12598,arXiv:2207.12598

work page internal anchor Pith review Pith/arXiv arXiv 2022

[17] [17]

Riemannian Diffusion Models, 2022

Chin-Wei Huang, Milad Aghajohari, Avishek Joey Bose, Prakash Panangaden, and Aaron Courville. Riemannian Diffusion Models, 2022. URL:https://arxiv.org/abs/2208.07949, arXiv:2208.07949

work page arXiv 2022

[18] [18]

Phys., 1925,31, 253–258, doi:10.1007/BF02980577

Ernst Ising. Beitrag zur Theorie des Ferromagnetismus.Zeitschrift für Physik, 31(1):253–258, Feb 1925. doi:10.1007/BF02980577

work page doi:10.1007/bf02980577 1925

[19] [19]

Torsional Diffusion for Molecular Conformer Generation.arXiv preprint arXiv:2206.01729, 2022

Bowen Jing, Gabriele Corso, Jeffrey Chang, Regina Barzilay, and Tommi Jaakkola. Torsional Diffusion for Molecular Conformer Generation.arXiv preprint arXiv:2206.01729, 2022. URL:https://arxiv. org/abs/2206.01729

work page arXiv 2022

[20] [20]

José, Leo P

Jorge V. José, Leo P. Kadanoff, Scott Kirkpatrick, and David R. Nelson. Renormalization, vortices, and symmetry-breaking perturbations in the two-dimensional planar model.Phys. Rev. B, 16:1217–1241, Aug 1977. URL:https://link.aps.org/doi/10.1103/PhysRevB.16.1217, doi:10.1103/PhysRevB. 16.1217

work page doi:10.1103/physrevb.16.1217 1977

[21] [21]

Adam: A Method for Stochastic Optimization

Diederik P. Kingma and Jimmy Ba. Adam: A Method for Stochastic Optimization, 2017. URL: https://arxiv.org/abs/1412.6980,arXiv:1412.6980

work page internal anchor Pith review Pith/arXiv arXiv 2017

[22] [22]

Ordering, metastability and phase transitions in two-dimensional systems.Journal of Physics C: Solid State Physics, 6(7):1181, Apr 1973.doi:10.1088/0022-3719/6/ 7/010

J M Kosterlitz and D J Thouless. Ordering, metastability and phase transitions in two-dimensional systems.Journal of Physics C: Solid State Physics, 6(7):1181, Apr 1973.doi:10.1088/0022-3719/6/ 7/010

work page doi:10.1088/0022-3719/6/ 1973

[23] [23]

Efficient Identification of Critical Transitions via flow Matching: A Scalable Generative Approach for Many-Body Systems, 2026

Qian-Rui Lee and Daw-Wei Wang. Efficient Identification of Critical Transitions via flow Matching: A Scalable Generative Approach for Many-Body Systems, 2026. URL:https://arxiv.org/abs/2508. 15318,arXiv:2508.15318

work page arXiv 2026

[24] [24]

Curse-of-dimensionality revisited: Collapse of importance sampling in very large scale systems

Bo Li, Thomas Bengtsson, and Peter Bickel. Curse-of-dimensionality revisited: Collapse of importance sampling in very large scale systems. 01 2005. URL: https://statistics.berkeley.edu/sites/ default/files/tech-reports/696.pdf

2005

[25] [25]

Uniformly Frustrated XY Model: Strengthening of the Vortex Lattice by Intrinsic Disorder.Condensed Matter, 6(4), 2021

Ilaria Maccari, Lara Benfatto, and Claudio Castellani. Uniformly Frustrated XY Model: Strengthening of the Vortex Lattice by Intrinsic Disorder.Condensed Matter, 6(4), 2021. URL:https://www.mdpi. com/2410-3896/6/4/42,doi:10.3390/condmat6040042

work page doi:10.3390/condmat6040042 2021

[26] [26]

Enhancing deep neural networks through complex-valued representations and kuramoto synchronization dynamics.Transactions on Machine Learning Research, 2025

Sabine Muzellec, Andrea Alamia, Thomas Serre, and Rufin VanRullen. Enhancing deep neural networks through complex-valued representations and kuramoto synchronization dynamics.Transactions on Machine Learning Research, 2025. URL:https://openreview.net/forum?id=zx6QGmBL43

2025

[27] [27]

Boltzmann Generators -- Sampling Equilibrium States of Many-Body Systems with Deep Learning

Frank Noé, Simon Olsson, Jonas Köhler, and Hao Wu. Boltzmann Generators – Sampling Equilibrium states of Many-Body Systems with Deep Learning, 2019. URL:https://arxiv.org/abs/1812.01729, arXiv:1812.01729

work page internal anchor Pith review Pith/arXiv arXiv 2019

[28] [28]

Oliveira, Binquan Luan, Pierre M

Felipe L. Oliveira, Binquan Luan, Pierre M. Esteves, Mathias Steiner, and Rodrigo Neumann Bar- ros Ferreira. pymser-An Open-Source Library for Automatic Equilibration Detection in Molecular Simulations.Journal of Chemical Theory and Computation, 20(19):8559–8568, September 2024. URL: http://dx.doi.org/10.1021/acs.jctc.4c00417,doi:10.1021/acs.jctc.4c00417

work page doi:10.1021/acs.jctc.4c00417 2024

[29] [29]

FiLM: Visual Reasoning with a General Conditioning Layer

Ethan Perez, Florian Strub, Harm de Vries, Vincent Dumoulin, and Aaron Courville. FiLM: Visual Reasoning with a General Conditioning Layer, 2017. URL: https://arxiv.org/abs/1709.07871, arXiv:1709.07871

work page internal anchor Pith review Pith/arXiv arXiv 2017

[30] [30]

R. B. Potts. Some generalized order-disorder transformations.Mathematical Proceedings of the Cambridge Philosophical Society, 48(1):106–109, 1952.doi:10.1017/S0305004100027419

work page doi:10.1017/s0305004100027419 1952

[31] [31]

Typical fast thermalization processes in closed many-body systems.Nature Communi- cations, 7(1):10821, Mar 2016.doi:10.1038/ncomms10821

Peter Reimann. Typical fast thermalization processes in closed many-body systems.Nature Communi- cations, 7(1):10821, Mar 2016.doi:10.1038/ncomms10821

work page doi:10.1038/ncomms10821 2016

[32] [32]

CO-PUBLISHED WITH IMPERIAL COLLEGE PRESS,

David Ruelle.Statistical Mechanics. CO-PUBLISHED WITH IMPERIAL COLLEGE PRESS,

[33] [33]

worldscientific.com/doi/pdf/10.1142/4090,doi:10.1142/4090

URL: https://www.worldscientific.com/doi/abs/10.1142/4090, arXiv:https://www. worldscientific.com/doi/pdf/10.1142/4090,doi:10.1142/4090. 12

work page doi:10.1142/4090

[34] [34]

Anders W. Sandvik. Computational Studies of Quantum Spin Systems.AIP Conference Proceed- ings, 1297(1):135–338, 11 2010.arXiv:https://pubs.aip.org/aip/acp/article-pdf/1297/1/135/ 11407753/135_1_online.pdf,doi:10.1063/1.3518900

work page doi:10.1063/1.3518900 2010

[35] [35]

Time Complexity in Deep Learning Models.Procedia Computer Science, 215:202–210, 2022

Bhoomi Shah and Hetal Bhavsar. Time Complexity in Deep Learning Models.Procedia Computer Science, 215:202–210, 2022. 4th International Conference on Innovative Data Communication Technology and Application. URL: https://www.sciencedirect.com/science/article/pii/S1877050922020944, doi:10.1016/j.procs.2022.12.023

work page doi:10.1016/j.procs.2022.12.023 2022

[36] [36]

Generative Modeling by Estimating Gradients of the Data Distribution

Yang Song and Stefano Ermon. Generative Modeling by Estimating Gradients of the Data Distribution. InAdvances in Neural Information Processing Systems, pages 11895–11907, 2019. URL: https: //arxiv.org/abs/1907.05600

work page internal anchor Pith review Pith/arXiv arXiv 2019

[37] [37]

Score-Based Generative Modeling through Stochastic Differential Equations

Yang Song, Jascha Sohl-Dickstein, Diederik P Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole. Score-Based Generative Modeling through Stochastic Differential Equations. InInternational Conference on Learning Representations, 2021. URL:https://openreview.net/forum?id=PxTIG12RRHS

2021

[38] [38]

Yuntai Song. Monte Carlo simulation with Wolff algorithm for scaling behavior of two dimensional XY model with KT phase transition.Journal of Physics: Conference Series, 2649(1):012055, Nov 2023. doi:10.1088/1742-6596/2649/1/012055

work page doi:10.1088/1742-6596/2649/1/012055 2023

[39] [39]

Numerical Studies of Vortices and Helicity Modulus in the Two-Dimensional Generalized XY Model.Frontiers in Physics, Volume10-2022, 2022

Yun-Zhou Sun, Qin Wu, Xiao-Li Yang, Yan Zhou, Lan-Yan Zhu, Quan Chen, and Qing An. Numerical Studies of Vortices and Helicity Modulus in the Two-Dimensional Generalized XY Model.Frontiers in Physics, Volume10-2022, 2022. URL: https://www.frontiersin.org/journals/physics/articles/ 10.3389/fphy.2022.851322,doi:10.3389/fphy.2022.851322

work page doi:10.3389/fphy.2022.851322 2022

[40] [40]

Boltzmann sampling for an XY model using a non-degenerate optical parametric oscillator network.Quantum Science and Technology, 3(1):014004, Nov 2017.doi:10.1088/2058-9565/aa923b

Y Takeda, S Tamate, Y Yamamoto, H Takesue, T Inagaki, and S Utsunomiya. Boltzmann sampling for an XY model using a non-degenerate optical parametric oscillator network.Quantum Science and Technology, 3(1):014004, Nov 2017.doi:10.1088/2058-9565/aa923b

work page doi:10.1088/2058-9565/aa923b 2017

[41] [41]

Simulating the classical XY model with a laser network

Shuhei Tamate, Yoshihisa Yamamoto, Alireza Marandi, Peter McMahon, and Shoko Utsunomiya. Simulating the classical XY model with a laser network, 2016. URL:https://arxiv.org/abs/1608. 00358,arXiv:1608.00358

work page internal anchor Pith review Pith/arXiv arXiv 2016

[42] [42]

Collective Monte Carlo Updating for Spin Systems.Phys

Ulli Wolff. Collective Monte Carlo Updating for Spin Systems.Phys. Rev. Lett., 62:361–364, Jan

[43] [43]

URL:https://link.aps.org/doi/10.1103/PhysRevLett.62.361, doi:10.1103/PhysRevLett. 62.361

work page doi:10.1103/physrevlett.62.361

[44] [44]

Integrated spatial photonic XY Ising sampler based on a high-uniformity 1 x 8 multi-mode interferometer.Photon

Xin Ye, Wenjia Zhang, and Zuyuan He. Integrated spatial photonic XY Ising sampler based on a high-uniformity 1 x 8 multi-mode interferometer.Photon. Res., 13(5):1419–1427, May 2025. URL: https://opg.optica.org/prj/abstract.cfm?URI=prj-13-5-1419,doi:10.1364/PRJ.542991. Appendix A: Finding thermalization time In this section we describe how we computed the ...

work page doi:10.1364/prj.542991 2025

[45] [45]

Initialize an empty clusterC, an empty queueQ and an empty listM containing sites which have been visited

[46] [46]

Choose a random site(k, l)in the lattice and add the site(k, l)to the cluster C, the queueQ and listM

[47] [47]

choose a random2dimensional unit vectorv

[48] [48]

Now we reflect all the spins in the cluster

whileQis not empty (a) remove a spinsfromQ (b) for each neighbourn of s, if n is not present inM then add n to C, Q and M with probability padd = 1−exp (min (0,−2β(s·v)(n·v))) The cluster is formed when the queueQ is empty. Now we reflect all the spins in the cluster. Let’s say the set of all spins in the cluster is denoted bySc. These spins are reflected...

[49] [49]

7 we compare the performance of the annealed guidance with a fixed guidance scale of1.5

In Fig. 7 we compare the performance of the annealed guidance with a fixed guidance scale of1.5. Note that no Wolff steps have been done on the diffusion samples here and we can see that the heat capacity of the diffusion samples produced by the annealed CFG method are clearly better than the samples produced by the fixed guidance scale. Appendix E: Exper...