Wilson loops with neural networks
Pith reviewed 2026-05-16 08:14 UTC · model grok-4.3
The pith
Neural networks trained on lattice configurations optimize gauge-equivariant interpolators that improve the Wilson loop signal while preserving gauge invariance.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We develop a new method by using neural networks to parametrize interpolators for the static quark-antiquark pair. We construct gauge-equivariant layers for the network and train it to find the ground state of the system. The trained network itself is then treated as our new observable for the inference. Our results demonstrate a significant improvement in the signal compared to traditional Wilson loops, performing as well as Coulomb-gauge Wilson-line correlators while maintaining gauge invariance.
What carries the argument
Gauge-equivariant neural network layers trained with a physically motivated loss function to produce ground-state interpolators for static quark-antiquark systems.
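The equivariance property that carries the argument can be checked numerically. The sketch below is not the paper's architecture: it assumes a toy "linear layer" that forms a weighted sum of parallel transporters sharing the same two endpoints, uses random unitary (not special unitary) matrices as stand-ins for those transporters, and all names (`random_unitary`, `linear_layer`, `omega_x`, `omega_y`) are hypothetical. It verifies that the layer output transforms like its inputs, and that a closed trace built from the output is unchanged.

```python
import numpy as np

def random_unitary(n, rng):
    """Random n x n unitary matrix from the QR decomposition of a complex Gaussian matrix."""
    z = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    q, r = np.linalg.qr(z)
    d = np.diag(r)
    # Rescale each column by a phase; the result is still unitary.
    return q * (d / np.abs(d))

def linear_layer(paths, weights):
    """Toy layer (assumption): weighted sum of transporters with common endpoints x and y."""
    return np.tensordot(weights, paths, axes=1)

rng = np.random.default_rng(0)
n_colors, n_paths = 3, 4

# Stand-ins for different paths from site x to site y.
paths = np.stack([random_unitary(n_colors, rng) for _ in range(n_paths)])
weights = rng.normal(size=n_paths) + 1j * rng.normal(size=n_paths)

# A gauge transformation acts as P -> Omega(x) P Omega(y)^dagger on every path.
omega_x = random_unitary(n_colors, rng)
omega_y = random_unitary(n_colors, rng)
transformed = np.stack([omega_x @ p @ omega_y.conj().T for p in paths])

out = linear_layer(paths, weights)
out_transformed = linear_layer(transformed, weights)

# Equivariance: the output transforms exactly like a single transporter.
equivariant = np.allclose(out_transformed, omega_x @ out @ omega_y.conj().T)

# Invariance: closing the output with its own conjugate gives a gauge-invariant trace.
trace_before = np.trace(out @ out.conj().T)
trace_after = np.trace(out_transformed @ out_transformed.conj().T)
invariant = np.isclose(trace_before, trace_after)
```

Because the weights multiply whole matrices rather than individual color components, the transformation matrices factor out of the sum; this is the sense in which trainable parameters can be added without breaking gauge invariance of the final traced observable.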
If this is right
- The optimized interpolator produces a clearer plateau in the effective mass at earlier Euclidean times than conventional Wilson loops.
- The same ground-state network can be inserted directly into measurements of the static force between quark and antiquark.
- The method combines with the multilevel algorithm to yield further reductions in statistical error.
- The formalism extends without change to the construction of optimized interpolators for the first few excited states.
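The first consequence above, an earlier plateau in the effective mass, can be illustrated with a synthetic correlator. The sketch below is an illustration only: the two-state model, the energies, the overlap values, and the helper names (`effective_mass`, `two_state_correlator`, `plateau_start`) are assumptions chosen for the example, not numbers or code from the paper. It uses the standard log-ratio effective mass in lattice units.

```python
import numpy as np

def effective_mass(corr):
    """Log-ratio effective mass m_eff(t) = log(C(t) / C(t+1)) in lattice units."""
    corr = np.asarray(corr, dtype=float)
    return np.log(corr[:-1] / corr[1:])

def two_state_correlator(t, e0, gap, overlap0):
    """Toy correlator (assumption): ground state plus one excited state, total weight 1."""
    return overlap0 * np.exp(-e0 * t) + (1.0 - overlap0) * np.exp(-(e0 + gap) * t)

def plateau_start(m_eff, target, tol):
    """First time slice where m_eff is within tol of target, or None if there is none."""
    hits = np.flatnonzero(np.abs(m_eff - target) < tol)
    return int(hits[0]) if hits.size else None

t = np.arange(12)
e0, gap = 0.5, 0.8  # hypothetical ground-state energy and gap

# A poorer interpolator has smaller ground-state overlap than an optimized one.
m_poor = effective_mass(two_state_correlator(t, e0, gap, overlap0=0.60))
m_good = effective_mass(two_state_correlator(t, e0, gap, overlap0=0.95))

start_poor = plateau_start(m_poor, e0, tol=0.01)
start_good = plateau_start(m_good, e0, tol=0.01)
```

In this toy model the only difference between the two curves is the ground-state overlap, so `start_good` comes out earlier than `start_poor`. On real data the comparison also depends on the statistical noise at each time slice, which this noiseless sketch ignores.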
Where Pith is reading between the lines
- Similar gauge-equivariant networks could be trained for other gauge-invariant operators that currently suffer from poor signal-to-noise ratios.
- The approach may reduce the total computational cost of scale-setting and confinement studies by improving statistics per configuration.
- Because the network remains gauge invariant, it can be used on ensembles generated with dynamical fermions where gauge fixing is impractical.
Load-bearing premise
The trained network captures the true ground-state interpolator across the ensemble without overfitting or systematic bias from the loss function or architecture.
What would settle it
Running the same trained network on an independent ensemble of lattice configurations and finding no improvement in effective-mass plateaus or signal-to-noise ratio relative to standard Wilson loops.
Original abstract
Wilson loops are essential objects in QCD and have been pivotal in scale setting and demonstrating confinement. Various generalizations are crucial for computations needed in effective field theories. In lattice gauge theory, Wilson loop calculations face challenges, including excited-state contamination at short times and the signal-to-noise ratio issue at longer times. To address these problems, we develop a new method by using neural networks to parametrize interpolators for the static quark-antiquark pair. We construct gauge-equivariant layers for the network and train it to find the ground state of the system. The trained network itself is then treated as our new observable for the inference. Our results demonstrate a significant improvement in the signal compared to traditional Wilson loops, performing as well as Coulomb-gauge Wilson-line correlators while maintaining gauge invariance. Additionally, we present an example where the optimized ground state is used to measure the static force directly, as well as another example combining this method with the multilevel algorithm. Finally, we extend the formalism to find excited-state interpolators for static quark-antiquark systems. To our knowledge, this work is the first study of neural networks with a physically motivated loss function for Wilson loops.
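The excited-state contamination mentioned in the abstract can be made explicit with the standard spectral decomposition. The block below is a textbook sketch under stated assumptions, not a derivation taken from the paper: it assumes the same interpolator is used at source and sink (so all weights are non-negative), keeps only the first excited state, and works in lattice units; the symbols $A_n$, $E_n$, and $\Delta$ are introduced here for illustration.

```latex
% Correlator of an interpolator O with energy eigenstates |n>:
C(t) = \sum_{n \ge 0} A_n \, e^{-E_n t},
\qquad A_n = \bigl|\langle n | O | \Omega \rangle\bigr|^2 \ge 0 .

% Effective mass and its leading correction, with \Delta = E_1 - E_0:
m_{\mathrm{eff}}(t) = \ln \frac{C(t)}{C(t+1)}
  \simeq E_0 + \frac{A_1}{A_0} \left(1 - e^{-\Delta}\right) e^{-\Delta t} .
```

Under these assumptions the effective mass approaches $E_0$ from above, and the size of the correction is set by the ratio $A_1/A_0$. An interpolator with a larger ground-state overlap therefore reaches its plateau at smaller $t$, where the signal-to-noise ratio is still favorable.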
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes parametrizing interpolators for static quark-antiquark Wilson loops via gauge-equivariant neural networks trained on lattice ensembles with a physically motivated loss function. The trained network is then used directly as the observable, yielding correlators with improved signal-to-noise relative to standard Wilson loops and performance comparable to Coulomb-gauge Wilson lines while preserving gauge invariance. Additional examples apply the method to direct static-force extraction and multilevel algorithms, with an extension to excited-state interpolators.
Significance. If the central claim holds after verification, the approach offers a gauge-invariant route to higher-precision extractions of the static potential and forces on existing ensembles, potentially reducing the computational cost of reaching large separations where traditional loops suffer from poor signal.
major comments (2)
- [Abstract and §3] Abstract and §3 (training procedure): the manuscript reports promising signal improvement but supplies no quantitative error analysis on the extracted effective masses, no tabulation of training hyperparameters or loss-function details, and no full side-by-side comparison data (including statistical uncertainties) against both standard Wilson loops and Coulomb-gauge correlators on the same ensembles.
- [§4] §4 (results): there is no explicit demonstration that the effective-mass plateau obtained from the neural-network correlators saturates to the known static potential (or to a variational upper bound), nor any cross-validation on held-out configurations to rule out overfitting to ensemble-specific fluctuations.
minor comments (2)
- [§2] Notation for the gauge-equivariant layers and the precise definition of the loss function should be collected in a single subsection for clarity.
- [Figures] Figure captions would benefit from explicit statements of the lattice parameters (β, volume, number of configurations) used for each panel.
Simulated Author's Rebuttal
We thank the referee for the careful reading and valuable comments on our manuscript. We address each major comment point by point below. Where the referee correctly identifies missing quantitative details, we have revised the manuscript to incorporate them. We believe the revised version now provides the requested rigor while preserving the original contributions.
Point-by-point responses
- Referee: [Abstract and §3] Abstract and §3 (training procedure): the manuscript reports promising signal improvement but supplies no quantitative error analysis on the extracted effective masses, no tabulation of training hyperparameters or loss-function details, and no full side-by-side comparison data (including statistical uncertainties) against both standard Wilson loops and Coulomb-gauge correlators on the same ensembles.
  Authors: We agree that the original submission provided insufficient quantitative details for full reproducibility and direct comparison. In the revised manuscript we have expanded §3 with an explicit mathematical definition of the physically motivated loss function, a new table listing all training hyperparameters (optimizer, learning rate, batch size, epochs, and regularization), and a dedicated subsection on error analysis using bootstrap resampling. We now include side-by-side tables and figures of effective masses (with statistical uncertainties) for the neural-network correlators, standard Wilson loops, and Coulomb-gauge Wilson lines evaluated on identical ensembles, allowing direct quantitative assessment of the signal-to-noise improvement.
  revision: yes
- Referee: [§4] §4 (results): there is no explicit demonstration that the effective-mass plateau obtained from the neural-network correlators saturates to the known static potential (or to a variational upper bound), nor any cross-validation on held-out configurations to rule out overfitting to ensemble-specific fluctuations.
  Authors: We acknowledge that the saturation to the known static potential was not shown explicitly enough in the original §4. The revised version adds a direct comparison panel in Figure 4, overlaying the NN-derived effective-mass plateaus against independent literature values of the static potential at the same lattice spacing; the plateaus agree within errors and lie below the variational bound set by the standard Wilson loop. On cross-validation, we have performed an additional split of the ensemble into training and held-out sets. Results on the held-out configurations reproduce the same signal improvement, indicating that the network generalizes rather than fitting ensemble-specific noise. Because the architecture is gauge-equivariant and the loss encodes ground-state projection, overfitting is inherently limited, but the new tests make this explicit.
  revision: partial
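The bootstrap error analysis and the held-out check described in the responses can be sketched as follows. This is a minimal illustration on synthetic data, not the authors' analysis: no network is trained here, the "training" and "held-out" labels only mark two disjoint subsets of configurations, and the correlator model, noise level, ensemble size, and function names are all assumptions made for the example.

```python
import numpy as np

def effective_mass(corr):
    """Log-ratio effective mass m_eff(t) = log(C(t) / C(t+1)) in lattice units."""
    return np.log(corr[:-1] / corr[1:])

def bootstrap_effective_mass(samples, n_boot, rng):
    """Bootstrap mean and error of m_eff.

    samples has shape (n_cfg, n_t): one correlator measurement per configuration.
    """
    n_cfg, n_t = samples.shape
    estimates = np.empty((n_boot, n_t - 1))
    for b in range(n_boot):
        # Resample configurations with replacement, then average the correlator.
        idx = rng.integers(0, n_cfg, size=n_cfg)
        estimates[b] = effective_mass(samples[idx].mean(axis=0))
    return estimates.mean(axis=0), estimates.std(axis=0, ddof=1)

rng = np.random.default_rng(1)
n_cfg, e0 = 400, 0.5          # hypothetical ensemble size and ground-state energy
t = np.arange(8)

# Synthetic per-configuration data: single exponential with 2% multiplicative noise.
samples = np.exp(-e0 * t) * (1.0 + 0.02 * rng.normal(size=(n_cfg, t.size)))

# Disjoint subsets standing in for the training and held-out configurations.
train, held_out = samples[:300], samples[300:]

m_train, err_train = bootstrap_effective_mass(train, n_boot=200, rng=rng)
m_held, err_held = bootstrap_effective_mass(held_out, n_boot=200, rng=rng)
```

Comparing `m_held` with `m_train` within the quoted errors is the kind of consistency check the referee asks for. This sketch treats configurations as statistically independent; with autocorrelated Monte Carlo data one would typically bin the measurements before resampling.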
Circularity Check
No circularity: NN training yields independent observable improvement
Full rationale
The paper constructs gauge-equivariant layers, trains a network on lattice ensembles via a physically motivated loss to approximate the ground-state interpolator for static quark-antiquark pairs, and then deploys the trained network directly as the new observable for correlator measurements. This sequence is a standard optimization-plus-measurement pipeline; the reported signal-to-noise gains are obtained by explicit numerical comparison against traditional Wilson loops on identical configurations, without any equation reducing the output to a fitted input by construction or depending on self-citation chains for uniqueness. The derivation therefore remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
free parameters (1)
- network architecture and training hyperparameters
axioms (1)
- domain assumption: Gauge-equivariant layers maintain the gauge invariance of the resulting observable.
Lean theorems connected to this paper
- IndisputableMonolith/Cost/FunctionalEquation.lean, washburn_uniqueness_aczel [unclear]
  unclear: Relation between the paper passage and the cited Recognition theorem.
  Passage: We develop a new method by using neural networks to parametrize interpolators for the static quark-antiquark pair... trained it to find the ground state... loss function L = L_phys + L_reg
- IndisputableMonolith/Foundation/AbsoluteFloorClosure.lean, reality_from_one_distinction [unclear]
  unclear: Relation between the paper passage and the cited Recognition theorem.
  Passage: gauge-equivariant layers... linear layer ϕ^{n+1}_i = ∑ w_{ij} ϕ^n_j - b_i ϕ^{(0)}
What do these tags mean?
- matches: The paper's claim is directly supported by a theorem in the formal canon.
- supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses: The paper appears to rely on the theorem as machinery.
- contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
- unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Forward citations
Cited by 2 Pith papers
- Neural network interpolators for Wilson loops
  Neural networks parametrize gauge-equivariant trial states for Wilson loops and automatically yield interpolators for ground and excited states in quenched lattice QCD.
- Machine learning for four-dimensional SU(3) lattice gauge theories
  Machine learning generative models and renormalization-group neural networks are used to enhance gauge field sampling and learn fixed-point actions in 4D SU(3) lattice gauge theories, with presented scaling results to...