pith. sign in

arxiv: 2605.17083 · v1 · pith:GAZ2F2NLnew · submitted 2026-05-16 · ⚛️ physics.comp-ph · cond-mat.dis-nn· cond-mat.str-el· cond-mat.supr-con· physics.chem-ph

Basis-free neural-network geminal and Jastrow factors for variational Monte Carlo

Pith reviewed 2026-05-20 15:06 UTC · model grok-4.3

classification ⚛️ physics.comp-ph cond-mat.dis-nncond-mat.str-elcond-mat.supr-conphysics.chem-ph
keywords variational Monte Carloneural network quantum statesantisymmetrized geminal powerJastrow factorhydrogen systemsnodal surfacedynamical correlationbasis-free ansatz
0
0 comments X

The pith

Neural-network replacements for basis sets in geminal and Jastrow factors separate nodal definition from dynamical correlation in variational Monte Carlo.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a basis-free ansatz that pairs an antisymmetrized geminal power determinant with neural-network Jastrow factors for use in variational Monte Carlo. The AGP component sets the nodal surface while the neural networks handle dynamical correlation without altering the nodes. This design allows the authors to isolate errors from static correlation versus dynamical effects. Tests on the hydrogen molecule and a rectangular hydrogen tetramer reach sub-millihartree accuracy when the nodes are suitable, but reveal remaining problems when the geometry challenges the nodal surface.

Core claim

The authors show that replacing conventional basis-set expansions with feed-forward neural networks in both the geminal and Jastrow constructions yields a compact wave function that achieves sub-millihartree accuracy for the hydrogen molecule and the rectangular hydrogen tetramer whenever the AGP nodes are adequate, while highlighting the residual nodal limitation near the large-radius square geometry of the hydrogen tetramer.

What carries the argument

The basis-free Jastrow-AGP ansatz in which an antisymmetrized geminal power determinant defines the nodal surface and a neural-network Jastrow factor recovers dynamical correlation at fixed nodes.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Extending this separation to larger systems could help identify when additional nodal optimization is needed beyond neural Jastrow factors.
  • Similar neural replacements might apply to other antisymmetric wave function forms to reduce basis dependence.
  • Testing on systems with known exact nodes would confirm the isolation of dynamical correlation improvements.

Load-bearing premise

The AGP determinant supplies an adequate nodal surface so the neural-network Jastrow factor can recover dynamical correlation at fixed nodes without adjusting the nodes.

What would settle it

A variational Monte Carlo calculation on the square hydrogen tetramer at large separation that fails to reach sub-millihartree accuracy even after full optimization of the neural-network parameters would indicate that the nodal limitation cannot be overcome by the Jastrow factor alone.

Figures

Figures reproduced from arXiv: 2605.17083 by Jan Kessler, Thomas D. K\"uhne.

Figure 1
Figure 1. Figure 1: FIG. 1. Optimization protocol used for the neural-network [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 3
Figure 3. Figure 3: FIG. 3. Geometry and parametrizations of the rectangular [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗
Figure 2
Figure 2. Figure 2: b resolves the remaining deviations. The rela￾tive error increases below 2 Bohr, i.e., around and below the equilibrium distance of about 1.4 Bohr, reaching ap￾proximately 0.0025% or 0.025 mHa. At larger separations the error remains close to 0.001%. This behavior dif￾fers from many conventional single-reference electronic￾structure methods, which are often accurate near equi￾librium but deteriorate for st… view at source ↗
Figure 4
Figure 4. Figure 4: FIG. 4 [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: FIG. 5 [PITH_FULL_IMAGE:figures/full_fig_p006_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: FIG. 6. Energy differences to reference values [5, 44] for NNAGP + all-body-NNJF wave functions with different numbers of [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: FIG. 7 [PITH_FULL_IMAGE:figures/full_fig_p009_7.png] view at source ↗
read the original abstract

Neural-network quantum states offer a flexible route to compact many-electron wave functions, but their practical accuracy depends strongly on how fermionic antisymmetry, electron correlation, and optimization noise are treated. Here we combine an antisymmetrized geminal power (AGP) determinant with feed-forward neural networks that replace conventional basis-set expansions in the geminal and in two Jastrow-factor constructions. The resulting basis-free Jastrow--AGP ansatz is optimized by variational Monte Carlo and is designed to separate two tasks: the AGP part defines the nodal surface, while the neural-network Jastrow factor recovers dynamical correlation at fixed nodes. This separation makes it possible to distinguish errors associated with dynamical correlation from those caused by static, multireference correlation. Applications to the hydrogen molecule and the rectangular hydrogen tetramer show sub-millihartree accuracy when the AGP nodes are adequate, and expose the residual nodal limitation near the large-radius square geometry of the hydrogen tetramer. These results clarify where neural-network building blocks can improve a compact geminal ansatz and where additional nodal flexibility is required.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The manuscript presents a basis-free variational Monte Carlo ansatz that replaces conventional basis expansions with feed-forward neural networks for both the antisymmetrized geminal power (AGP) geminal and two Jastrow-factor constructions. The central design choice fixes the nodal surface via the AGP determinant while using the neural-network Jastrow solely to recover dynamical correlation at those fixed nodes. Numerical results on H2 and the rectangular H4 geometry reach sub-millihartree accuracy relative to reference values when the AGP nodes are adequate; the same framework is used to identify a residual error at the large-radius square H4 geometry that is attributed to nodal limitations.

Significance. If the separation of nodal and dynamical-correlation errors can be rigorously validated, the work supplies a practical diagnostic for deciding when additional nodal flexibility (e.g., beyond a single AGP) is required in neural-network quantum states. The concrete accuracy numbers on two small systems and the explicit identification of a nodal bottleneck constitute a useful benchmark contribution for the VMC community.

major comments (1)
  1. [§4] §4 (H4 results) and the abstract: the claim that the residual error near the large-radius square geometry is caused by nodal limitations presupposes that the neural-network Jastrow has already saturated the dynamical correlation recoverable at the fixed AGP nodes. No scaling study with Jastrow network depth/width, no comparison against a converged conventional Jastrow at the same nodes, and no independent error decomposition are reported; without such evidence the attribution remains inconclusive.
minor comments (2)
  1. [§2] The precise functional form of the neural-network geminal (how the AGP coefficients are generated from the network output) should be written explicitly, preferably with an equation.
  2. [Figures] Figure captions for the H4 energy curves should state the reference method and basis used for the comparison values.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the careful reading of our manuscript and for the constructive feedback. The major comment raises a valid point about the strength of evidence supporting our attribution of the residual error to nodal limitations. We address this concern directly below and outline revisions that will be made to the manuscript.

read point-by-point responses
  1. Referee: [§4] §4 (H4 results) and the abstract: the claim that the residual error near the large-radius square geometry is caused by nodal limitations presupposes that the neural-network Jastrow has already saturated the dynamical correlation recoverable at the fixed AGP nodes. No scaling study with Jastrow network depth/width, no comparison against a converged conventional Jastrow at the same nodes, and no independent error decomposition are reported; without such evidence the attribution remains inconclusive.

    Authors: We agree with the referee that a more rigorous demonstration that the neural-network Jastrow has saturated the dynamical correlation at the fixed AGP nodes would make the attribution of the residual error to nodal limitations more conclusive. The current results show that two distinct neural-network Jastrow constructions yield essentially the same residual error at the large-radius square H4 geometry (while both recover sub-millihartree accuracy at the rectangular geometry), which we interpret as evidence that further dynamical-correlation recovery is not possible at those nodes. However, we acknowledge that this interpretation would be strengthened by the additional checks the referee suggests. We will therefore add a scaling study with respect to Jastrow network depth and width, include a comparison against a converged conventional Jastrow factor at the same AGP nodes, and provide a clearer error decomposition in the revised manuscript. The language in the abstract and §4 will be adjusted to reflect that the nodal limitation is inferred from the saturation behavior observed with the present flexible Jastrow forms. revision: yes

Circularity Check

0 steps flagged

No circularity: variational optimization and external benchmarks keep results independent of input definitions

full rationale

The paper defines a variational Monte Carlo procedure in which neural-network parameters for the AGP geminal and Jastrow factors are optimized to minimize the energy expectation value. Reported sub-millihartree accuracies are obtained by direct comparison of these variational energies against known external reference values for the H2 and rectangular H4 systems. The separation of tasks (AGP supplying fixed nodes, Jastrow recovering dynamical correlation) is presented explicitly as a design choice that enables diagnostic attribution of residual errors, not as a definitional identity that forces the numerical outcomes. No equation reduces a reported energy or accuracy figure to a quantity defined by the same fitted parameters, and the provided text contains no load-bearing self-citations or imported uniqueness theorems. The derivation chain therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The approach rests on the standard antisymmetry requirement for fermionic wave functions and on the assumption that a fixed nodal surface from the AGP can be used while the Jastrow recovers correlation; no new particles or forces are introduced.

axioms (2)
  • standard math Fermionic wave functions must be antisymmetric under particle exchange.
    Invoked when the AGP determinant is introduced to enforce antisymmetry.
  • domain assumption The nodal surface can be held fixed while dynamical correlation is added by the Jastrow factor.
    Central design choice stated in the abstract that enables separation of error sources.

pith-pipeline@v0.9.0 · 5742 in / 1361 out tokens · 47626 ms · 2026-05-20T15:06:22.569439+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

50 extracted references · 50 canonical work pages · 1 internal anchor

  1. [1]

    Its initiator extension further reduced the walker populations required for large determinant spaces [23]

    A complementary stochastic route to correlated elec- tronic structure is full configuration interaction quan- tum Monte Carlo (FCIQMC), introduced by Alavi and coworkers, which samples signed walker populations in Slater-determinant space and can reach finite-basis FCI- quality energies without storing the complete CI vec- tor [22]. Its initiator extensio...

  2. [2]

    85 86 87 88 89 90 [°] 2.020 2.018 2.016 2.014 2.012 2.010 Energy [Ha] NNAGP (45x45) + All-Body-NNJF (110x110) FermiNet FCI FIG

    are shown as reference values. 85 86 87 88 89 90 [°] 2.020 2.018 2.016 2.014 2.012 2.010 Energy [Ha] NNAGP (45x45) + All-Body-NNJF (110x110) FermiNet FCI FIG. 5.H 4: VMC energies of the NNWF for different H–H angles at fixed radiusR= 3.2843 Bohr. FermiNet and FCI energies from Ref. 27 are shown for comparison. and static correlation can be varied geometri...

  3. [3]

    J. C. Slater, The theory of complex spectra, Phys. Rev. 34, 1293 (1929)

  4. [4]

    Jastrow, Many-body problem with strong forces, Phys

    R. Jastrow, Many-body problem with strong forces, Phys. Rev.98, 1479 (1955)

  5. [5]

    A. C. Hurley, J. E. Lennard-Jones, and J. A. Pople, The molecular orbital theory of chemical valency xvi. a theory of paired-electrons in polyatomic molecules, Proc. R. Soc. Lond. A220, 446 (1953)

  6. [6]

    Casula and S

    M. Casula and S. Sorella, Geminal wave functions with jastrow correlation: A first application to atoms, J. Chem. Phys.119, 6500 (2003)

  7. [7]

    Genovese, A

    C. Genovese, A. Meninno, and S. Sorella, Assessing the accuracy of the jastrow antisymmetrized geminal power in the h4 model system, J. Chem. Phys.150, 084102 (2019)

  8. [8]

    Pauling, The nature of the chemical bond

    L. Pauling, The nature of the chemical bond. application of results obtained from the quantum mechanics and from a theory of paramagnetic susceptibility to the structure of molecules, J. Am. Chem. Soc.53, 1367 (1931)

  9. [9]

    P. W. Anderson, The resonating valence bond state in La2CuO4 and superconductivity, Science235, 1196 (1987)

  10. [10]

    P. W. Anderson, P. A. Lee, M. Randeria, T. M. Rice, N. Trivedi, and F. C. Zhang, The physics behind high- temperature superconducting cuprates: The plain vanilla version of RVB, J. Phys.: Condens. Matter16, R755 (2004)

  11. [11]

    Nakano, C

    K. Nakano, C. Attaccalite, M. Barborini, L. Capriotti, M. Casula, E. Coccia, M. Dagrada, C. Genovese, Y. Luo, G. Mazzola, A. Zen, and S. Sorella, Turborvb: A many- body toolkit for ab initio electronic simulations by quan- tum monte carlo, J. Chem. Phys.152, 204121 (2020)

  12. [12]

    Nakano, O

    K. Nakano, O. Kohul´ ak, A. Raghav, M. Casula, and S. Sorella, Turbogenius: Python suite for high- throughput calculations of ab initio quantum monte carlo methods, J. Chem. Phys.159, 224801 (2023)

  13. [13]

    Azadi, R

    S. Azadi, R. Singh, and T. D. K¨ uhne, Resonating valence bond quantum monte carlo: Application to the ozone molecule, Int. J. Quantum Chem.115, 1673 (2015)

  14. [14]

    Azadi, M

    S. Azadi, M. S. Bahramy, and T. D. K¨ uhne, Electron correlation effects and spin-liquid state in the herbert- smithite kagome lattice, Phys. Rev. Research7, 013165 (2025)

  15. [15]

    Azadi, W

    S. Azadi, W. M. C. Foulkes, and T. D. K¨ uhne, Quan- tum monte carlo study of high pressure solid molecular hydrogen, New J. Phys.15, 113005 (2013)

  16. [16]

    Azadi, G

    S. Azadi, G. H. Booth, and T. D. K¨ uhne, Equation of state of atomic solid hydrogen by stochastic many-body wave function methods, J. Chem. Phys.153, 204107 (2020)

  17. [17]

    Metropolis, A

    N. Metropolis, A. W. Rosenbluth, M. N. Rosenbluth, A. H. Teller, and E. Teller, Equation of state calcula- tions by fast computing machines, J. Chem. Phys.21, 1087 (1953)

  18. [18]

    W. K. Hastings, Monte Carlo Sampling Methods Using Markov Chains and Their Applications, Biometrika57, 97 (1970)

  19. [19]

    W. L. McMillan, Ground state of liquid he 4, Phys. Rev. 138, A442 (1965)

  20. [20]

    D. M. Ceperley and B. J. Alder, Quantum monte carlo, Science231, 555 (1986)

  21. [21]

    W. M. C. Foulkes, L. Mitas, R. J. Needs, and G. Ra- jagopal, Quantum monte carlo simulations of solids, Rev. Mod. Phys.73, 33 (2001)

  22. [22]

    R. J. Needs, M. D. Towler, N. D. Drummond, and P. Lopez Rios, Continuum variational and diffusion quan- tum monte carlo calculations, J. Phys.: Condens. Matter 22, 023201 (2010)

  23. [23]

    B. M. Austin, D. Y. Zubarev, and J. Lester, William A., Quantum monte carlo and related approaches, Chem. Rev.112, 263 (2012)

  24. [24]

    G. H. Booth, A. J. W. Thom, and A. Alavi, Fermion monte carlo without fixed nodes: A game of life, death, and annihilation in slater determinant space, J. Chem. Phys.131, 054106 (2009)

  25. [25]

    Cleland, G

    D. Cleland, G. H. Booth, and A. Alavi, Communications: Survival of the fittest: Accelerating convergence in full configuration-interaction quantum monte carlo, J. Chem. 10 Phys.132, 041103 (2010)

  26. [26]

    Hornik, Approximation capabilities of multilayer feed- forward networks, Neural Netw.4, 251 (1991)

    K. Hornik, Approximation capabilities of multilayer feed- forward networks, Neural Netw.4, 251 (1991)

  27. [27]

    J. Han, L. Zhang, and W. E, Solving many-electron schr¨ odinger equation using deep neural networks, J. Comput. Phys.399, 108929 (2019)

  28. [28]

    K. Choo, A. Mezzacapo, and G. Carleo, Fermionic neural-network states for ab-initio electronic structure, Nat. Commun.11, 2368 (2020)

  29. [29]

    D. Pfau, J. S. Spencer, A. G. D. G. Matthews, and W. M. C. Foulkes, Ab initio solution of the many-electron schr¨ odinger equation with deep neural networks, Phys. Rev. Research2, 033429 (2020)

  30. [30]

    Kessler, F

    J. Kessler, F. Calcavecchia, and T. D. K¨ uhne, Artifi- cial neural networks as trial wave functions for quantum monte carlo, Adv. Theory Simul.4, 2000269 (2021)

  31. [31]

    M. T. Entwistle, Z. Sch¨ atzle, P. A. Erdman, J. Hermann, and F. No´ e, Electronic excited states in deep variational Monte Carlo, Nat. Commun.14, 274 (2023)

  32. [32]

    X. Li, C. Fan, W. Ren, and J. Chen, Fermionic neu- ral network with effective core potential, Phys. Rev. Re- search4, 013021 (2022)

  33. [33]

    J. S. Spencer, D. Pfau, A. Botev, and W. M. C. Foulkes, Better, faster fermionic neural networks, arXiv:2011.07125 [physics.comp-ph] (2020)

  34. [34]

    Pescia, J

    G. Pescia, J. Han, A. Lovato, J. Lu, and G. Carleo, Neural-network quantum states for periodic systems in continuous space, Phys. Rev. Research4, 023138 (2022)

  35. [35]

    Wilson, S

    M. Wilson, S. Moroni, M. Holzmann, N. Gao, F. Wu- darski, T. Vegge, and A. Bhowmik, Wave function ansatz (but periodic) networks and the homogeneous electron gas, arXiv:2202.04622 [physics.chem-ph] (2022)

  36. [36]

    X. Li, Z. Li, and J. Chen, Ab initio calculation of real solids via neural network ansatz, arXiv:2203.15472 [physics.chem-ph] (2022)

  37. [37]

    Calcavecchia, F

    F. Calcavecchia, F. Pederiva, M. H. Kalos, and T. D. K¨ uhne, Sign problem of the fermionic shadow wave func- tion, Phys. Rev. E90, 053304 (2014)

  38. [38]

    Calcavecchia and T

    F. Calcavecchia and T. D. K¨ uhne, On fermionic shadow wave functions for strongly correlated multi-reference systems based on a single slater determinant, EPL110, 20011 (2015)

  39. [39]

    Calcavecchia and T

    F. Calcavecchia and T. D. K¨ uhne, Metal-insulator transi- tion of solid hydrogen by the antisymmetric shadow wave function, Z. Naturforsch. A73, 845 (2018)

  40. [40]

    D. P. Kingma and J. Ba, Adam: A Method for Stochastic Optimization, arXiv:1412.6980 [cs] (2014)

  41. [41]

    Flyvbjerg and H

    H. Flyvbjerg and H. G. Petersen, Error estimates on av- erages of correlated data, J. Chem. Phys.91, 461 (1989)

  42. [42]

    Jonsson, Standard error estimation by an automated blocking method, Phys

    M. Jonsson, Standard error estimation by an automated blocking method, Phys. Rev. E98, 043304 (2018)

  43. [43]

    J. B. Anderson, Quantum chemistry by random walk: H4 square, Int. J. Quantum Chem.15, 109 (1979)

  44. [44]

    Gasperich, M

    K. Gasperich, M. Deible, and K. D. Jordan, H4: A model system for assessing the performance of diffusion monte carlo calculations using a single slater determinant trial function, J. Chem. Phys.147, 074106 (2017)

  45. [45]

    Dugas, Y

    C. Dugas, Y. Bengio, F. B´ elisle, C. Nadeau, and R. Gar- cia, Incorporating second-order functional knowledge for better option pricing, inAdv. Neural Inf. Process. Syst. (2001) pp. 472–478

  46. [46]

    Pachucki, Born-oppenheimer potential for h 2, Phys

    K. Pachucki, Born-oppenheimer potential for h 2, Phys. Rev. A82, 032509 (2010)

  47. [47]

    R. J. Bartlett and M. Musia l, Coupled-cluster theory in quantum chemistry, Rev. Mod. Phys.79, 291 (2007)

  48. [48]

    Sorella, M

    S. Sorella, M. Casula, and D. Rocca, Weak binding be- tween two aromatic rings: Feeling the van der waals attraction by quantum monte carlo methods, J. Chem. Phys.127, 014105 (2007)

  49. [49]

    Van Voorhis and M

    T. Van Voorhis and M. Head-Gordon, Benchmark vari- ational coupled cluster doubles results, J. Chem. Phys. 113, 8873 (2000)

  50. [50]

    H. G. A. Burton and A. J. W. Thom, Holomorphic hartree–fock theory: An inherently multireference ap- proach, J. Chem. Theory Comput.12, 167 (2016)