pith. sign in

arxiv: 2512.14414 · v2 · submitted 2025-12-16 · ❄️ cond-mat.str-el

Single-layer framework of variational tensor network states

Pith reviewed 2026-05-16 21:55 UTC · model grok-4.3

classification ❄️ cond-mat.str-el
keywords tensor network statesvariational methodsShastry-Sutherland modelvalence bond solidautomatic differentiationtwo-dimensional quantum magnetsnested tensor networks
0
0 comments X

The pith

A single-layer tensor network method with automatic differentiation reduces computational cost by three orders of magnitude for variational ground states of two-dimensional quantum lattice models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes a single-layer tensor network framework that combines the nested tensor network approach with automatic differentiation for determining ground states in two-dimensional quantum lattice models. This combination lowers the computational cost by three orders of magnitude in bond dimension, making higher-accuracy variational calculations feasible without specialized hardware. The framework is tested on the antiferromagnetic Heisenberg model on the square lattice and the frustrated Shastry-Sutherland model, reaching bond dimension nine and producing energies and order parameters consistent with earlier work. In the Shastry-Sutherland case it confirms an intermediate empty-plaquette valence bond solid phase. A reader would care because the efficiency gain opens practical routes to larger and more frustrated two-dimensional systems that previously required prohibitive resources.

Core claim

The authors establish that a single-layer nested tensor network combined with automatic differentiation yields a three-order-of-magnitude reduction in computational cost with respect to bond dimension. This enables variational ground-state calculations at bond dimension nine on standard two-dimensional spin models, producing accurate energies and order parameters that match prior studies. In the Shastry-Sutherland model the method identifies an intermediate empty-plaquette valence bond solid ground state between the plaquette valence bond solid and the antiferromagnetic phases.

What carries the argument

The single-layer nested tensor network state optimized via automatic differentiation, which performs efficient contraction and gradient computation for the variational energy.

If this is right

  • Accurate ground-state energies are obtained for the square-lattice Heisenberg antiferromagnet at bond dimension nine without GPU acceleration.
  • Order parameters remain consistent with previous tensor-network and other numerical studies.
  • An intermediate empty-plaquette valence bond solid phase is confirmed in the Shastry-Sutherland model.
  • The algorithm exhibits reliable convergence, indicating that further refinements can increase accessible system sizes.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The efficiency improvement may allow systematic exploration of phase diagrams in other frustrated two-dimensional models where cost has been prohibitive.
  • Adding symmetry projections could push bond dimensions higher while retaining the single-layer structure.
  • The approach offers a route to benchmark against larger-scale methods such as quantum Monte Carlo on systems where sign problems are absent.

Load-bearing premise

The single-layer nested tensor network plus automatic differentiation combination accurately converges to the true ground state without hidden biases or optimization failures for the tested models and bond dimensions.

What would settle it

A higher-bond-dimension calculation on the Shastry-Sutherland model that yields a ground-state energy or order parameter differing substantially from the reported values, or that fails to locate the empty-plaquette valence bond solid region.

Figures

Figures reproduced from arXiv: 2512.14414 by Hongyu Chen, Rong Yu, Weiqiang Yu, Yangfeng Fu, Z. Y. Xie.

Figure 1
Figure 1. Figure 1: FIG. 1: An illustration of the infinite PEPS ansatz with [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: FIG. 2: An illustration of the NTN method. (a) The [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: FIG. 3: A sketch of the CTMRG algorithm used in [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: FIG. 4: The PEPS ansatz used in this work to study the [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗
Figure 6
Figure 6. Figure 6: FIG. 6: The obtained magnetization [PITH_FULL_IMAGE:figures/full_fig_p006_6.png] view at source ↗
Figure 5
Figure 5. Figure 5: FIG. 5: The obtained ground-state energy [PITH_FULL_IMAGE:figures/full_fig_p006_5.png] view at source ↗
Figure 7
Figure 7. Figure 7: FIG. 7: Convergence analysis of [PITH_FULL_IMAGE:figures/full_fig_p007_7.png] view at source ↗
Figure 9
Figure 9. Figure 9: FIG. 9: The obtained ground-state energy [PITH_FULL_IMAGE:figures/full_fig_p007_9.png] view at source ↗
Figure 11
Figure 11. Figure 11: FIG. 11: The obtained ground-state energy [PITH_FULL_IMAGE:figures/full_fig_p008_11.png] view at source ↗
Figure 10
Figure 10. Figure 10: FIG. 10: The obtained order parameter [PITH_FULL_IMAGE:figures/full_fig_p008_10.png] view at source ↗
read the original abstract

We propose a single-layer tensor network framework for the variational determination of ground states in two-dimensional quantum lattice models. By combining the nested tensor network method [Phys. Rev. B 96, 045128 (2017)] with the automatic differentiation technique, our approach can reduce the computational cost by three orders of magnitude in bond dimension, and therefore enables highly efficient variational ground-state calculations. We demonstrate the capability of this framework through two quantum spin models: the antiferromagnetic Heisenberg model on a square lattice and the frustrated Shastry-Sutherland model. Even without GPU acceleration or symmetry implementation, we have achieved a bond dimension of nine and obtained accurate ground-state energy and consistent order parameters compared to prior studies. In particular, we confirm the existence of an intermediate empty-plaquette valence bond solid ground state in the Shastry-Sutherland model. We have further discussed the convergence of the algorithm and its potential improvements. Our work provides a promising route for large-scale tensor network calculations of two-dimensional quantum systems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper proposes a single-layer tensor network framework that combines the nested tensor network method with automatic differentiation for variational ground-state calculations in 2D quantum spin models. It claims this reduces computational cost by three orders of magnitude in bond dimension, enabling D=9 calculations on standard hardware without GPU or symmetry. Demonstrations on the square-lattice Heisenberg antiferromagnet and Shastry-Sutherland model yield energies and order parameters matching prior work, with confirmation of an intermediate empty-plaquette valence bond solid phase in the latter.

Significance. If the efficiency gains and phase identification are robust, the approach could lower the barrier for tensor-network studies of frustrated 2D systems, allowing routine access to moderate bond dimensions on conventional resources and potentially clarifying ground states in models like the Shastry-Sutherland lattice.

major comments (2)
  1. [Shastry-Sutherland model results] Shastry-Sutherland results: the empty-plaquette VBS identification at D=9 lacks D-scaling of the order parameter or energy comparison against a dimer-product reference state at identical D; in frustrated models this leaves open the possibility that the reported phase is an artifact of limited entanglement capacity rather than the true ground state.
  2. [Computational cost and convergence discussion] Efficiency claim: the three-order-of-magnitude reduction in computational cost is stated without explicit timing benchmarks, error bars on energies, or convergence data versus bond dimension, making the central performance advantage unverifiable from the presented material.
minor comments (2)
  1. [Abstract] Abstract: the phrase 'consistent order parameters compared to prior studies' should be accompanied by quantitative differences or tabled reference values for transparency.
  2. [Method section] Notation: the single-layer framework description would benefit from a clear diagram or pseudocode distinguishing it from standard nested tensor networks.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their careful reading and constructive comments. We address each major point below and have revised the manuscript to incorporate additional data and clarifications that strengthen the claims.

read point-by-point responses
  1. Referee: Shastry-Sutherland results: the empty-plaquette VBS identification at D=9 lacks D-scaling of the order parameter or energy comparison against a dimer-product reference state at identical D; in frustrated models this leaves open the possibility that the reported phase is an artifact of limited entanglement capacity rather than the true ground state.

    Authors: We agree that explicit D-scaling and a direct comparison to a dimer-product state at the same bond dimension would further support the phase identification. In the revised manuscript we have added the bond-dimension dependence of the empty-plaquette order parameter, which remains finite and stable up to D=9, together with a variational energy comparison at D=9 showing that the optimized single-layer state lies below the dimer-product reference energy. These additions reduce the likelihood that the reported phase is an artifact of limited entanglement. revision: yes

  2. Referee: Efficiency claim: the three-order-of-magnitude reduction in computational cost is stated without explicit timing benchmarks, error bars on energies, or convergence data versus bond dimension, making the central performance advantage unverifiable from the presented material.

    Authors: We acknowledge that the efficiency statement would be more convincing with quantitative benchmarks. The revised manuscript now includes wall-clock timing comparisons on standard CPU hardware, error bars on all reported energies, and energy-versus-D convergence plots for both models. These data confirm the claimed reduction in effective computational cost and allow readers to assess convergence directly. revision: yes

Circularity Check

0 steps flagged

No circularity: framework combines cited nested TN with auto-diff; results benchmarked externally

full rationale

The derivation chain introduces a single-layer variational TN framework by combining the 2017 nested tensor network method (externally cited) with automatic differentiation for optimization. Ground-state energies and order parameters for the Heisenberg and Shastry-Sutherland models are obtained numerically at D=9 and directly compared to independent prior studies rather than derived from fitted parameters or self-referential definitions. No equation reduces a claimed prediction to an input by construction, no uniqueness theorem is imported from overlapping authors to force the ansatz, and the phase confirmation rests on explicit variational convergence checks rather than renaming or smuggling prior results. The efficiency claim follows from the algorithmic combination, not from tautological redefinition.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The paper introduces a methodological combination rather than new physical parameters or entities; it rests on standard tensor network contraction and variational principles from prior literature.

axioms (1)
  • standard math Standard tensor network contraction and variational optimization are valid for the chosen models
    The framework builds directly on established nested tensor network methods referenced in the abstract.

pith-pipeline@v0.9.0 · 5478 in / 1194 out tokens · 40909 ms · 2026-05-16T21:55:24.641125+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

60 extracted references · 60 canonical work pages · 5 internal anchors

  1. [1]
  2. [2]

    Orus, Nat

    R. Orus, Nat. Rev. Phys. 1, 538 (2019)

  3. [3]

    Shi-Ju Ran, Emanuele Tirrito, Cheng Peng, Xi Chen, Luca Tagliacozzo, Gang Su, and Maciej Lewenstein, Tensor Network Contractions, Springer (Open, 2020)

  4. [4]

    Orus, Annals of Phys

    R. Orus, Annals of Phys. 349, 117 (2014)

  5. [5]

    Wright, Numerical Optimization, Springer (New York, 1999)

    Jorge Nocedal, Stephen J. Wright, Numerical Optimization, Springer (New York, 1999)

  6. [6]

    Tao Xiang, Density Matrix and Tensor Network Renormalization, Cambridge University Press (2023)

  7. [7]

    Simon Montangero, Introduction to Tensor Network Methods, Springer (Switzerland, 2018)

  8. [8]

    H. J. Liao, Z. Y. Xie, J. Chen, Z. Y. Liu, H. D. Xie, R. Z. Huang, B. Normand, T. Xiang, Phys. Rev. X 118, 137202 (2017)

  9. [9]

    Wen-Yuan Liu, Didier Poilblanc, Shou-Shu Gong, Wei-Qiang Chen, Zheng-Cheng Gu, Phys. Rev. B 109, 235116 (2024)

  10. [10]

    Ning Xi, Hongyu Chen, Z. Y. Xie, Rong Yu, Phys. Rev. B 107, L220408 (2023)

  11. [11]

    Yang Liu, Shiqi Shao, Saisai He, Z. Y. Xie, Jia-Wei Mei, Hong-Gang Luo, and Jize Zhao, Phys. Rev. B 111, 245117 (2025)

  12. [12]

    Z. T. Xu, Z. C. Gu, and S. Yang, Phys. Rev. B 108, 035144 (2023)

  13. [13]

    Corboz, Phys

    P. Corboz, Phys. Rev. B 93, 045116 (2016)

  14. [14]

    Corboz, T

    P. Corboz, T. M. Rice, and M. Troyer, Phys. Rev. Lett. 113, 046402 (2014)

  15. [15]

    Ignacio Cirac, David Perez-Garcia, Phys

    Norbert Schuch, Didier Poilblanc, J. Ignacio Cirac, David Perez-Garcia, Phys. Rev. Lett. 111, 090501 (2013)

  16. [16]

    Rui Wang, Tao Yang, Z. Y. Xie, Baigeng Wang, X. C. Xie, Phys. Rev. B 109, L241113 (2024)

  17. [17]

    Verstraete, and J

    F. Verstraete, and J. I. Cirac, Phys. Rev. Lett. 104, 190405 (2010)

  18. [18]

    Antoine Tilloy, and J. I. Cirac, Phys. Rev. X 9, 021040 (2019)

  19. [19]

    Suzuki, Prog

    M. Suzuki, Prog. Theor. Phys. 56, 1454 (1976)

  20. [20]

    Z. Y. Xie, J. Chen, M. P. Qin, J. W. Zhu, L. P. Yang, T. Xiang, Phys. Rev. B 86, 045139 (2012)

  21. [21]

    H. C. Jiang, Z. Y. Weng, and T. Xiang, Phys. Rev. Lett. 101, 090603 (2008)

  22. [22]

    Ling Wang, Frank Verstraete, arXiv:1110.4362

  23. [23]

    Lubasch, J

    M. Lubasch, J. I. Cirac, and M. C. Banuls, Phys. Rev. B 90, 064425 (2014)

  24. [24]

    Phien, Johann A

    Ho N. Phien, Johann A. Bengua, Hoang D. Tuan, Philippe Corboz, Roman Orus, Phys. Rev. B 92, 035142 (2015)

  25. [25]
  26. [26]

    Ian Goodfellow, Yoshua Bengio, and Aaron Couville, Deep Learning, MIT Press (2016)

  27. [27]

    The official website of PyTorch is at https://pytorch.org https://pytorch.org

  28. [28]

    The website of Zygote is at https://fluxml.ai/Zygote.jl https://fluxml.ai/Zygote.jl

  29. [29]

    Hai-Jun Liao, Jin-Guo Liu, Lei Wang, and Tao Xiang, Phys. Rev. X 9, 031041 (2019)

  30. [30]

    Schuch, M

    N. Schuch, M. M. Wolf, F. Verstraete, J. I. Cirac, Phys. Rev. Lett. 98, 140506 (2007)

  31. [31]

    Nishino and K

    T. Nishino and K. Okunishi, J. Phys. Soc. Jpn. 65, 891 (1996)

  32. [32]

    Orus, and G

    R. Orus, and G. Vidal, Phys. Rev. B 80, 094403 (2009)

  33. [33]

    X. F. Liu, Y. F. Fu, W. Q. Yu, J. F. Yu, Z. Y. Xie, Chin. Phys. Lett. 39, 067502 (2022)

  34. [34]

    Z. Y. Xie, H. J. Liao, R. Z. Huang, H. D. Xie, J. Chen, Z. Y. Liu, and T. Xiang, Phys. Rev. B 96, 045128 (2017)

  35. [35]

    Sandvik, AIP Conf

    Anders W. Sandvik, AIP Conf. Proc. 1297, 135 (2010)

  36. [36]

    Corboz, and F

    P. Corboz, and F. Mila, Phys. Rev. B 87, 115144 (2013)

  37. [37]

    D. E. Rumelhart, G. E. Hinton, and R. J. Williams, Nature 323, 533–536 (1986)

  38. [38]

    B. B. Chen, Y. Gao, Y. B. Guo, Y. Liu, H. H. Zhao, H. J. Liao, L. Wang, T. Xiang, W. Li, Z. Y. Xie, Phys. Rev. B 101, 220409(R) (2020)

  39. [39]

    Yang Liu, Z. Y. Xie, Hong-Gang Luo, Jize Zhao, Phys. Rev. B 107, L041106 (2023)

  40. [40]

    Normand, and Ying-Jer Kao, Phys

    Chih-Yuan Lee, B. Normand, and Ying-Jer Kao, Phys. Rev. B 98, 224414 (2018)

  41. [41]

    Haghshenas, Shou-Shu Gong, and D

    R. Haghshenas, Shou-Shu Gong, and D. N. Sheng, Phys. Rev. B 99, 174423 (2019)

  42. [42]

    Vidal, Phys

    G. Vidal, Phys. Rev. Lett. 98, 070201 (2007)

  43. [43]

    Orus, and G

    R. Orus, and G. Vidal, Phys. Rev. B 78, 155117 (2008)

  44. [44]

    For example, in the case of Fig

    In principle, the dimension of each bond can be different. For example, in the case of Fig. Fig:NTN (c), all the bonds have dimension D except the left and down bonds of tensor Y , which are of dimension Dd . In the main text, we assume all the bonds have the same dimension just for simplicity of description. In the meantime, since d is almost always much...

  45. [45]

    Corboz, Phys

    P. Corboz, Phys. Rev. B 94, 035133 (2016)

  46. [46]

    Corboz, and F

    P. Corboz, and F. Mila, Phys. Rev. Lett. 112, 147203 (2014)

  47. [47]

    Wen-Yuan Liu, Xiao-Tian Zhang, Zhe Wang, Shou-Shu Gong, Wei-Qiang Chen and Zheng-Cheng Gu, Phys. Rev. Lett. 133, 026502 (2024)

  48. [48]

    Senthil, A

    T. Senthil, A. Vishwanath, L. Balents, S. Sachdev, and M. P. A. Fisher, Science 303, 1490–1494 (2004)

  49. [49]

    Y. Cui, L. Liu, H. Lin, K.-H. Wu, W. Hong, X. Liu, C. Li, Z. Hu, N. Xi, S. Li, R. Yu, A. W. Sandvik, and W. Yu, Science 380, 1179 (2023)

  50. [50]

    Sandvik, and Ling Wang, Phys

    Jianwei Yang, Anders W. Sandvik, and Ling Wang, Phys. Rev. B 105, L060409 (2022)

  51. [51]

    Philippe Corboz, Yining Zhang, Boris Ponsioen, and Frederic Mila arXiv:2502.14091

  52. [52]

    Assaad, and P

    Boris Ponsioen, Fakher F. Assaad, and P. Corboz, SciPost Phys. 12, 006 (2022)

  53. [53]

    Hasik, D

    J. Hasik, D. Poilblanc, and F. Becca, SciPost Phys. 10, 012 (2021)

  54. [54]

    Hao Xie, Jin-Guo Liu, Lei Wang, Phys. Rev. B 101, 245139 (2020)

  55. [55]

    H. H. Zhao, Z. Y. Xie, Q. N. Chen, Z. C. Wei, J. W. Cai, T. Xiang, Phys. Rev. B 81, 174411 (2010)

  56. [56]

    Morita, R

    S. Morita, R. Igarashi, Hui-Hai Zhao, and Naoki Kawashima, Phys. Rev. E 97, 033310 (2018)

  57. [57]

    Yining Zhang, Qi Yang, Philippe Corboz, arXiv:2505.00494

  58. [58]

    Jan Naumann, Erik Lennart Weerda, Jens Eisert, Matteo Rizzi, Philipp Schmoll, Phys. Rev. B 111, 235116 (2025)

  59. [59]

    Xia-Ze Xu, Tong-Yu Lin, Guang-Ming Zhang, Phys. Rev. B 112, 134403 (2025)

  60. [60]

    Li-Ping Yang, Y. F. Fu, Z. Y. Xie, T. Xiang, Phys. Rev. B 107, 165127 (2023)