Exploring the performance of superposition of product states: from 1D to 3D quantum spin systems
Pith reviewed 2026-05-17 23:41 UTC · model grok-4.3
The pith
The superposition of product states ansatz reaches high accuracy for ground state searches in tilted Ising models from one to three dimensions.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The superposition-of-product-states ansatz, structurally related to canonical polyadic tensor decomposition, attains high accuracy for ground state search in tilted Ising models from one to three dimensions with short- and long-range interactions as well as on a random network.
What carries the argument
The superposition-of-product-states (SPS) ansatz, a variational framework that expresses states as sums of product states, which carries the argument by remaining independent of lattice geometry while permitting direct and accurate extraction of observables.
If this is right
- SPS enables ground state searches in higher-dimensional and irregular geometries where tensor networks lose expressive power.
- The ansatz supports direct computation of observables without the approximation steps required by compressed representations.
- Parallelizability allows scaling to larger system sizes than sequential tensor network contractions.
- Analytical shortcuts reduce the cost of certain expectation value calculations in Ising-type models.
Where Pith is reading between the lines
- SPS could serve as a complementary tool to tensor networks, used first to seed initial states or to handle non-local interactions.
- The same variational form might be tested on other spin Hamiltonians such as Heisenberg or XY models to map out its range of applicability.
- Because the representation is a sum of product states, one could derive explicit bounds on how many terms are needed for a given accuracy in long-range systems.
Load-bearing premise
The SPS ansatz stays expressive enough and trainable for the chosen models without hitting the expressive power limits that affect other variational methods in higher dimensions.
What would settle it
Run exact diagonalization on a small three-dimensional tilted Ising instance with known ground state energy, optimize SPS with increasing numbers of terms, and check whether the relative energy error fails to drop below a few percent.
Figures
read the original abstract
Tensor networks (TNs) are one of the best available tools to study many-body quantum systems. TNs are particularly suitable for one-dimensional local Hamiltonians, while their performance for generic geometries is mainly limited by two aspects: the limitation in expressive power and the approximate extraction of information. Here we investigate the performance of superposition-of-product-states (SPS) ansatz, a variational framework structurally related to canonical polyadic tensor decomposition. The ansatz does not compress information as effectively as tensor networks, but it has the advantages (i) of allowing accurate extraction of information, (ii) of being structurally independent of the geometry of the system, (iii) of being readily parallelizable, and (iv) of allowing analytical shortcuts. We first study the typical properties of the SPS ansatz for spin-$1/2$ systems, including its entanglement entropy, and its trainability. We then use this ansatz for ground state search in tilted Ising models -- including one-dimensional and three-dimensional with short- and long-range interaction, and a random network -- demonstrating that SPS can attain high accuracy.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript explores the superposition-of-product-states (SPS) variational ansatz, which is structurally related to canonical polyadic tensor decomposition. It first characterizes typical properties of the ansatz for spin-1/2 systems, including entanglement entropy and trainability, then applies it to ground-state search on tilted Ising models in one and three dimensions (short- and long-range interactions) as well as on a random network, reporting that high accuracy is attained.
Significance. If the numerical results hold, the geometry-independent, parallelizable nature of SPS together with its analytical shortcuts offers a practical alternative to tensor networks for systems whose geometry limits bond-dimension scaling. Explicit study of entanglement entropy and trainability provides concrete information on the ansatz’s limitations that is useful for the community.
major comments (2)
- [§5] §5 (3D long-range tilted Ising results): the central claim that SPS attains high accuracy rests on numerical optimization, yet no scaling of the minimal rank (number of product states) with linear size or interaction range is reported; without this, it is impossible to judge whether the required superposition remains tractable in the regime advertised as geometry-independent.
- [§4.2] §4.2 (trainability analysis): the study of barren-plateaus or gradient variance is performed only for small system sizes; the extrapolation to the 3D and random-network instances used in §5 is not justified by any scaling argument or additional data.
minor comments (2)
- [Abstract] The abstract states “high accuracy” without quoting a single quantitative figure (energy error, fidelity, or overlap); these numbers should appear already in the abstract.
- [Figures 4–6] Figure captions for the 3D and network results should explicitly state the rank used and the optimization method (e.g., number of SPS terms, optimizer, convergence criterion).
Simulated Author's Rebuttal
We thank the referee for their detailed and constructive feedback on our manuscript. We address each of the major comments below and have revised the manuscript accordingly to improve clarity and strengthen our claims.
read point-by-point responses
-
Referee: [§5] §5 (3D long-range tilted Ising results): the central claim that SPS attains high accuracy rests on numerical optimization, yet no scaling of the minimal rank (number of product states) with linear size or interaction range is reported; without this, it is impossible to judge whether the required superposition remains tractable in the regime advertised as geometry-independent.
Authors: We agree that providing scaling information on the minimal rank is crucial for assessing the tractability of the SPS ansatz in larger systems. Although the current results focus on demonstrating high accuracy for the specific system sizes studied across different geometries, we have now added a new figure and accompanying discussion in the revised manuscript showing the dependence of the required rank on system size for the 1D case with both short- and long-range interactions. This analysis suggests that the rank scales favorably for the interaction ranges considered, supporting the geometry-independent applicability. We have also included a brief remark on the expected scaling in 3D based on these observations. revision: yes
-
Referee: [§4.2] §4.2 (trainability analysis): the study of barren-plateaus or gradient variance is performed only for small system sizes; the extrapolation to the 3D and random-network instances used in §5 is not justified by any scaling argument or additional data.
Authors: We acknowledge that the trainability study was limited to small system sizes. To better justify the extrapolation, we have incorporated additional numerical data for larger 1D systems and provided a scaling argument based on the structure of the SPS ansatz, which indicates that gradient variances remain manageable due to the parallelizable optimization procedure. We have updated §4.2 to include these elements and clarified the limitations for very large 3D systems, noting that the absence of severe barren plateaus observed in small systems persists in the regimes explored in §5. revision: yes
Circularity Check
No significant circularity; results from direct numerical optimization on concrete models
full rationale
The paper introduces the SPS ansatz, analyzes its typical properties such as entanglement entropy and trainability for spin-1/2 systems, and then applies it via variational optimization to ground-state searches in tilted Ising models across 1D, 3D short-range, long-range, and random-network geometries. All reported accuracies and performance metrics derive from explicit numerical minimizations on specific Hamiltonians rather than from any self-referential fitting, ansatz smuggling via citation, or reduction of a central claim to a prior self-citation. No load-bearing uniqueness theorems or renamings of known results appear; the geometry-independent advantages are demonstrated empirically without circular redefinition of inputs as outputs.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The variational optimization of SPS parameters can reach states with high overlap to the true ground state for the chosen Ising models.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/AlexanderDuality.leanalexander_duality_circle_linking unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The SPS ansatz ... is a rank-M canonical polyadic (CP) decomposition ... expressive power ... controlled by the decomposition rank M
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We first study ... entanglement entropy, and its trainability. We then use this ansatz for ground state search in tilted Ising models
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Distributions of 10,000 SPS normsZoverDfor a system sizeL= 20at different values ofM
Statistics of the norm To further understand the global structure of the SPS states in Hilbert space, we first examine its normalization factorZto find how stable the ensemble is under random 0 50 100 150 200 Z 0.0 0.2 0.4 0.6 0.8 Frequency M 2 4 8 16 32 64 128 256 512 0 5 100.0 0.5 Figure 1. Distributions of 10,000 SPS normsZoverDfor a system sizeL= 20at...
-
[2]
Statistics of local observables We now consider the statistics of the expectation value of a single-site Pauli-Xoperator at sitel,σ x l , ⟨σx l ⟩= 1 Z MX m,m′=1 cmcm′ sin θm l +θ m′ l LY k̸=l cos θm k −θ m′ k , (4) derived in App. A3. It is straightforward to verify that the expectation value⟨σ x l ⟩vanishes on average, since ED2 θ [sin(θm l +θ m′ l )] = ...
-
[3]
Typical entanglement entropy The evaluation of the entanglement entropy of the SPS ansatz serves as a key indicator of theexpress- ive powerof the ansatz. Entanglement reflects the degree of quantum correlation that can be represen- ted between subsystems—higher typical entropy indic- ates that the ansatz can capture more complex and cor- related quantum ...
-
[4]
show that the 2-Rényi entropy for a single biparti- tion of an open chain, the typical half-chain entropy sat- isfyS 2 ≈min lnχ, L 2 ln 2 +O(χ −1). This distinction highlights that while random MPS can reach the entan- glement capacity allowed by their bond dimension, the SPS ansatz is inherently more restricted, which directly affects their expressive po...
-
[5]
This behavior indicates that although many subspaces appear flat and could resemble barren plat- eaus, the relevant local subspace associated with the ob- servable retains finite gradients, thereby ensuring that optimization remains feasible in practice. We remark that, for the random nature of the SPS analyzed, the same results occur also when one consid...
-
[6]
L. D. Landau and E. M. Lifshitz,Quantum Mechanics II: Advanced Topics(Elsevier, 2015) chapter on Many-Body Systems
work page 2015
-
[7]
M. A. Nielsen and I. L. Chuang,Quantum Computation and Quantum Information(Cambridge University Press, 2010)
work page 2010
-
[8]
W. M. Foulkes, L. Mitas, R. Needs, and G. Rajagopal, Quantum monte carlo simulations of solids, Reviews of Modern Physics73, 33 (2001)
work page 2001
-
[9]
G. Carleo and M. Troyer, Solving the quantum many- body problem with artificial neural networks, Science 355, 602 (2017)
work page 2017
-
[10]
D. Wu, R. Rossi, F. Vicentini, N. Astrakhantsev,et al., Variational benchmarks for quantum many-body prob- lems, Science386, 296 (2024)
work page 2024
-
[11]
U.Schollwöck,Thedensity-matrixrenormalizationgroup in the age of matrix product states, Annals of Physics 326, 96 (2011)
work page 2011
-
[12]
R. Orús, A practical introduction to tensor networks: Matrix product states and projected entangled pair states, Annals of physics349, 117 (2014)
work page 2014
-
[13]
Renormalization algorithms for Quantum-Many Body Systems in two and higher dimensions
F. Verstraete and J. I. Cirac, Renormalization algorithms for quantum-many body systems in two and higher di- mensions, arXiv preprint cond-mat/0407066 (2004)
work page internal anchor Pith review Pith/arXiv arXiv 2004
-
[14]
T. Nishino and K. Okunishi, Corner transfer matrix renormalization group method, Journal of the Physical Society of Japan65, 891 (1996)
work page 1996
-
[15]
R. Orús and G. Vidal, Simulation of two-dimensional quantum systems on an infinite lattice revisited: Corner transfer matrix for tensor contraction, Physical Review B80, 094403 (2009)
work page 2009
- [16]
-
[17]
Efficient MPS algorithm for periodic boundary conditions and applications
M. Weyrauch and M. V. Rakov, Efficient mps algorithm for periodic boundary conditions and applications, arXiv preprint arXiv:1303.1333 (2013)
work page internal anchor Pith review Pith/arXiv arXiv 2013
-
[18]
E. Robeva and A. Seigal, Duality of graphical models and tensor networks, Information and Inference: A Journal of the IMA8, 273 (2019)
work page 2019
-
[19]
R. Alkabetz and I. Arad, Tensor networks contraction and the belief propagation algorithm, Physical Review Research3, 023073 (2021)
work page 2021
-
[20]
S. Sahu and B. Swingle, Efficient tensor network simu- lation of quantum many-body physics on sparse graphs, arXiv preprint arXiv:2206.04701 (2022)
-
[21]
C. Guo, D. Poletti, and I. Arad, Block belief propagation algorithm for two-dimensional tensor networks, Physical Review B108, 125111 (2023)
work page 2023
-
[22]
A. W. Sandvik and G. Vidal, Variational quantum monte carlo simulations with tensor-network states, Physical re- view letters99, 220602 (2007)
work page 2007
-
[23]
L. Wang, I. Pižorn, and F. Verstraete, Monte carlo simu- lation with tensor network states, Physical Review B83, 9 134421 (2011)
work page 2011
-
[24]
A. J. Ferris and G. Vidal, Perfect sampling with unitary tensor networks, Physical Review B85, 165146 (2012)
work page 2012
-
[25]
D. S. Rokhsar and B. G. Kotliar, Gutzwiller projection for bosons, Phys. Rev. B44, 10328 (1991)
work page 1991
- [26]
- [27]
-
[28]
F. L. Hitchcock, The expression of a tensor or a poly- adic as a sum of products, Journal of Mathematics and Physics6, 164 (1927)
work page 1927
-
[29]
J. D. Carroll and J.-J. Chang, Analysis of individual dif- ferences in multidimensional scaling via an n-way gener- alization of "eckart-young" decomposition, Psychomet- rika35, 283 (1970)
work page 1970
-
[30]
R. A. Harshman, Foundations of the parafac procedure: Models and conditions for an “explanatory” multi-modal factor analysis, UCLA working papers in phonetics16, 84 (1970)
work page 1970
- [31]
-
[32]
B. Collins, C. E. González-Guillén, and D. Pérez-García, Matrix product states, random matrix theory and the principleofmaximumentropy,CommunicationsinMath- ematical Physics320, 663 (2013)
work page 2013
-
[33]
J. Haferkamp, C. Bertoni, I. Roth, and J. Eisert, Emer- gent statistical mechanics from properties of disordered random matrix product states, PRX Quantum2, 040308 (2021)
work page 2021
-
[34]
J. R. McClean, S. Boixo, V. N. Smelyanskiy, R. Bab- bush, and H. Neven, Barren plateaus in quantum neural network training landscapes, Nature communications9, 4812 (2018)
work page 2018
- [35]
-
[36]
M. Larocca, S. Thanasilp, S. Wang, K. Sharma, J. Bia- monte, P. J. Coles, L. Cincio, J. R. McClean, Z. Holmes, and M. Cerezo, Barren plateaus in variational quantum computing, Nature Reviews Physics , 1 (2025)
work page 2025
-
[37]
K. Srimahajariyapong, S. Thanasilp, and T. Chotibut, Connecting phases of matter to the flatness of the loss landscape in analog variational quantum algorithms, arXiv preprint arXiv:2506.13865 (2025)
-
[38]
T. Barthel and Q. Miao, Absence of barren plateaus and scaling of gradients in the energy optimization of isomet- ric tensor network states, Communications in Mathem- atical Physics406, 86 (2025)
work page 2025
-
[39]
Decoupled Weight Decay Regularization
I. Loshchilov and F. Hutter, Decoupled weight decay reg- ularization, arXiv preprint arXiv:1711.05101 (2017)
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[40]
J. Hauschild, J. Unfried, S. Anand, B. Andrews, M. Bintz, U. Borla, S. Divic, M. Drescher, J. Geiger, M. Hefel, K. Hémery, W. Kadow, J. Kemp, N. Kirchner, V. S. Liu, G. Möller, D. Parker, M. Rader, A. Romen, S. Scalet, L. Schoonderwoerd, M. Schulz, T. Soejima, P. Thoma, Y. Wu, P. Zechmann, L. Zweng, R. S. K. Mong, M. P. Zaletel, and F. Pollmann, Tensor ne...
work page 2024
-
[41]
The product structure of MPS-under-permutations
M. Florido-Llinàs, Á. M. Alhambra, R. Trivedi, N. Schuch, D. Pérez-García, and J. I. Cirac, The product structure of mps-under-permutations, arXiv preprint arXiv:2410.19541 (2024). Appendix A: Analytical formulations A notable advantage of the SPS ansatz is its ana- lytic tractability. In contrast to many variational families that require heavy numerical ...
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[42]
Mean and variance of an SPS norm Starting from the SPS normZ, each parameter is inde- pendent, allowing us to compute the expectation values of individual terms separately. For the diagonal term, since the probability density function ofc m is uniform withp(c m) = 1/2, the expectation value ofc2 m is EDc[c2 m] = 1 2 Z 1 −1 c2 m dcm = 1 3 .(A1) Similarly, ...
-
[43]
2-Rényi entropy Considering the reduced density matrixρA of a bipar- tition of the system, within the SPS ansatz framework, we can simply write the reduced density matrix as ρA = 1 Z MX m,m′=1 cmcm′ L/2O l=1 θm l ED θm′ l C mm′ B ,(A5) whereC mm′ B = LY l=L/2+1 D θm l θm′ l E and D θm l θm′ l E = cos θm l −θ m′ l . Then the trace of the square of the re- ...
-
[44]
This section introduces the analytic formulations of these local ob- servables
Local Observables In the analyses of typicality, trainability, and ground- state search with the SPS ansatz, the computation relies on expectation values of local observables. This section introduces the analytic formulations of these local ob- servables. Considering, for instance, the Pauli-Xoperator at site k,σ x k, the expectation value is given by ⟨σx...
-
[45]
Gradients of the expectation value The SPS ansatz allows for an analytical evaluation of the derivative of the expectation value of a local observ- able with respect to any ansatz parameter inΘ, as util- ized in Secs. IIB and III. In particular, the derivative of 11 the energy with respect to an SPS parameterϑis given by ∂ESPS ∂ϑ = 1 Z ∂⟨Ψ u|H|Ψu⟩ ∂ϑ −E S...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.