pith. sign in

arxiv: 2501.05958 · v3 · submitted 2025-01-10 · 🧮 math.NA · cs.NA· physics.chem-ph· physics.comp-ph

Lower Bound on the Representation Complexity of Antisymmetric Tensor Product Functions

Pith reviewed 2026-05-23 05:54 UTC · model grok-4.3

classification 🧮 math.NA cs.NAphysics.chem-phphysics.comp-ph
keywords antisymmetric tensor product functionsrepresentation complexitycanonical polyadic ranklower boundshigh-dimensional problemsquantum many-body systemsnumerical analysis
0
0 comments X

The pith

The minimum number of terms for a class of tensor product functions to be exactly antisymmetric grows exponentially with dimension.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proves that a class of tensor product functions (TPFs) cannot achieve exact antisymmetry unless the number of terms grows exponentially with the problem dimension. This bound applies equally to discretized TPFs and to versions parameterized by neural networks. A reader would care because TPF approximations are promoted for high-dimensional PDEs and eigenvalue problems on the basis of linear scaling, yet the antisymmetry constraint central to quantum many-body systems forces an exponential cost. The argument proceeds by identifying the TPFs with antisymmetric tensors and lower-bounding their Canonical Polyadic rank.

Core claim

The minimum number of involved terms for a class of TPFs to be exactly antisymmetric increases exponentially fast with the problem dimension. This class encompasses both traditionally discretized TPFs and recent neural-network-parameterized ones. The proof exploits the link between the antisymmetric TPFs and the corresponding antisymmetric tensors, focusing on the Canonical Polyadic rank of the latter, and concludes that low-rank TPFs are fundamentally unsuitable for high-dimensional problems where antisymmetry is essential.

What carries the argument

The identification of antisymmetric TPFs with antisymmetric tensors whose Canonical Polyadic rank supplies the exponential lower bound on the number of terms.

If this is right

  • Low-rank TPFs cannot exactly represent antisymmetric functions without incurring exponential cost in high dimensions.
  • The same exponential lower bound applies to neural-network-parameterized TPFs as to classical discretized ones.
  • TPF approximations lose their advertised linear scaling once antisymmetry is required.
  • High-dimensional quantum many-body problems lie outside the regime where low-rank TPFs can be exact.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Representations that enforce antisymmetry by construction, rather than through the choice of terms, may evade the exponential barrier identified here.
  • Analogous rank lower bounds may exist for other symmetry constraints, such as symmetry under particle exchange or under spatial rotations.
  • Practical approximations would need to quantify the error incurred when antisymmetry is only approximately satisfied by low-rank TPFs.

Load-bearing premise

The minimum number of terms required by the TPF class equals or exceeds the Canonical Polyadic rank of the associated antisymmetric tensor, and this identification holds for both discretized and neural-network cases.

What would settle it

An explicit construction, for arbitrarily large dimension, of an antisymmetric tensor whose Canonical Polyadic rank grows only polynomially, or a concrete antisymmetric TPF with polynomially many terms that satisfies the antisymmetry condition exactly.

Figures

Figures reproduced from arXiv: 2501.05958 by Xin Liu, Yukuan Hu, Yuyang Wang.

Figure 1
Figure 1. Figure 1: Numerical comparison of the TNN approximations with and without explicit antisym [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Architecture of TNN. TNN wave function ansatz follows a similar structure. Each electronic coordinate is represented by a d-dimensional vector rj = (rj1, . . . , rjd) ⊤ ∈ R d , with each one-dimensional coordinate processed by an independent subnetwork, resulting in a total of dN independent subnetworks [34]. The TNN wave function ansatz for this system is fθ(r) := Xp i=1 YN j=1 Yd k=1 ψijk(rjk; θjk) ! , ∀… view at source ↗
Figure 3
Figure 3. Figure 3: Comparison of TNN approximations with and without antisymmetrization for the one [PITH_FULL_IMAGE:figures/full_fig_p015_3.png] view at source ↗
read the original abstract

Tensor product function (TPF) approximations have been widely adopted in solving high-dimensional problems, such as partial differential equations and eigenvalue problems, achieving desirable accuracy with computational overhead that scales linearly with problem dimensions. However, recent studies have underscored the extraordinarily high computational cost of TPFs on quantum many-body problems, even for systems with as few as three particles. A key distinction in these problems is the antisymmetry requirement on the unknown functions. In the present work, we rigorously establish that the minimum number of involved terms for a class of TPFs to be exactly antisymmetric increases exponentially fast with the problem dimension. This class encompasses both traditionally discretized TPFs and the recent ones parameterized by neural networks. Our proof exploits the link between the antisymmetric TPFs in this class and the corresponding antisymmetric tensors and focuses on the Canonical Polyadic rank of the latter. As a result, our findings reveal that low-rank TPFs are fundamentally unsuitable for high-dimensional problems where antisymmetry is essential.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The paper claims to prove that the minimal number of terms in a class of tensor product functions (TPFs) needed to achieve exact antisymmetry grows exponentially with dimension. The class is asserted to include both basis-discretized TPFs and neural-network-parameterized TPFs. The argument proceeds by identifying antisymmetric TPFs with antisymmetric tensors and invoking a lower bound on their Canonical Polyadic (CP) rank.

Significance. If the claimed equivalence between term count and CP rank holds for both discretization and NN cases, the result supplies a concrete exponential lower bound that explains why low-rank TPF representations become impractical for antisymmetric high-dimensional problems such as quantum many-body systems. The manuscript does not supply machine-checked proofs or reproducible code, but the derivation is presented as parameter-free once the tensor identification is granted.

major comments (1)
  1. [Proof of the main result (Section 3 / Theorem 1)] The central reduction equates the minimal TPF term count to the CP rank of the associated antisymmetric tensor. This identification is immediate when univariate factors are expanded in a fixed finite basis (discretized case). For NN-parameterized univariate maps, which are arbitrary nonlinear functions, no explicit reduction is supplied showing that any such NN-TPF can be represented or bounded by a finite-basis expansion whose rank is controlled by the term count. Without this step the exponential lower bound does not automatically transfer to the NN setting, which is load-bearing for the claim that the result covers “both traditionally discretized TPFs and the recent ones parameterized by neural networks.”
minor comments (2)
  1. [Introduction] The precise definition of the “class of TPFs” under consideration should be stated explicitly in the introduction rather than deferred to the proof section.
  2. [Preliminaries] Notation for the antisymmetric tensor and its CP decomposition should be introduced with a short self-contained paragraph before the main theorem.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their thorough review and valuable comments on our manuscript. The main issue raised concerns the applicability of the lower bound to neural network-parameterized tensor product functions. We provide a response below and will update the manuscript to include the missing details.

read point-by-point responses
  1. Referee: [Proof of the main result (Section 3 / Theorem 1)] The central reduction equates the minimal TPF term count to the CP rank of the associated antisymmetric tensor. This identification is immediate when univariate factors are expanded in a fixed finite basis (discretized case). For NN-parameterized univariate maps, which are arbitrary nonlinear functions, no explicit reduction is supplied showing that any such NN-TPF can be represented or bounded by a finite-basis expansion whose rank is controlled by the term count. Without this step the exponential lower bound does not automatically transfer to the NN setting, which is load-bearing for the claim that the result covers “both traditionally discretized TPFs and the recent ones parameterized by neural networks.”

    Authors: We appreciate the referee pointing out the need for an explicit argument in the NN case. To clarify: consider any TPF representation with k terms that is exactly antisymmetric. Discretizing the domain using the same finite grid for each variable produces an antisymmetric tensor. The discretized TPF corresponds to a CP decomposition of this tensor with at most k terms, hence the CP rank is at most k. However, it is known (and used in our proof for the discretized case) that the CP rank of nonzero antisymmetric tensors grows exponentially with the dimension. This yields the same exponential lower bound on k for the NN case as well. We will insert this reduction explicitly into the revised Section 3 to strengthen the presentation. revision: yes

Circularity Check

0 steps flagged

No circularity detected; lower bound derived from external CP-rank definition

full rationale

The derivation establishes a lower bound on the number of TPF terms required for exact antisymmetry by linking the TPF class to the CP rank of the associated antisymmetric tensor. This link is presented as a mathematical equivalence based on the standard definition of CP rank (minimal number of rank-1 terms), which is an independent external concept rather than a fitted parameter or self-referential definition. The abstract and description indicate the proof applies the same tensor-rank argument to both discretized and NN-parameterized cases without any reduction of the claimed bound to the input assumptions by construction. No self-citations, ansatzes, or renamings are identified as load-bearing steps.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the mapping from antisymmetric TPFs to antisymmetric tensors and the properties of their CP rank; no free parameters or invented entities are introduced in the abstract.

axioms (1)
  • domain assumption Antisymmetric TPFs in the considered class correspond to antisymmetric tensors whose CP rank governs the minimal number of terms
    The proof exploits this link as stated in the abstract.

pith-pipeline@v0.9.0 · 5711 in / 1094 out tokens · 23361 ms · 2026-05-23T05:54:37.650336+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

59 extracted references · 59 canonical work pages · 1 internal anchor

  1. [1]

    Bachmayr

    M. Bachmayr. Low-rank tensor methods for partial differential equations. Acta Numerica, 32:1–121, 2023

  2. [2]

    Bajdich, L

    M. Bajdich, L. Mitas, G. Drobný, L. K. Wagner, and K. E. Schmidt. Pfaffian pairing wave functions in electronic-structure quantum Monte Carlo simulations. Physical Review Letters, 96(13), 2006

  3. [3]

    Bajdich, L

    M. Bajdich, L. Mitas, L. K. Wagner, and K. E. Schmidt. Pfaffian pairing and backflow wavefunctions for electronic structure quantum Monte Carlo methods. Physical Review B , 77(11), 2008

  4. [4]

    Bazarkhanov and V

    D. Bazarkhanov and V. Temlyakov. Nonlinear tensor product approximation of functions. Journal of Complexity , 31(6):867–884, 2015

  5. [5]

    Beylkin and M

    G. Beylkin and M. J. Mohlenkamp. Numerical operator calculus in higher dimensions. Proceedings of the National Academy of Sciences , 99(16):10246–10251, 2002

  6. [6]

    Beylkin and M

    G. Beylkin and M. J. Mohlenkamp. Algorithms for numerical analysis in high dimensions. SIAM Journal on Scientific Computing , 26(6):2133–2159, 2005

  7. [7]

    Bradbury, R

    J. Bradbury, R. Frostig, P. Hawkins, M. J. Johnson, C. Leary, D. Maclaurin, G. Nec- ula, A. Paszke, J. VanderPlas, S. Wanderman-Milne, and Q. Zhang. JAX: Composable transformations of Python+NumPy programs. http://github.com/jax-ml/jax, 2018

  8. [8]

    Chen and J

    Z. Chen and J. Lu. Exact and Efficient Representation of Totally Anti-Symmetric Func- tions. arXiv preprint arXiv:2311.05064, 2023

  9. [9]

    X. Dai, Y. Fan, and Z. Sheng. Subspace Method Based on Neural Networks for Eigenvalue Problems. arXiv preprint arXiv:2410.13358, 2024

  10. [10]

    H. Derksen. On the nuclear norm and the singular value decomposition of tensors. Foun- dations of Computational Mathematics , 16(3):779–811, 2016

  11. [11]

    DeVore, B

    R. DeVore, B. Hanin, and G. Petrova. Neural network approximation. Acta Numerica , 30:327–444, 2021

  12. [12]

    Dolgov, D

    S. Dolgov, D. Kressner, and C. Strössner. Functional Tucker approximation using Cheby- shev interpolation. SIAM Journal on Scientific Computing , 43(3):A2190–A2210, 2021. 15

  13. [13]

    W. E and B. Yu. The deep Ritz method: A deep learning-based numerical algorithm for solving variational problems. Communications in Mathematics and Statistics , 6:1–12, 2018

  14. [14]

    R. P. Feynman and M. Cohen. Energy spectrum of the excitations in liquid helium. Physical Review, 102(5):1189–1204, 1956

  15. [15]

    V. Fock. Näherungsmethode zur lösung des quantenmechanischen mehrkörperproblems. Zeitschrift für Physik , 61:126–148, 1930

  16. [16]

    Gao and S

    N. Gao and S. Günnemann. Neural Pfaffians: Solving Many Many-Electron Schrödinger Equations. arXiv preprint arXiv:2405.14762, 2024

  17. [17]

    Griebel and H

    M. Griebel and H. Harbrecht. Analysis of tensor approximation schemes for continuous functions. Foundations of Computational Mathematics , 23:219–240, 2023

  18. [18]

    Griebel, H

    M. Griebel, H. Harbrecht, and R. Schneider. Low-rank approximation of continuous func- tions in Sobolev spaces with dominating mixed smoothness. Mathematics of Computation , 92(342):1729–1746, 2023

  19. [19]

    Hackbusch

    W. Hackbusch. Tensor Spaces and Numerical Tensor Calculus , volume 42 of Springer Series in Computational Mathematics . Springer Cham, 2nd edition, 2012

  20. [20]

    Hackbusch and B

    W. Hackbusch and B. N. Khoromskij. Tensor-product approximation to operators and functions in high dimensions. Journal of Complexity , 23(4-6):697–714, 2007

  21. [21]

    Hackbusch and B

    W. Hackbusch and B. N. Khoromskij. Tensor-product approximation to multidimensional integral operators and Green’s functions. SIAM Journal on Matrix Analysis and Applica- tions, 30(3):1233–1253, 2008

  22. [22]

    J. Han, Y. Li, L. Lin, J. Lu, J. Zhang, and L. Zhang. Universal Approximation of Symmetric and Anti-Symmetric Functions. arXiv preprint arXiv:1912.01765, 2019

  23. [23]

    J. Han, L. Zhang, and W. E. Solving many-electron Schrödinger equation using deep neural networks. Journal of Computational Physics , 399, 2019

  24. [24]

    D. R. Hartree. The wave mechanics of an atom with a non-Coulomb central field. Part I. Theory and Methods. Mathematical Proceedings of the Cambridge Philosophical Society , 24(1):89–110, 1928

  25. [25]

    Helgaker, P

    T. Helgaker, P. Jørgensen, and J. Olsen. Molecular Electronic‐Structure Theory . John Wiley & Sons, Ltd, 1st edition, 2000

  26. [26]

    F. L. Hitchcock. The expression of a tensor or a polyadic as a sum of products. Journal of Mathematics and Physics , 6(1-4):164–189, 1927

  27. [27]

    Z. Hu, K. Shukla, G. E. Karniadakis, and K. Kawaguchi. Tackling the curse of dimension- ality with physics-informed neural networks. Neural Networks , 176, 2024

  28. [28]

    Huang, J

    H. Huang, J. Landsberg, and J. Lu. Geometry of backflow transformation ansatze for quantum many-body fermonic wavefunctions. Communications in Mathematical Sciences , 21(5):1447–1453, 2023

  29. [29]

    Huang, H

    J. Huang, H. Wu, and T. Zhou. Adaptive Neural Network Basis Methods for Partial Differential Equations with Low-Regular Solutions. arXiv preprint arXiv:2411.01998, 2024

  30. [30]

    M. Hutter. On Representing (Anti)symmetric Functions. arXiv preprint arXiv:2007.15298, 2020

  31. [31]

    Ilten and Z

    N. Ilten and Z. Teitler. Product ranks of the 3 3 determinant and permanent. Canadian Mathematical Bulletin , 59(2):311–319, 2016

  32. [32]

    R. Jastrow. Many-body problem with strong forces. Physical Review , 98(5):1479–1484, 1955

  33. [33]

    T. Kao, H. Zhang, L. Zhang, and J. Zhao. pETNNs: Partial Evolutionary Tensor Neu- ral Networks for Solving Time-Dependent Partial Differential Equations. arXiv preprint arXiv:2403.06084, 2024

  34. [34]

    Y. Liao, Z. Lin, J. Liu, Q. Sun, Y. Wang, T. Wu, and H. Xie. Solving Schrödinger Equation Using Tensor Neural Network. arXiv preprint arXiv:2209.12572, 2024. 16

  35. [35]

    Liao and P

    Y. Liao and P. Ming. Deep Nitsche method: Deep Ritz method with essential boundary conditions. Communications in Computational Physics , 29(5):1365–1384, January 2021

  36. [36]

    Y. Liao, Y. Wang, and H. Xie. Solving High Dimensional Partial Differential Equations Us- ing Tensor Type Discretization and Optimization Process. arXiv preprint arXiv:2211.16548, 2022

  37. [37]

    J. Lin, G. Goldshlager, and L. Lin. Explicitly antisymmetrized neural network layers for variational Monte Carlo simulation. Journal of Computational Physics , 474, 2023

  38. [38]

    Z. Lin, Y. Wang, and H. Xie. Adaptive Neural Network Subspace Method for Solving Partial Differential Equations with High Accuracy. arXiv preprint arXiv:2412.02586, 2024

  39. [39]

    P.-O. Löwdin. Correlation problem in many-electron quantum mechanics I. Review of different approaches and discussion of some current ideas. In I. Prigogine, editor, Advances in Chemical Physics , chapter 7, pages 207–322. John Wiley & Sons, Ltd, 1958

  40. [40]

    M. J. Mohlenkamp and L. Monzón. Trigonometric identities and sums of separable func- tions. The Mathematical Intelligencer , 27(2):65–69, 2005

  41. [41]

    I. V. Oseledets. Tensor-train decomposition. SIAM Journal on Scientific Computing , 33(5):2295–2317, 2011

  42. [42]

    I. V. Oseledets. Constructive representation of functions in low-rank tensor formats. Con- structive Approximation, 37:1–18, 2013

  43. [43]

    T. Pang, S. Yan, and M. Lin. O(N 2) Universal Antisymmetry in Fermionic Neural Net- works. arXiv preprint arXiv:2205.13205, 2022

  44. [44]

    W. Pauli. Über den zusammenhang des abschlusses der elektronengruppen im atom mit der komplexstruktur der spektren. Zeitschrift für Physik , 31(1):765–783, 1925

  45. [45]

    D. Pfau, J. S. Spencer, A. G. D. G. Matthews, and W. M. C. Foulkes. Ab initio solution of the many-electron Schrödinger equation with deep neural networks. Physical Review Research, 2(3), 2020

  46. [46]

    Raissi, P

    M. Raissi, P. Perdikaris, and G. E. Karniadakis. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. Journal of Computational Physics , 378:686–707, 2019

  47. [47]

    E. Schmidt. Zur theorie der linearen und nichtlinearen integralgleichungen. Mathematische Annalen, 63:433–476, 1907

  48. [48]

    Siegel, Q

    J. Siegel, Q. Hong, X. Jin, W. Hao, and J. Xu. Greedy training algorithms for neural networks and applications to PDEs. Journal of Computational Physics , 484, 2023

  49. [49]

    Sirignano and K

    J. Sirignano and K. Spiliopoulos. DGM: A deep learning algorithm for solving partial differential equations. Journal of Computational Physics , 375:1339–1364, 2018

  50. [50]

    J. C. Slater. Note on Hartree’s method. Physical Review, 35(2):210, 1930

  51. [51]

    T. Wang, Z. Hu, K. Kawaguchi, Z. Zhang, and G. E. Karniadakis. Tensor Neural Networks for High-Dimensional Fokker–Planck Equations. Neural Networks , 185:107165, 2025

  52. [52]

    Y. Wang, P. Jin, and H. Xie. Tensor Neural Network and Its Numerical Integration. arXiv preprint arXiv:2207.02754, 2022

  53. [53]

    Y. Wang, Z. Lin, Y. Liao, H. Liu, and H. Xie. Solving high-dimensional partial differential equations using tensor neural network and a posteriori error estimators. Journal of Scientific Computing, 101(67):1–29, 2024

  54. [54]

    Wang and H

    Y. Wang and H. Xie. Computing multi-eigenpairs of high-dimensional eigenvalue problems using tensor neural networks. Journal of Computational Physics , 506, 2024

  55. [55]

    Weidmann

    J. Weidmann. Linear Operators in Hilbert Spaces. Graduate Texts in Mathematics. Springer New York, NY, 1st edition, 1980

  56. [56]

    S. Xiao, P. Jin, and Y. Tang. Learning Solution Operators of PDEs Defined on Varying Domains via MIONet. arXiv preprint arXiv:2402.15097, 2024. 17

  57. [57]

    H. Ye, R. Li, Y. Gu, Y. Lu, D. He, and L. Wang. eO(N 2) Representation of General Continuous Anti-Symmetric Function. arXiv preprint arXiv:2402.15167, 2024

  58. [58]

    A Natural Deep Ritz Method for Essential Boundary Value Problems

    H. Yu and S. Zhang. A Natural Deep Ritz Method for Essential Boundary Value Problems. arXiv preprint arXiv:2411.09898, 2024

  59. [59]

    D. Zhou, H. Chen, C. H. Ho, and C. Ortner. A multilevel method for many-electron Schrödinger equations based on the atomic cluster expansion. SIAM Journal on Scientific Computing, 46(1):A105–A129, 2024. 18