Comparing Classical Simulation and Sample-Based Learning of Quantum Systems: Learning the Hardness of Quantum Systems from Samples

Fernando de Melo; Jo\~ao Pedro Del Rey; Ra\'ul O. Vallejos

arxiv: 2605.28986 · v1 · pith:CA7RADTAnew · submitted 2026-05-27 · 🪐 quant-ph

Comparing Classical Simulation and Sample-Based Learning of Quantum Systems: Learning the Hardness of Quantum Systems from Samples

Jo\~ao Pedro Del Rey , Ra\'ul O. Vallejos , Fernando de Melo This is my paper

Pith reviewed 2026-06-29 11:11 UTC · model grok-4.3

classification 🪐 quant-ph

keywords quantum simulationsample-based learningentanglementnon-stabilizernessgenerative modelshessian eigenvaluequantum hardnessneural networks

0 comments

The pith

Within the regimes studied, sample-based learning difficulty for quantum states tracks their classical simulation hardness from entanglement and non-stabilizerness.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper examines whether the ease of learning quantum states from measurement samples matches how hard they are to simulate classically. It tests this by training one fixed neural network model on samples from states where entanglement or non-stabilizerness is increased in controlled ways. If the two measures align, then the way neural networks train could serve as a practical test for quantum computational hardness. A reader might care because it connects theoretical complexity ideas to observable training behavior in models applied to quantum data.

Core claim

The central claim is that for the regimes studied, classical learnability tracks known simulation complexity measures. Increasing the bond dimension of matrix product states or the number of T gates in Clifford-dominated circuits leads to larger largest Hessian eigenvalues at convergence and degraded reconstruction performance under Random Subspace Optimization for the energy-based generative model.

What carries the argument

The observed tracking between simulation cost parameters (bond dimension for entanglement, T-gate count for non-stabilizerness) and neural-network complexity probes (largest Hessian eigenvalue, Random Subspace Optimization).

If this is right

Neural-network training dynamics can provide an empirical probe of quantum computational hardness.
Learnability from samples serves as a proxy for simulation complexity when entanglement or non-stabilizerness is varied.
Both entanglement and non-stabilizerness increase learning difficulty in the same direction as they increase simulation cost.
Reconstruction performance degrades under constrained model capacity as either quantum resource grows.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The correlation might be tested on other controlled families beyond matrix product states and Clifford-plus-T circuits.
If the pattern holds, it could help identify quantum data sets that are practically hard for classical learning algorithms.
Training dynamics might offer a way to rank the effective hardness of quantum systems without running full classical simulations.

Load-bearing premise

The fixed energy-based generative model architecture together with the two chosen probes yield a representative measure of learnability that is not an artifact of this specific model class or of the particular controlled state families.

What would settle it

A counterexample family of quantum states where simulation cost is high yet the model learns the samples to high accuracy, or where simulation cost is low yet learning fails under the same training conditions.

Figures

Figures reproduced from arXiv: 2605.28986 by Fernando de Melo, Jo\~ao Pedro Del Rey, Ra\'ul O. Vallejos.

**Figure 2.** Figure 2: FIG. 2 [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 4.** Figure 4: FIG. 4 [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗

**Figure 3.** Figure 3: FIG. 3 [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 7.** Figure 7: FIG. 7. Number of training epochs required for convergence [PITH_FULL_IMAGE:figures/full_fig_p007_7.png] view at source ↗

**Figure 5.** Figure 5: FIG. 5. Entanglement entropy [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗

**Figure 6.** Figure 6: FIG. 6. Entanglement entropy [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗

**Figure 8.** Figure 8: FIG. 8. RSO reconstruction error [PITH_FULL_IMAGE:figures/full_fig_p008_8.png] view at source ↗

read the original abstract

We investigate the relationship between two distinct classical approaches to quantum systems: direct simulation from a classical description and sample-based learning from measurement data. While both tasks ultimately aim to reproduce Born-rule statistics, complexity-theoretic results suggest that simulability and learnability need not coincide in general. Here we study this relationship empirically using a fixed deep energy-based generative model trained on measurement samples from controlled families of quantum states. We independently tune two quantum resources associated with classical simulation cost: entanglement, through the bond dimension of random matrix product states, and non-stabilizerness, through the number of T gates in Clifford-dominated circuits. Learning difficulty is characterized using two probes of neural-network complexity: the largest Hessian eigenvalue at convergence and Random Subspace Optimization. For both quantum resources, increasing simulation hardness systematically correlates with sharper loss landscapes and degraded reconstruction performance under constrained capacity. Our results indicate that, within the regimes studied here, classical learnability tracks known simulation complexity measures, suggesting that neural-network training dynamics can provide an empirical probe of quantum computational hardness.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper reports a controlled empirical correlation between simulation hardness parameters and NN training difficulty on quantum samples, but the single fixed model leaves the generality open.

read the letter

The main thing to know is that the authors vary bond dimension in random MPS and T-count in Clifford+T circuits, then train one deep energy-based generative model on samples and track two probes of learning difficulty: largest Hessian eigenvalue at convergence and performance under random subspace optimization. Both hardness knobs produce the same trend: higher simulation cost lines up with sharper landscapes and worse reconstruction under capacity limits.

What works is the clean separation of the two quantum resources and the use of two independent probes on the learning side. The setup is straightforward and the abstract is upfront that the claim is limited to the regimes they studied.

The soft spot is exactly what the stress-test flags: everything sits on one model class, one training procedure, and one pair of probes. No ablations across architectures, no error bars or run statistics, and no alternative learnability metrics are described. If this particular network family encodes the Born statistics of these state families in a biased way, the observed correlation could be an artifact of that choice rather than a broader link between simulation cost and learnability.

This is for people at the quantum-ML boundary who want empirical pointers on where classical methods start to struggle. It is worth sending to peer review because the controlled variation is new and the question is well-posed, even though any referee will want checks on model dependence before treating the correlation as general.

Referee Report

2 major / 0 minor

Summary. The paper empirically studies the link between classical simulation hardness and sample-based learnability of quantum states. Using a fixed deep energy-based generative model, it independently varies bond dimension in random matrix-product states (controlling entanglement) and T-gate count in Clifford+T circuits (controlling non-stabilizerness), then measures learning difficulty via the largest Hessian eigenvalue at convergence and performance under Random Subspace Optimization. The central finding is that increasing either simulation-cost parameter systematically correlates with sharper loss landscapes and degraded reconstruction under capacity constraints, suggesting neural-network training dynamics can empirically probe quantum computational hardness.

Significance. If the correlation holds beyond the specific model class, the work supplies a concrete empirical bridge between two classically distinct tasks (simulation from description versus learning from samples) that complexity theory suggests need not coincide. The controlled, independent variation of two hardness parameters is a methodological strength. However, the absence of architecture ablations or alternative learnability metrics leaves open whether the observed tracking is general or an artifact of the chosen energy-based model and probes.

major comments (2)

[Abstract] Abstract (description of experimental setup): the central claim that 'classical learnability tracks known simulation complexity measures' rests on the assumption that the fixed deep energy-based generative model together with the Hessian-eigenvalue and Random-Subspace-Optimization probes yield a representative measure of learnability. No ablation across model architectures, training procedures, or alternative learnability metrics is described, so any systematic bias in how this network family encodes Born-rule statistics of the chosen ensembles would produce a spurious correlation.
[Abstract] Abstract (reporting of results): the text states that 'increasing simulation hardness systematically correlates with sharper loss landscapes and degraded reconstruction' but supplies no statistical details, error bars, number of independent runs, or significance tests. Without these, the reported 'consistent trends in two learning probes' remain plausible yet unverified.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments. We address each major point below, indicating planned revisions where appropriate.

read point-by-point responses

Referee: [Abstract] Abstract (description of experimental setup): the central claim that 'classical learnability tracks known simulation complexity measures' rests on the assumption that the fixed deep energy-based generative model together with the Hessian-eigenvalue and Random-Subspace-Optimization probes yield a representative measure of learnability. No ablation across model architectures, training procedures, or alternative learnability metrics is described, so any systematic bias in how this network family encodes Born-rule statistics of the chosen ensembles would produce a spurious correlation.

Authors: Our experimental design deliberately fixes the learning architecture to isolate the effects of varying the two quantum hardness parameters (bond dimension and T-gate count) while holding all other factors constant. This controlled setup directly tests whether the chosen learnability probes track simulation cost. We agree that the absence of architecture ablations leaves open the question of generality, and we will add an explicit discussion of this scope limitation together with suggestions for future work in the revised manuscript. revision: partial
Referee: [Abstract] Abstract (reporting of results): the text states that 'increasing simulation hardness systematically correlates with sharper loss landscapes and degraded reconstruction' but supplies no statistical details, error bars, number of independent runs, or significance tests. Without these, the reported 'consistent trends in two learning probes' remain plausible yet unverified.

Authors: The main text and figures report results averaged over multiple independent runs with error bars; we will revise the abstract to include a concise statement of these statistical details (e.g., number of runs and consistency of trends). revision: yes

Circularity Check

0 steps flagged

No significant circularity: empirical correlation between independently measured quantities.

full rationale

The paper reports an empirical study in which simulation hardness is controlled via externally defined parameters (bond dimension of random MPS, T-gate count in Clifford+T circuits) while learnability is assessed via separate neural-network probes (largest Hessian eigenvalue at convergence, Random Subspace Optimization performance). No derivation, equation, or self-citation reduces one set of quantities to the other by construction or by fitting a parameter that is then renamed as a prediction. The central claim is an observed correlation within fixed model families, not a self-referential identity.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on standard domain assumptions from quantum information and machine learning; no new free parameters, axioms, or invented entities are introduced in the abstract.

axioms (2)

domain assumption The chosen energy-based generative model is capable of approximating the Born-rule statistics of the sampled quantum states under sufficient training.
Implicit in the use of the model to reproduce measurement statistics.
domain assumption Largest Hessian eigenvalue at convergence and Random Subspace Optimization performance are valid quantitative proxies for learning difficulty.
Used without further justification to characterize learning difficulty.

pith-pipeline@v0.9.1-grok · 5721 in / 1254 out tokens · 45385 ms · 2026-06-29T11:11:25.588664+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

73 extracted references · 20 canonical work pages · 13 internal anchors

[1]

A further limitation concerns the measurement basis

– the smallestdthat recovers a target fraction of full- model performance – would provide a single number per dataset that aims to be consistent across architectures, and would allow a more direct comparison between the two resources. A further limitation concerns the measurement basis. Our model is trained exclusively on Born distributions in the computa...

2025
[2]

M. J. Bremner, R. Jozsa, and D. J. Shepherd, Proceed- ings of the Royal Society A: Mathematical, Physical and Engineering Sciences467, 459 (2011)

2011
[3]

Aaronson and A

S. Aaronson and A. Arkhipov, inProceedings of the forty- third annual ACM symposium on Theory of computing (2011) pp. 333–342

2011
[4]

Aaronson, Proceedings of the Royal Society A: Math- ematical, Physical and Engineering Sciences463, 3089 (2007)

S. Aaronson, Proceedings of the Royal Society A: Math- ematical, Physical and Engineering Sciences463, 3089 (2007)

2007
[5]

Hinsche, M

M. Hinsche, M. Ioannou, A. Nietner, J. Haferkamp, Y. Quek, D. Hangleiter, J.-P. Seifert, J. Eisert, and R. Sweke, Physical Review Letters130, 240602 (2023), arXiv:2207.03140 [quant-ph]

work page arXiv 2023
[6]

Torlai, G

G. Torlai, G. Mazzola, J. Carrasquilla, M. Troyer, R. Melko, and G. Carleo, Nature physics14, 447 (2018)

2018
[7]

Carrasquilla and R

J. Carrasquilla and R. G. Melko, Nature Physics13, 431 (2017)

2017
[8]

Carrasquilla, Advances in Physics: X5, 1797528 (2020)

J. Carrasquilla, Advances in Physics: X5, 1797528 (2020)

2020
[9]

Y. Du, Y. Zhu, Y.-H. Zhang, M.-H. Hsieh, P. Rebentrost, W. Gao, Y.-D. Wu, J. Eisert, G. Chiribella, D. Tao,et al., arXiv preprint arXiv:2509.04923 (2025)

work page arXiv 2025
[10]

Bittel, A

L. Bittel, A. A. Mele, J. Eisert, and L. Leone, Quantum 9, 1665 (2025)

2025
[11]

Huang, R

H.-Y. Huang, R. Kueng, G. Torlai, V. V. Albert, and J. Preskill, Science377, eabk3333 (2022)

2022
[12]

A condition under which classical simulability implies efficient state learnability

M. Yoganathan, A condition under which classical simulability implies efficient state learnability (2019), arXiv:1907.08163 [quant-ph]

work page internal anchor Pith review Pith/arXiv arXiv 2019
[13]

Ippoliti and V

M. Ippoliti and V. Khemani, PRX Quantum5, 020304 (2024)

2024
[14]

H. Kim, A. Kumar, Y. Zhou, Y. Xu, R. Vasseur, and E.-A. Kim, arXiv preprint arXiv:2508.15895 (2025)

work page arXiv 2025
[15]

Kliesch and I

M. Kliesch and I. Roth, PRX quantum2, 010201 (2021)

2021
[16]

Hangleiter and J

D. Hangleiter and J. Eisert, Reviews of Modern Physics 95, 035001 (2023)

2023
[17]

H. T. Nguyen, P. Krishnan, D. Krishnaswamy, M. Us- man, and R. Buyya, arXiv preprint arXiv:2404.11420 (2024)

work page arXiv 2024
[18]

M. Y. Niu, A. M. Dai, L. Li, A. Odena, Z. Zhao, V. Smelyanskyi, H. Neven, and S. Boixo, arXiv preprint arXiv:2010.11983 (2020)

work page arXiv 2010
[19]

W. J. Maddox, G. Benton, and A. G. Wilson, arXiv preprint arXiv:2003.02139 (2020)

work page arXiv 2003
[20]

C. Li, H. Farkhoor, R. Liu, and J. Yosinski, arXiv preprint arXiv:1804.08838 (2018)

work page internal anchor Pith review Pith/arXiv arXiv 2018
[21]

Ghorbani, S

B. Ghorbani, S. Krishnan, and Y. Xiao, inInterna- tional Conference on Machine Learning(PMLR, 2019) pp. 2232–2241

2019
[22]

Schollw¨ ock, Annals of physics326, 96 (2011)

U. Schollw¨ ock, Annals of physics326, 96 (2011)

2011
[23]

Bravyi and D

S. Bravyi and D. Gosset, Physical review letters116, 250501 (2016)

2016
[24]

Empirical Analysis of the Hessian of Over-Parametrized Neural Networks

L. Sagun, U. Evci, V. U. Guney, Y. Dauphin, and L. Bot- tou, arXiv preprint arXiv:1706.04454 (2017)

work page internal anchor Pith review Pith/arXiv arXiv 2017
[25]

Classical simulation of quantum computation, the Gottesman-Knill theorem, and slightly beyond

M. Nest, arXiv preprint arXiv:0811.0898 (2008)

work page internal anchor Pith review Pith/arXiv arXiv 2008
[26]

The Heisenberg Representation of Quantum Computers

D. Gottesman, arXiv preprint quant-ph/9807006 (1998)

work page internal anchor Pith review Pith/arXiv arXiv 1998
[27]

Aaronson and D

S. Aaronson and D. Gottesman, Physical Review A—Atomic, Molecular, and Optical Physics70, 052328 (2004)

2004
[28]

Bravyi and A

S. Bravyi and A. Kitaev, Physical Review A—Atomic, Molecular, and Optical Physics71, 022316 (2005)

2005
[29]

L. Chen, R. J. Garcia, K. Bu, and A. Jaffe, Physical Review B109, 174207 (2024)

2024
[30]

Vidal, Physical review letters91, 147902 (2003)

G. Vidal, Physical review letters91, 147902 (2003). 9

2003
[31]

S. Zhou, Z. Yang, A. Hamma, and C. Chamon, SciPost Physics9, 087 (2020)

2020
[32]

Iannotti, G

D. Iannotti, G. Esposito, L. C. Venuti, and A. Hamma, Quantum9, 1797 (2025)

2025
[33]

Gray, Journal of Open Source Software3, 819 (2018)

J. Gray, Journal of Open Source Software3, 819 (2018)

2018
[34]

Garnerone, T

S. Garnerone, T. R. de Oliveira, and P. Zanardi, Physical Review A—Atomic, Molecular, and Optical Physics81, 032336 (2010)

2010
[35]

Quantum computing with Qiskit

A. Javadi-Abhari, M. Treinish, K. Krsulich, C. J. Wood, J. Lishman, J. Gacon, S. Martiel, P. D. Na- tion, L. S. Bishop, A. W. Cross,et al., arXiv preprint arXiv:2405.08810 (2024)

work page internal anchor Pith review Pith/arXiv arXiv 2024
[36]

G. E. Hinton, Neural computation14, 1771 (2002)

2002
[37]

N. S. Keskar, D. Mudigere, J. Nocedal, M. Smelyan- skiy, and P. T. P. Tang, arXiv preprint arXiv:1609.04836 (2016)

work page internal anchor Pith review Pith/arXiv arXiv 2016
[38]

Sharpness-Aware Minimization for Efficiently Improving Generalization

P. Foret, A. Kleiner, H. Mobahi, and B. Neyshabur, arXiv preprint arXiv:2010.01412 (2020)

work page internal anchor Pith review Pith/arXiv arXiv 2010
[39]

A note on the evaluation of generative models

L. Theis, A. v. d. Oord, and M. Bethge, arXiv preprint arXiv:1511.01844 (2015)

work page internal anchor Pith review Pith/arXiv arXiv 2015
[40]

Eigenvalues of the Hessian in Deep Learning: Singularity and Beyond

L. Sagun, L. Bottou, and Y. LeCun, arXiv preprint arXiv:1611.07476 (2016)

work page internal anchor Pith review Pith/arXiv arXiv 2016
[41]

Gradient Descent Happens in a Tiny Subspace

G. Gur-Ari, D. A. Roberts, and E. Dyer, arXiv preprint arXiv:1812.04754 (2018)

work page internal anchor Pith review Pith/arXiv arXiv 2018
[42]

Fort and S

S. Fort and S. Ganguli, arXiv preprint arXiv:1910.05929 (2019)

work page arXiv 1910
[43]

Chaudhari, A

P. Chaudhari, A. Choromanska, S. Soatto, Y. Le- Cun, C. Baldassi, C. Borgs, J. Chayes, L. Sagun, and R. Zecchina, Journal of Statistical Mechanics: Theory and Experiment2019, 124018 (2019)

2019
[44]

LeCun, J

Y. LeCun, J. Denker, and S. Solla, Advances in neural information processing systems2(1989)

1989
[45]

Frantar and D

E. Frantar and D. Alistarh, inInternational conference on machine learning(PMLR, 2023) pp. 10323–10337

2023
[46]

P. W. Koh and P. Liang, inInternational conference on machine learning(PMLR, 2017) pp. 1885–1894

2017
[47]

Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

A. Power, Y. Burda, H. Edwards, I. Babuschkin, and V. Misra, arXiv preprint arXiv:2201.02177 (2022)

work page internal anchor Pith review Pith/arXiv arXiv 2022
[48]

T. Han, L. Adilova, H. Petzka, J. Kleesiek, and M. Kamp, Advances in Neural Information Processing Systems38, 106766 (2026)

2026
[49]

Bengio, A

Y. Bengio, A. Courville, and P. Vincent, IEEE transac- tions on pattern analysis and machine intelligence35, 1798 (2013)

2013
[50]

E. J. Hu, Y. Shen, P. Wallis, Z. Allen-Zhu, Y. Li, S. Wang, L. Wang, W. Chen,et al., Iclr1, 3 (2022)

2022
[51]

Dettmers, M

T. Dettmers, M. Lewis, Y. Belkada, and L. Zettlemoyer, Advances in neural information processing systems35, 30318 (2022)

2022
[52]

The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks

J. Frankle and M. Carbin, arXiv preprint arXiv:1803.03635 (2018)

work page internal anchor Pith review Pith/arXiv arXiv 2018
[53]

Barton, J

B. Barton, J. Carrasquilla, C. Roth, and A. Valenti, Ma- chine Learning: Science and Technology (2026)

2026
[54]

Raghu, B

M. Raghu, B. Poole, J. Kleinberg, S. Ganguli, and J. Sohl-Dickstein, ininternational conference on machine learning(PMLR, 2017) pp. 2847–2854

2017
[55]

Aghajanyan, S

A. Aghajanyan, S. Gupta, and L. Zettlemoyer, inPro- ceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing (volume 1: long papers)(2021) pp. 7319–7328

2021
[56]

M. A. Nielsen and I. L. Chuang,Quantum computation and quantum information(Cambridge university press, 2010)

2010
[57]

T. M. Cover,Elements of information theory(John Wi- ley & Sons, 1999)

1999
[58]

Veitch, S

V. Veitch, S. H. Mousavian, D. Gottesman, and J. Emer- son, New Journal of Physics16, 013009 (2014)

2014
[59]

Turkeshi, E

X. Turkeshi, E. Tirrito, and P. Sierant, Nature Commu- nications16, 2575 (2025)

2025
[60]

Leone, S

L. Leone, S. F. Oliviero, Y. Zhou, and A. Hamma, Quan- tum5, 453 (2021)

2021
[61]

Z. Yao, A. Gholami, S. Shen, M. Mustafa, K. Keutzer, and M. Mahoney, inproceedings of the AAAI conference on artificial intelligence, Vol. 35 (2021) pp. 10665–10673

2021
[62]

Brunner, D

N. Brunner, D. Cavalcanti, S. Pironio, V. Scarani, and S. Wehner, Reviews of modern physics86, 419 (2014)

2014
[63]

R. Uola, A. C. Costa, H. C. Nguyen, and O. G¨ uhne, Reviews of Modern Physics92, 015001 (2020)

2020
[64]

A. Bera, T. Das, D. Sadhukhan, S. Singha Roy, A. Sen, and U. Sen, Reports on Progress in Physics81, 024001 (2018)

2018
[65]

Horodecki, P

R. Horodecki, P. Horodecki, M. Horodecki, and K. Horodecki, Reviews of modern physics81, 865 (2009)

2009
[66]

Facco, M

E. Facco, M. d’Errico, A. Rodriguez, and A. Laio, Scien- tific reports7, 12140 (2017)

2017
[67]

Weimar, L

M. Weimar, L. M. Rachbauer, I. Starshynov, D. Faccio, L. Adilova, D. Bouchet, and S. Rotter, Physical Review X15, 031072 (2025)

2025
[68]

Kamkari, B

H. Kamkari, B. L. Ross, R. Hosseinzadeh, J. C. Cresswell, and G. Loaiza-Ganem, Advances in Neural Information Processing Systems37, 38307 (2024)

2024
[69]

Tempczyk, R

P. Tempczyk, R. Michaluk, L. Garncarek, P. Spurek, J. Tabor, and A. Golinski, inInternational Conference on Machine Learning(PMLR, 2022) pp. 21205–21231

2022
[70]

Passetti, D

G. Passetti, D. Hofmann, P. Neitemeier, L. Grunwald, M. A. Sentef, and D. M. Kennes, Physical Review Letters 131, 036502 (2023)

2023
[71]

Denis, A

Z. Denis, A. Sinibaldi, and G. Carleo, Physical Review Letters134, 079701 (2025)

2025
[72]

Bukov, A

M. Bukov, A. G. Day, D. Sels, P. Weinberg, A. Polkovnikov, and P. Mehta, Physical Review X8, 031086 (2018)

2018
[73]

Valenti, E

A. Valenti, E. van Nieuwenburg, S. Huber, and E. Gre- plova, Physical Review Research1, 033092 (2019)

2019

[1] [1]

A further limitation concerns the measurement basis

– the smallestdthat recovers a target fraction of full- model performance – would provide a single number per dataset that aims to be consistent across architectures, and would allow a more direct comparison between the two resources. A further limitation concerns the measurement basis. Our model is trained exclusively on Born distributions in the computa...

2025

[2] [2]

M. J. Bremner, R. Jozsa, and D. J. Shepherd, Proceed- ings of the Royal Society A: Mathematical, Physical and Engineering Sciences467, 459 (2011)

2011

[3] [3]

Aaronson and A

S. Aaronson and A. Arkhipov, inProceedings of the forty- third annual ACM symposium on Theory of computing (2011) pp. 333–342

2011

[4] [4]

Aaronson, Proceedings of the Royal Society A: Math- ematical, Physical and Engineering Sciences463, 3089 (2007)

S. Aaronson, Proceedings of the Royal Society A: Math- ematical, Physical and Engineering Sciences463, 3089 (2007)

2007

[5] [5]

Hinsche, M

M. Hinsche, M. Ioannou, A. Nietner, J. Haferkamp, Y. Quek, D. Hangleiter, J.-P. Seifert, J. Eisert, and R. Sweke, Physical Review Letters130, 240602 (2023), arXiv:2207.03140 [quant-ph]

work page arXiv 2023

[6] [6]

Torlai, G

G. Torlai, G. Mazzola, J. Carrasquilla, M. Troyer, R. Melko, and G. Carleo, Nature physics14, 447 (2018)

2018

[7] [7]

Carrasquilla and R

J. Carrasquilla and R. G. Melko, Nature Physics13, 431 (2017)

2017

[8] [8]

Carrasquilla, Advances in Physics: X5, 1797528 (2020)

J. Carrasquilla, Advances in Physics: X5, 1797528 (2020)

2020

[9] [9]

Y. Du, Y. Zhu, Y.-H. Zhang, M.-H. Hsieh, P. Rebentrost, W. Gao, Y.-D. Wu, J. Eisert, G. Chiribella, D. Tao,et al., arXiv preprint arXiv:2509.04923 (2025)

work page arXiv 2025

[10] [10]

Bittel, A

L. Bittel, A. A. Mele, J. Eisert, and L. Leone, Quantum 9, 1665 (2025)

2025

[11] [11]

Huang, R

H.-Y. Huang, R. Kueng, G. Torlai, V. V. Albert, and J. Preskill, Science377, eabk3333 (2022)

2022

[12] [12]

A condition under which classical simulability implies efficient state learnability

M. Yoganathan, A condition under which classical simulability implies efficient state learnability (2019), arXiv:1907.08163 [quant-ph]

work page internal anchor Pith review Pith/arXiv arXiv 2019

[13] [13]

Ippoliti and V

M. Ippoliti and V. Khemani, PRX Quantum5, 020304 (2024)

2024

[14] [14]

H. Kim, A. Kumar, Y. Zhou, Y. Xu, R. Vasseur, and E.-A. Kim, arXiv preprint arXiv:2508.15895 (2025)

work page arXiv 2025

[15] [15]

Kliesch and I

M. Kliesch and I. Roth, PRX quantum2, 010201 (2021)

2021

[16] [16]

Hangleiter and J

D. Hangleiter and J. Eisert, Reviews of Modern Physics 95, 035001 (2023)

2023

[17] [17]

H. T. Nguyen, P. Krishnan, D. Krishnaswamy, M. Us- man, and R. Buyya, arXiv preprint arXiv:2404.11420 (2024)

work page arXiv 2024

[18] [18]

M. Y. Niu, A. M. Dai, L. Li, A. Odena, Z. Zhao, V. Smelyanskyi, H. Neven, and S. Boixo, arXiv preprint arXiv:2010.11983 (2020)

work page arXiv 2010

[19] [19]

W. J. Maddox, G. Benton, and A. G. Wilson, arXiv preprint arXiv:2003.02139 (2020)

work page arXiv 2003

[20] [20]

C. Li, H. Farkhoor, R. Liu, and J. Yosinski, arXiv preprint arXiv:1804.08838 (2018)

work page internal anchor Pith review Pith/arXiv arXiv 2018

[21] [21]

Ghorbani, S

B. Ghorbani, S. Krishnan, and Y. Xiao, inInterna- tional Conference on Machine Learning(PMLR, 2019) pp. 2232–2241

2019

[22] [22]

Schollw¨ ock, Annals of physics326, 96 (2011)

U. Schollw¨ ock, Annals of physics326, 96 (2011)

2011

[23] [23]

Bravyi and D

S. Bravyi and D. Gosset, Physical review letters116, 250501 (2016)

2016

[24] [24]

Empirical Analysis of the Hessian of Over-Parametrized Neural Networks

L. Sagun, U. Evci, V. U. Guney, Y. Dauphin, and L. Bot- tou, arXiv preprint arXiv:1706.04454 (2017)

work page internal anchor Pith review Pith/arXiv arXiv 2017

[25] [25]

Classical simulation of quantum computation, the Gottesman-Knill theorem, and slightly beyond

M. Nest, arXiv preprint arXiv:0811.0898 (2008)

work page internal anchor Pith review Pith/arXiv arXiv 2008

[26] [26]

The Heisenberg Representation of Quantum Computers

D. Gottesman, arXiv preprint quant-ph/9807006 (1998)

work page internal anchor Pith review Pith/arXiv arXiv 1998

[27] [27]

Aaronson and D

S. Aaronson and D. Gottesman, Physical Review A—Atomic, Molecular, and Optical Physics70, 052328 (2004)

2004

[28] [28]

Bravyi and A

S. Bravyi and A. Kitaev, Physical Review A—Atomic, Molecular, and Optical Physics71, 022316 (2005)

2005

[29] [29]

L. Chen, R. J. Garcia, K. Bu, and A. Jaffe, Physical Review B109, 174207 (2024)

2024

[30] [30]

Vidal, Physical review letters91, 147902 (2003)

G. Vidal, Physical review letters91, 147902 (2003). 9

2003

[31] [31]

S. Zhou, Z. Yang, A. Hamma, and C. Chamon, SciPost Physics9, 087 (2020)

2020

[32] [32]

Iannotti, G

D. Iannotti, G. Esposito, L. C. Venuti, and A. Hamma, Quantum9, 1797 (2025)

2025

[33] [33]

Gray, Journal of Open Source Software3, 819 (2018)

J. Gray, Journal of Open Source Software3, 819 (2018)

2018

[34] [34]

Garnerone, T

S. Garnerone, T. R. de Oliveira, and P. Zanardi, Physical Review A—Atomic, Molecular, and Optical Physics81, 032336 (2010)

2010

[35] [35]

Quantum computing with Qiskit

A. Javadi-Abhari, M. Treinish, K. Krsulich, C. J. Wood, J. Lishman, J. Gacon, S. Martiel, P. D. Na- tion, L. S. Bishop, A. W. Cross,et al., arXiv preprint arXiv:2405.08810 (2024)

work page internal anchor Pith review Pith/arXiv arXiv 2024

[36] [36]

G. E. Hinton, Neural computation14, 1771 (2002)

2002

[37] [37]

N. S. Keskar, D. Mudigere, J. Nocedal, M. Smelyan- skiy, and P. T. P. Tang, arXiv preprint arXiv:1609.04836 (2016)

work page internal anchor Pith review Pith/arXiv arXiv 2016

[38] [38]

Sharpness-Aware Minimization for Efficiently Improving Generalization

P. Foret, A. Kleiner, H. Mobahi, and B. Neyshabur, arXiv preprint arXiv:2010.01412 (2020)

work page internal anchor Pith review Pith/arXiv arXiv 2010

[39] [39]

A note on the evaluation of generative models

L. Theis, A. v. d. Oord, and M. Bethge, arXiv preprint arXiv:1511.01844 (2015)

work page internal anchor Pith review Pith/arXiv arXiv 2015

[40] [40]

Eigenvalues of the Hessian in Deep Learning: Singularity and Beyond

L. Sagun, L. Bottou, and Y. LeCun, arXiv preprint arXiv:1611.07476 (2016)

work page internal anchor Pith review Pith/arXiv arXiv 2016

[41] [41]

Gradient Descent Happens in a Tiny Subspace

G. Gur-Ari, D. A. Roberts, and E. Dyer, arXiv preprint arXiv:1812.04754 (2018)

work page internal anchor Pith review Pith/arXiv arXiv 2018

[42] [42]

Fort and S

S. Fort and S. Ganguli, arXiv preprint arXiv:1910.05929 (2019)

work page arXiv 1910

[43] [43]

Chaudhari, A

P. Chaudhari, A. Choromanska, S. Soatto, Y. Le- Cun, C. Baldassi, C. Borgs, J. Chayes, L. Sagun, and R. Zecchina, Journal of Statistical Mechanics: Theory and Experiment2019, 124018 (2019)

2019

[44] [44]

LeCun, J

Y. LeCun, J. Denker, and S. Solla, Advances in neural information processing systems2(1989)

1989

[45] [45]

Frantar and D

E. Frantar and D. Alistarh, inInternational conference on machine learning(PMLR, 2023) pp. 10323–10337

2023

[46] [46]

P. W. Koh and P. Liang, inInternational conference on machine learning(PMLR, 2017) pp. 1885–1894

2017

[47] [47]

Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

A. Power, Y. Burda, H. Edwards, I. Babuschkin, and V. Misra, arXiv preprint arXiv:2201.02177 (2022)

work page internal anchor Pith review Pith/arXiv arXiv 2022

[48] [48]

T. Han, L. Adilova, H. Petzka, J. Kleesiek, and M. Kamp, Advances in Neural Information Processing Systems38, 106766 (2026)

2026

[49] [49]

Bengio, A

Y. Bengio, A. Courville, and P. Vincent, IEEE transac- tions on pattern analysis and machine intelligence35, 1798 (2013)

2013

[50] [50]

E. J. Hu, Y. Shen, P. Wallis, Z. Allen-Zhu, Y. Li, S. Wang, L. Wang, W. Chen,et al., Iclr1, 3 (2022)

2022

[51] [51]

Dettmers, M

T. Dettmers, M. Lewis, Y. Belkada, and L. Zettlemoyer, Advances in neural information processing systems35, 30318 (2022)

2022

[52] [52]

The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks

J. Frankle and M. Carbin, arXiv preprint arXiv:1803.03635 (2018)

work page internal anchor Pith review Pith/arXiv arXiv 2018

[53] [53]

Barton, J

B. Barton, J. Carrasquilla, C. Roth, and A. Valenti, Ma- chine Learning: Science and Technology (2026)

2026

[54] [54]

Raghu, B

M. Raghu, B. Poole, J. Kleinberg, S. Ganguli, and J. Sohl-Dickstein, ininternational conference on machine learning(PMLR, 2017) pp. 2847–2854

2017

[55] [55]

Aghajanyan, S

A. Aghajanyan, S. Gupta, and L. Zettlemoyer, inPro- ceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing (volume 1: long papers)(2021) pp. 7319–7328

2021

[56] [56]

M. A. Nielsen and I. L. Chuang,Quantum computation and quantum information(Cambridge university press, 2010)

2010

[57] [57]

T. M. Cover,Elements of information theory(John Wi- ley & Sons, 1999)

1999

[58] [58]

Veitch, S

V. Veitch, S. H. Mousavian, D. Gottesman, and J. Emer- son, New Journal of Physics16, 013009 (2014)

2014

[59] [59]

Turkeshi, E

X. Turkeshi, E. Tirrito, and P. Sierant, Nature Commu- nications16, 2575 (2025)

2025

[60] [60]

Leone, S

L. Leone, S. F. Oliviero, Y. Zhou, and A. Hamma, Quan- tum5, 453 (2021)

2021

[61] [61]

Z. Yao, A. Gholami, S. Shen, M. Mustafa, K. Keutzer, and M. Mahoney, inproceedings of the AAAI conference on artificial intelligence, Vol. 35 (2021) pp. 10665–10673

2021

[62] [62]

Brunner, D

N. Brunner, D. Cavalcanti, S. Pironio, V. Scarani, and S. Wehner, Reviews of modern physics86, 419 (2014)

2014

[63] [63]

R. Uola, A. C. Costa, H. C. Nguyen, and O. G¨ uhne, Reviews of Modern Physics92, 015001 (2020)

2020

[64] [64]

A. Bera, T. Das, D. Sadhukhan, S. Singha Roy, A. Sen, and U. Sen, Reports on Progress in Physics81, 024001 (2018)

2018

[65] [65]

Horodecki, P

R. Horodecki, P. Horodecki, M. Horodecki, and K. Horodecki, Reviews of modern physics81, 865 (2009)

2009

[66] [66]

Facco, M

E. Facco, M. d’Errico, A. Rodriguez, and A. Laio, Scien- tific reports7, 12140 (2017)

2017

[67] [67]

Weimar, L

M. Weimar, L. M. Rachbauer, I. Starshynov, D. Faccio, L. Adilova, D. Bouchet, and S. Rotter, Physical Review X15, 031072 (2025)

2025

[68] [68]

Kamkari, B

H. Kamkari, B. L. Ross, R. Hosseinzadeh, J. C. Cresswell, and G. Loaiza-Ganem, Advances in Neural Information Processing Systems37, 38307 (2024)

2024

[69] [69]

Tempczyk, R

P. Tempczyk, R. Michaluk, L. Garncarek, P. Spurek, J. Tabor, and A. Golinski, inInternational Conference on Machine Learning(PMLR, 2022) pp. 21205–21231

2022

[70] [70]

Passetti, D

G. Passetti, D. Hofmann, P. Neitemeier, L. Grunwald, M. A. Sentef, and D. M. Kennes, Physical Review Letters 131, 036502 (2023)

2023

[71] [71]

Denis, A

Z. Denis, A. Sinibaldi, and G. Carleo, Physical Review Letters134, 079701 (2025)

2025

[72] [72]

Bukov, A

M. Bukov, A. G. Day, D. Sels, P. Weinberg, A. Polkovnikov, and P. Mehta, Physical Review X8, 031086 (2018)

2018

[73] [73]

Valenti, E

A. Valenti, E. van Nieuwenburg, S. Huber, and E. Gre- plova, Physical Review Research1, 033092 (2019)

2019