Quantum Convolutional Neural Networks are Effectively Classically Simulable
Pith reviewed 2026-05-23 22:02 UTC · model grok-4.3
The pith
Randomly initialized QCNNs only access low-bodyness measurements and can be classically simulated on standard benchmarks.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
When randomly initialized, QCNNs can only operate on the information encoded in low-bodyness measurements of their input states. They are commonly benchmarked on locally-easy datasets whose states are precisely classifiable by the information encoded in these low-bodyness observables subspace. The QCNN's action on this subspace can be efficiently classically simulated by a classical algorithm equipped with Pauli shadows on the dataset, with explicit demonstrations on up to 1024 qubits for phases of matter classification.
What carries the argument
The low-bodyness measurement subspace, which both restricts randomly initialized QCNN operation and permits efficient Pauli-shadow classical simulation of its outputs.
If this is right
- QCNN performance on common phases-of-matter tasks can be replicated by a classical shadow algorithm up to at least 1024 qubits.
- Heuristic success of QCNNs on existing benchmarks follows from the datasets lying inside the low-bodyness subspace.
- The same shadow-based argument extends to other variational quantum architectures that share the low-bodyness restriction.
- Non-trivial datasets containing information outside the low-bodyness subspace are required to test whether QCNNs can exceed classical simulation.
Where Pith is reading between the lines
- Many current QML benchmark tasks may be inadequate for revealing quantum advantage because they are solvable by low-body classical methods.
- New datasets built around high-body or long-range correlations would serve as a direct test of whether the low-bodyness limitation can be overcome.
- If other variational models also begin in a low-bodyness regime, similar classical shadow simulations could be applied to them without further modification.
Load-bearing premise
Randomly initialized QCNNs stay restricted to the low-bodyness measurement subspace and the benchmark datasets are exactly those classifiable from information inside that same subspace.
What would settle it
An experiment in which a randomly initialized QCNN achieves high accuracy on a phases-of-matter dataset that requires observables of body greater than two, while the corresponding Pauli-shadow classical simulator fails to match that accuracy.
Figures
read the original abstract
Quantum Convolutional Neural Networks (QCNNs) are widely regarded as a promising model for Quantum Machine Learning (QML). In this work we tie their heuristic success to two facts. First, that when randomly initialized, they can only operate on the information encoded in low-bodyness measurements of their input states. And second, that they are commonly benchmarked on "locally-easy'' datasets whose states are precisely classifiable by the information encoded in these low-bodyness observables subspace. We further show that the QCNN's action on this subspace can be efficiently classically simulated by a classical algorithm equipped with Pauli shadows on the dataset. Indeed, we present a shadow-based simulation of QCNNs on up-to $1024$ qubits for phases of matter classification. Our results can then be understood as highlighting a deeper symptom of QML: Models could only be showing heuristic success because they are benchmarked on simple problems, for which their action can be classically simulated. This insight points to the fact that non-trivial datasets are a truly necessary ingredient for moving forward with QML. To finish, we discuss how our results can be extrapolated to classically simulate other architectures.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript claims that QCNNs succeed heuristically because random initialization restricts them to low-bodyness measurements and because common benchmark datasets (e.g., phases of matter) are classifiable from that subspace alone. It further asserts that QCNN action on this subspace is classically simulable via Pauli shadow tomography and demonstrates such a simulation on up to 1024 qubits. The work interprets this as evidence that QML heuristic success often reflects simple, classically simulable problems and calls for non-trivial datasets; it also discusses extrapolation to other architectures.
Significance. If the central claims hold, the result would be significant for QML evaluation practices by showing that certain architectures remain classically simulable on standard benchmarks even at large scale. The explicit shadow-based simulation reaching 1024 qubits constitutes a concrete, reproducible strength that quantifies the simulability claim for the phases-of-matter task.
major comments (2)
- [Abstract, paragraph 2] Abstract, paragraph 2: The restriction to the low-bodyness subspace is derived only for randomly initialized QCNNs. No argument or numerical check is supplied showing that gradient updates preserve membership in this subspace or that the trained circuit remains equivalent to a low-body observable; this gap is load-bearing for the claim that trained QCNNs are classically simulable.
- [Abstract, paragraph 2] Abstract, paragraph 2 and § on datasets: The statement that benchmark datasets are 'precisely classifiable by the information encoded in these low-bodyness observables' is asserted without a control experiment demonstrating that higher-weight observables cannot achieve higher accuracy on the same phases-of-matter data; this assumption directly supports the conclusion that the problems are classically simulable.
minor comments (2)
- [Introduction] Notation for 'low-bodyness' and 'low-body subspace' is used interchangeably without an explicit definition or reference to the precise operator support size; a short definition in the introduction would improve clarity.
- [Shadow simulation section] The shadow-tomography simulation section would benefit from an explicit statement of the number of shadows required and the error bounds used for the 1024-qubit experiments.
Simulated Author's Rebuttal
We thank the referee for their constructive comments, which help clarify the scope of our claims regarding random initialization versus trained models and the sufficiency of low-bodyness for the benchmark datasets. We provide point-by-point responses below and indicate where revisions will be made to address the concerns.
read point-by-point responses
-
Referee: The restriction to the low-bodyness subspace is derived only for randomly initialized QCNNs. No argument or numerical check is supplied showing that gradient updates preserve membership in this subspace or that the trained circuit remains equivalent to a low-body observable; this gap is load-bearing for the claim that trained QCNNs are classically simulable.
Authors: The derivation is indeed presented for the random initialization case, as the low-bodyness follows directly from the initial circuit structure before any training. For the trained case, the claim of classical simulability assumes that the optimization does not take the model outside this subspace. While this is plausible given the local nature of QCNNs, we acknowledge the absence of an explicit argument or check in the current manuscript. In the revision, we will add a section providing a proof that the bodyness is preserved under gradient updates due to the convolutional architecture, along with numerical verification on small instances. This addresses the load-bearing aspect of the claim. revision: yes
-
Referee: The statement that benchmark datasets are 'precisely classifiable by the information encoded in these low-bodyness observables' is asserted without a control experiment demonstrating that higher-weight observables cannot achieve higher accuracy on the same phases-of-matter data; this assumption directly supports the conclusion that the problems are classically simulable.
Authors: We recognize that the assertion would be strengthened by an explicit control. The phases-of-matter datasets are chosen because they are known to be distinguishable by local properties, but we did not include a direct comparison with higher-body observables. We will incorporate a control experiment in the revised manuscript, training or evaluating classifiers using higher-weight Pauli observables on the same data to show that they do not yield improved accuracy, thereby confirming the sufficiency of the low-bodyness subspace. revision: yes
Circularity Check
No significant circularity; simulation applies standard shadow tomography to low-body subspace identified by QCNN structure.
full rationale
The paper's derivation identifies that randomly initialized QCNNs are restricted to low-bodyness observables and that common benchmark datasets are classifiable from that subspace, then applies Pauli shadow tomography (a standard external technique) to simulate the action on that subspace. No step reduces by construction to a fitted parameter, self-definition, or load-bearing self-citation chain; the central simulability result follows from the QCNN architecture plus known shadow methods without renaming or smuggling ansatzes. The derivation is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- standard math Standard properties of Pauli shadows and low-body observables in quantum information
Forward citations
Cited by 6 Pith papers
-
Accelerating Inference for Multilayer Neural Networks with Quantum Computers
Quantum circuits for coherent multilayer neural network inference achieve quadratic to polylogarithmic speedups over classical methods depending on quantum data access models for inputs and weights.
-
Tensor network surrogate models for variational quantum computation
Tensor network simulations act as effective surrogate models for training QAOA on large 2D lattices, overcoming limits of parameter transfer from small instances and remaining classically feasible with moderate bond d...
-
Variational decision diagrams for quantum-inspired machine learning applications
The paper proposes variational decision diagrams (VDDs) for quantum state representation in QML and reports successful training without barren plateaus on transverse-field Ising and Heisenberg Hamiltonians.
-
Benchmarking quantum devices beyond classical capabilities
Modified Quantum Volume test uses restricted universal circuits to directly determine heavy outputs without exponential classical simulation cost.
-
Optimal quantum reservoir learning in proximity to universality
A tunable mixing parameter p in random quantum circuits controls the transition from classically simulable to expressive quantum reservoir dynamics via entanglement and nonstabilizer content.
-
Iterative Quantum Feature Maps
IQFMs iteratively constructs deep quantum feature maps from shallow circuits via classical augmentation weights and contrastive layer-wise training, outperforming QCNNs on noisy quantum data and matching classical neu...
Reference graph
Works this paper leans on
-
[1]
Proving that datasets commonly used used in the lit- erature as heuristic evidence for QCNNs are locally- easy. In particular, we will see that simple Pauli clas- sical shadows are sufficient to extract the relevant features of the data
-
[2]
Illustrating the classical simulation of QCNNs, as well as the scalability of the proposed techniques (we simulate and train QCNNs with upto 1024 qubits)
-
[3]
Studying how the performance of the simulated QCNNs depends on the amount of measurements used during the shadow tomographic procedure. The simulations presented in this section are based on two techniques which are constructed to only process the information encoded in low-bodyness measurements of the input states. In particular, they accomplish this goa...
-
[4]
Heisenberg bond-alternating XXX model The Hamiltonian of the Heisenberg Bond-Alternating XXX model reads H = n−1X i=1 Ji (XiXi+1 + YiYi+1 + ZiZi+1) , (6) where Xi, Yi and Zi denotethePaulioperatoractingonthe i-thqubit, andwhere Ji = J1(J2) ⩾ 0for i = even(odd). In the thermodynamic limitn → ∞, the ground state of this model presents a phase transition con...
-
[5]
Haldane Chain Next, we will consider the one-dimensional Haldane chain model [108] whose Hamiltonian reads H = −J n−2X i=1 ZiXi+1Zi+2 −h1 nX i=1 Xi −h2 n−1X i=1 XiXi+1 . (7) Here, we take J > 0. As shown in Fig. 4 the model’s phase diagram is characterized by the two ratiosh1/J and h2/J. Inspecting the model in the thermodynamic limit, one can find out th...
-
[6]
ANNNI model In this section, we consider the ANNNI model [109], which is described by the following Hamiltonian H = −J1 n−1X i=1 XiXi+1 − J2 n−2X i=1 XiXi+2 − B nX i=1 Zi . (8) The phases of this model are classified in terms of the di- mensionless ratios κ = −J2/J1 and h = B/J1. The phase diagram for this model is presented in Fig. 5. In particular, 9 we...
-
[7]
classical training and quantum deployment
Cluster Hamiltonian To finish, we consider the cluster model defined by the following Hamiltonian H = nX i=1 (Zi − J1XiXi+1 − J2Xi−1ZiXi+1) . (9) Note that here we employ closed boundary conditions so that Xn+1 ≡ X1 and X−1 ≡ Xn. As shown in Fig. 6, the phase transitions are delimited by the ratios betweenJ1 and J2. Similarly to the previous ANNNI model, ...
-
[8]
I. Goodfellow, Y. Bengio, and A. Courville,Deep Learning (MIT Press, 2016)
work page 2016
- [9]
-
[10]
Schmidhuber, Deep learning in neural networks: An overview, Neural networks61, 85 (2015)
J. Schmidhuber, Deep learning in neural networks: An overview, Neural networks61, 85 (2015)
work page 2015
-
[11]
M. M. Bronstein, J. Bruna, T. Cohen, and P. Veličković, Geometric deep learning: Grids, groups, graphs, geodesics, and gauges, arXiv preprint arXiv:2104.13478 (2021)
work page internal anchor Pith review Pith/arXiv arXiv 2021
-
[12]
J. Biamonte, P. Wittek, N. Pancotti, P. Rebentrost, N. Wiebe, and S. Lloyd, Quantum machine learning, Na- ture 549, 195 (2017)
work page 2017
-
[13]
M. Schuld and F. Petruccione,Supervised learning with quantum computers, Vol. 17 (Springer, 2018)
work page 2018
- [14]
- [15]
-
[16]
M. Cerezo, G. Verdon, H.-Y. Huang, L. Cincio, and P. J. Coles, Challenges and opportunities in quantum machine learning, Nature Computational Science 10.1038/s43588- 022-00311-3 (2022)
-
[17]
S. Endo, Z. Cai, S. C. Benjamin, and X. Yuan, Hybrid quantum-classical algorithms and quantum error mitiga- tion, Journal of the Physical Society of Japan90, 032001 (2021)
work page 2021
-
[18]
M. Larocca, S. Thanasilp, S. Wang, K. Sharma, J. Bia- monte, P. J. Coles, L. Cincio, J. R. McClean, Z. Holmes, and M. Cerezo, A review of barren plateaus in variational quantum computing, arXiv preprint arXiv:2405.00781 (2024)
-
[19]
J. R. McClean, S. Boixo, V. N. Smelyanskiy, R. Babbush, andH.Neven,Barrenplateausinquantumneuralnetwork training landscapes, Nature Communications9, 1 (2018)
work page 2018
- [20]
-
[21]
M. Cerezo and P. J. Coles, Higher order derivatives of quantum neural networks with barren plateaus, Quantum Science and Technology6, 035006 (2021)
work page 2021
-
[22]
A. Arrasmith, Z. Holmes, M. Cerezo, and P. J. Coles, Equivalence of quantum barren plateaus to cost concen- tration and narrow gorges, Quantum Science and Tech- nology 7, 045015 (2022)
work page 2022
- [23]
- [24]
-
[25]
L. Bittel and M. Kliesch, Training variational quantum al- gorithmsisNP-hard,Phys.Rev.Lett. 127,120502(2021)
work page 2021
-
[26]
E. Fontana, M. Cerezo, A. Arrasmith, I. Rungger, and P.J.Coles,Non-trivialsymmetriesinquantumlandscapes and their resilience to quantum noise, Quantum6, 804 (2022)
work page 2022
-
[27]
E. R. Anschuetz and B. T. Kiani, Beyond barren plateaus: Quantum variational algorithms are swamped with traps, Nature Communications13, 7760 (2022). 12
work page 2022
-
[28]
E. R. Anschuetz, Critical points in quantum generative models, International Conference on Learning Represen- tations (2022)
work page 2022
-
[29]
M. Larocca, P. Czarnik, K. Sharma, G. Muraleedharan, P. J. Coles, and M. Cerezo, Diagnosing Barren Plateaus with Tools from Quantum Optimal Control, Quantum6, 824 (2022)
work page 2022
-
[30]
P.Bermejo, B.Aizpurua,andR.Orús,Improvinggradient methods via coordinate transformations: Applications to quantum machine learning, Physical Review Research6, 023069 (2024)
work page 2024
-
[31]
E. Gil-Fuster, C. Gyurik, A. Pérez-Salinas, and V. Dun- jko, On the relation between trainability and dequan- tization of variational quantum learning models, arXiv preprint arXiv:2406.07072 (2024)
- [32]
- [33]
-
[34]
C. Zhao and X.-S. Gao, Analyzing the barren plateau phe- nomenon in training quantum neural networks with the ZX-calculus, Quantum5, 466 (2021)
work page 2021
- [35]
-
[36]
Q. Miao and T. Barthel, Isometric tensor network op- timization for extensive hamiltonians is free of barren plateaus, Physical Review A109, L050402 (2024)
work page 2024
-
[37]
L. Monbroussou, J. Landman, A. B. Grilo, R. Kukla, and E. Kashefi, Trainability and expressivity of hamming- weight preserving quantum circuits for machine learning, arXiv preprint arXiv:2309.15547 (2023)
-
[38]
S. Raj, I. Kerenidis, A. Shekhar, B. Wood, J. Dee, S. Chakrabarti, R. Chen, D. Herman, S. Hu, P. Minssen, et al., Quantum deep hedging, Quantum7, 1191 (2023)
work page 2023
-
[39]
E. Fontana, D. Herman, S. Chakrabarti, N. Kumar, R. Yalovetzky, J. Heredge, S. Hari Sureshbabu, and M. Pistoia, The adjoint is all you need: Characteriz- ing barren plateaus in quantum ansätze, arXiv preprint arXiv:2309.07902 (2023)
- [40]
-
[41]
M. T. West, J. Heredge, M. Sevior, and M. Usman, Prov- ably trainable rotationally equivariant quantum machine learning, PRX Quantum5, 030320 (2024)
work page 2024
- [42]
-
[43]
C.-Y. Park and N. Killoran, Hamiltonian variational ansatz without barren plateaus, Quantum8, 1239 (2024)
work page 2024
- [44]
-
[45]
M. Larocca, F. Sauvage, F. M. Sbahi, G. Verdon, P. J. Coles, and M. Cerezo, Group-invariant quantum machine learning, PRX Quantum3, 030341 (2022)
work page 2022
-
[46]
J. J. Meyer, M. Mularski, E. Gil-Fuster, A. A. Mele, F. Arzani, A. Wilms, and J. Eisert, Exploiting symmetry in variational quantum machine learning, PRX Quantum 4, 010328 (2023)
work page 2023
- [47]
- [48]
-
[49]
Q. T. Nguyen, L. Schatzki, P. Braccia, M. Ragone, P. J. Coles, F. Sauvage, M. Larocca, and M. Cerezo, Theory for equivariant quantum neural networks, PRX Quantum 5, 020328 (2024)
work page 2024
-
[50]
L. Schatzki, M. Larocca, Q. T. Nguyen, F. Sauvage, and M. Cerezo, Theoretical guarantees for permutation- equivariant quantum neural networks, npj Quantum In- formation 10, 12 (2024)
work page 2024
-
[51]
M. Kieferova, O. M. Carlos, and N. Wiebe, Quantum gen- erative training using rényi divergences, arXiv preprint arXiv:2106.09567 (2021)
- [52]
-
[53]
A. Letcher, S. Woerner, and C. Zoufal, Tight and effi- cient gradient bounds for parameterized quantum circuits, arXiv preprint arXiv:2309.12681 (2023)
-
[54]
M. S. Rudolph, J. Miller, D. Motlagh, J. Chen, A. Acharya, and A. Perdomo-Ortiz, Synergistic pretrain- ing of parametrized quantum circuits via tensor networks, Nature Communications14, 8367 (2023)
work page 2023
-
[55]
M. Cerezo, M. Larocca, D. García-Martín, N. L. Diaz, P. Braccia, E. Fontana, M. S. Rudolph, P. Bermejo, A. Ijaz, S. Thanasilp,et al., Does provable absence of bar- ren plateaus imply classical simulability? or, why we need to rethink variational quantum computing, arXiv preprint arXiv:2312.09121 (2023)
-
[56]
I. Cong, S. Choi, and M. D. Lukin, Quantum convolu- tional neural networks, Nature Physics15, 1273 (2019)
work page 2019
-
[57]
T. Hur, L. Kim, and D. K. Park, Quantum convolutional neural network for classical data classification, Quantum Machine Intelligence4, 3 (2022)
work page 2022
-
[58]
S. Oh, J. Choi, and J. Kim, A tutorial on quantum con- volutional neural networks (qcnn), in2020 International Conference on Information and Communication Technol- ogy Convergence (ICTC)(IEEE, 2020) pp. 236–239
work page 2020
- [59]
-
[60]
L.-H.Gong, J.-J.Pei, T.-F.Zhang,andN.-R.Zhou,Quan- tum convolutional neural network based on variational quantum circuits, Optics Communications 550, 129993 (2024)
work page 2024
-
[61]
J. Kim, J. Huh, and D. K. Park, Classical-to-quantum convolutional neural network transfer learning, Neuro- computing 555, 126643 (2023)
work page 2023
- [62]
- [63]
-
[64]
E. Ovalle-Magallanes, D. E. Alvarado-Carrillo, J. G. Avina-Cervantes, I. Cruz-Aceves, and J. Ruiz-Pinales, Quantum angle encoding with learnable rotation applied to quantum–classical convolutional neural networks, Ap- plied Soft Computing141, 110307 (2023)
work page 2023
-
[65]
S. Y. Chang, M. Grossi, B. Le Saux, and S. Vallecorsa, Approximately equivariant quantum neural network for p4m group symmetries in images, in2023 IEEE Interna- tional Conference on Quantum Computing and Engineer- ing (QCE), Vol. 01 (2023) pp. 229–235
work page 2023
-
[66]
F. Fan, Y. Shi, T. Guggemos, and X. X. Zhu, Hybrid quantum-classical convolutional neural net- work model for image classification, IEEE trans- actions on neural networks and learning systems 10.1109/TNNLS.2023.3312170 (2023)
- [67]
-
[68]
Y. Zeng, H. Wang, J. He, Q. Huang, and S. Chang, A multi-classification hybrid quantum neural network using an all-qubit multi-observable measurement strategy, En- tropy 24, 394 (2022)
work page 2022
-
[69]
A. Sebastianelli, D. A. Zaidenberg, D. Spiller, B. Le Saux, and S. L. Ullo, On circuit-based hybrid quantum neural networks for remote sensing imagery classification, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing15, 565 (2021)
work page 2021
-
[70]
A. Chalumuri, R. Kune, S. Kannan, and B. Manoj, Quantum–classical image processing for scene classifica- tion, IEEE Sensors Letters6, 1 (2022)
work page 2022
- [71]
-
[72]
Y. Matsumoto, R. Natsuaki, and A. Hirose, Full-learning rotational quaternion convolutional neural networks and confluence of differently represented data for polsar land classification, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing15, 2914 (2022)
work page 2022
-
[73]
S. Y. Chang, B. Le Saux, S. Vallecorsa, and M. Grossi, Quantum convolutional circuits for earth observation im- age classification, in IGARSS 2022-2022 IEEE Interna- tional Geoscience and Remote Sensing Symposium(IEEE,
work page 2022
-
[74]
Z. N. Aldoski and C. Koren, Impact of traffic sign diver- sityonautonomousvehicles: aliteraturereview,Periodica Polytechnica Transportation Engineering51, 338 (2023)
work page 2023
-
[75]
M. A. Khan, H. Park, and J. Chae, A lightweight convo- lutional neural network (cnn) architecture for traffic sign recognition in urban road networks, Electronics12, 1802 (2023)
work page 2023
-
[76]
S. Y.-C. Chen, T.-C. Wei, C. Zhang, H. Yu, and S. Yoo, Quantum convolutional neural networks for high energy physics data analysis, Phys. Rev. Res.4, 013231 (2022)
work page 2022
- [77]
-
[78]
A. Delgado, K. E. Hamilton, J.-R. Vlimant, D. Magano, Y. Omar, P. Bargassa, A. Francis, A. Gianelle, L. Sestini, D. Lucchesi,et al., Quantum computing for data analysis in high energy physics, arXiv preprint arXiv:2203.08805 (2022)
- [79]
- [80]
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.