QNAS: A Neural Architecture Search Framework for Accurate and Efficient Quantum Neural Networks
Pith reviewed 2026-05-10 18:25 UTC · model grok-4.3
The pith
QNAS finds compact accurate quantum neural networks by jointly optimizing error, runtime cost, and circuit cutting overhead.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
QNAS trains a shared parameter SuperCircuit and uses NSGA-II to optimize three objectives jointly: validation error, a runtime cost proxy measuring wall clock evaluation time, and the estimated number of subcircuits under a target qubit budget. It evaluates candidate HQNNs under a few epochs of training and discovers clear Pareto fronts that reveal tradeoffs between accuracy, efficiency, and cutting overhead. Across MNIST, Fashion-MNIST, and Iris benchmarks, embedding type and CNOT mode selection significantly impact both accuracy and efficiency, with angle-y embedding and sparse entangling patterns outperforming other configurations on image datasets, and amplitude embedding excelling on a
What carries the argument
The SuperCircuit with shared parameters, from which candidate architectures are sampled and scored by NSGA-II across accuracy, runtime proxy, and subcircuit count to account for cutting overhead.
If this is right
- On MNIST the best found architecture reaches 97.16 percent test accuracy with an 8-qubit 2-layer circuit.
- On Fashion-MNIST the best architecture reaches 87.38 percent accuracy with a 5-qubit 2-layer circuit.
- On Iris the best architecture reaches 100 percent validation accuracy with a 4-qubit 2-layer circuit.
- Angle-y embedding combined with sparse CNOT patterns improves both accuracy and efficiency on image datasets.
- Amplitude embedding outperforms other choices on tabular data such as Iris.
Where Pith is reading between the lines
- The design rules surfaced by the search, such as preferred embeddings, could be used directly by practitioners to construct circuits manually without rerunning the full optimizer.
- The same SuperCircuit-plus-NSGA-II approach could be applied to other quantum machine learning tasks like regression or generative modeling.
- Replacing the simple runtime proxy with a more detailed hardware noise model would allow the search to produce even more realistic architectures for specific devices.
- Users facing different qubit budgets can select different points along the Pareto front rather than a single fixed architecture.
Load-bearing premise
That performance estimates from only a few epochs of training on candidate architectures are reliable enough to rank them correctly, and that the runtime proxy and subcircuit count accurately predict costs on real NISQ devices.
What would settle it
Full training of a top architecture discovered by QNAS on a quantum simulator or hardware, followed by measurement of its final test accuracy and wall-clock runtime, showing large deviations from the few-epoch estimates and proxies used during search.
Figures
read the original abstract
Designing quantum neural networks (QNNs) that are both accurate and deployable on NISQ hardware is challenging. Handcrafted ansatze must balance expressivity, trainability, and resource use, while limited qubits often necessitate circuit cutting. Existing quantum architecture search methods primarily optimize accuracy while only heuristically controlling quantum and mostly ignore the exponential overhead of circuit cutting. We introduce QNAS, a neural architecture search framework that unifies hardware aware evaluation, multi objective optimization, and cutting overhead awareness for hybrid quantum classical neural networks (HQNNs). QNAS trains a shared parameter SuperCircuit and uses NSGA-II to optimize three objectives jointly: (i) validation error, (ii) a runtime cost proxy measuring wall clock evaluation time, and (iii) the estimated number of subcircuits under a target qubit budget. QNAS evaluates candidate HQNNs under a few epochs of training and discovers clear Pareto fronts that reveal tradeoffs between accuracy, efficiency, and cutting overhead. Across MNIST, Fashion-MNIST, and Iris benchmarks, we observe that embedding type and CNOT mode selection significantly impact both accuracy and efficiency, with angle-y embedding and sparse entangling patterns outperforming other configurations on image datasets, and amplitude embedding excelling on tabular data (Iris). On MNIST, the best architecture achieves 97.16% test accuracy with a compact 8 qubit, 2 layer circuit; on the more challenging Fashion-MNIST, 87.38% with a 5 qubit, 2 layer circuit; and on Iris, 100% validation accuracy with a 4 qubit, 2 layer circuit. QNAS surfaces these design insights automatically during search, guiding practitioners toward architectures that balance accuracy, resource efficiency, and practical deployability on current hardware.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces QNAS, a neural architecture search framework for hybrid quantum-classical neural networks. It trains a shared-parameter SuperCircuit once and applies NSGA-II to jointly optimize three objectives: validation error, a wall-clock runtime cost proxy, and the estimated number of subcircuits required under a target qubit budget to account for circuit cutting. The framework is evaluated on MNIST, Fashion-MNIST, and Iris, reporting test accuracies of 97.16%, 87.38%, and 100% respectively using compact circuits (e.g., 8-qubit 2-layer on MNIST), while automatically surfacing insights on embedding types and entangling patterns.
Significance. If the evaluation protocol holds, QNAS would provide a practical advance in automating the design of deployable QNNs by incorporating hardware constraints and cutting overhead directly into the search, moving beyond accuracy-only optimization. The multi-objective Pareto-front discovery and the reported design insights on angle-y embeddings and sparse CNOT patterns for image data could guide future hardware-efficient quantum ML work.
major comments (3)
- [Experimental Evaluation] Experimental Protocol: The reported benchmark accuracies and design insights lack details on chosen baselines, the number of independent runs or statistical significance tests, and the full hyperparameter protocol for SuperCircuit training and NSGA-II. This makes it difficult to verify whether the claimed performance and insights are robust.
- [Architecture Scoring] Candidate Evaluation: Architectures are scored after only a few epochs of training on the shared SuperCircuit before feeding validation error into NSGA-II. Given known QNN challenges such as barren plateaus, initialization sensitivity, and slow convergence, short runs may produce unstable rankings; without an ablation correlating few-epoch scores to fully converged accuracies, the Pareto fronts and final accuracies rest on an unverified proxy.
- [Objective Functions] Objective Definitions: The runtime cost proxy (wall-clock time) and subcircuit count estimation are load-bearing for the hardware-awareness claim, yet their precise formulations, assumptions about compilation, and validation against real NISQ metrics are not fully specified or ablated in the methods.
minor comments (2)
- [Results] Include explicit visualizations of the Pareto fronts for all three datasets to illustrate the accuracy-efficiency-cutting trade-offs.
- [Method] Formalize the three objective functions with equations and clarify the SuperCircuit parameter-sharing mechanism for reproducibility.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed feedback. We address each major comment below and indicate the revisions we will incorporate into the manuscript.
read point-by-point responses
-
Referee: [Experimental Evaluation] Experimental Protocol: The reported benchmark accuracies and design insights lack details on chosen baselines, the number of independent runs or statistical significance tests, and the full hyperparameter protocol for SuperCircuit training and NSGA-II. This makes it difficult to verify whether the claimed performance and insights are robust.
Authors: We agree that these experimental details are necessary to allow independent verification of the reported accuracies and design insights. In the revised manuscript we will add a dedicated experimental protocol subsection that specifies the baselines employed, the number of independent runs, any statistical significance testing performed, and the complete hyperparameter configuration used for SuperCircuit training and NSGA-II. revision: yes
-
Referee: [Architecture Scoring] Candidate Evaluation: Architectures are scored after only a few epochs of training on the shared SuperCircuit before feeding validation error into NSGA-II. Given known QNN challenges such as barren plateaus, initialization sensitivity, and slow convergence, short runs may produce unstable rankings; without an ablation correlating few-epoch scores to fully converged accuracies, the Pareto fronts and final accuracies rest on an unverified proxy.
Authors: We acknowledge the risk that short training runs could yield unstable rankings given known QNN training difficulties. The few-epoch proxy was adopted to keep the overall search computationally feasible. We will add an ablation study in the revision that correlates few-epoch validation errors with fully converged accuracies on a representative subset of architectures, thereby providing empirical support for the proxy used to generate the Pareto fronts. revision: partial
-
Referee: [Objective Functions] Objective Definitions: The runtime cost proxy (wall-clock time) and subcircuit count estimation are load-bearing for the hardware-awareness claim, yet their precise formulations, assumptions about compilation, and validation against real NISQ metrics are not fully specified or ablated in the methods.
Authors: We will expand the methods section to supply the exact mathematical formulations of both the wall-clock runtime proxy and the subcircuit-count estimator, together with the compilation and qubit-mapping assumptions underlying each. We will also include a brief discussion of how these proxies relate to simulated NISQ metrics and note the practical limitations on real-device validation. revision: yes
Circularity Check
No significant circularity detected in derivation chain
full rationale
The QNAS framework trains a shared-parameter SuperCircuit and applies standard NSGA-II to jointly optimize three externally defined objectives: validation error (measured via short training runs on candidate architectures), a wall-clock runtime proxy, and estimated subcircuit count under qubit budget. These objectives are independent metrics drawn from standard ML and quantum hardware considerations, not derived from or equivalent to the search outputs by construction. No self-definitional steps, fitted parameters renamed as predictions, or load-bearing self-citations appear in the described chain; the discovered Pareto fronts and benchmark accuracies (e.g., 97.16% on MNIST) result from the optimization process rather than tautological reduction to inputs. The method is self-contained against external benchmarks like NSGA-II and standard QNN training.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
QNAS trains a shared parameter SuperCircuit and uses NSGA-II to optimize three objectives jointly: validation error, a runtime cost proxy, and the estimated number of subcircuits under a target qubit budget
-
IndisputableMonolith/Foundation/ArithmeticFromLogic.leanembed_strictMono_of_one_lt unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
few epoch training reliably ranks candidate architectures with >0.9 correlation to final performance
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Forward citations
Cited by 1 Pith paper
-
Q-PhotoNAS: Hybrid Quantum Neural Architecture Search Framework on Photonic Devices
Q-PhotoNAS applies genetic algorithm search to jointly optimize classical preprocessing, phase encoding, and photonic circuit structure for hybrid quantum-classical models, reporting 99.44% and 98.78% accuracy on Digi...
Reference graph
Works this paper leans on
- [1]
-
[2]
Variational quantum algorithms,
M. Cerezoet al., “Variational quantum algorithms,”Nat. Rev. Phys., 2021
work page 2021
-
[3]
The power of quantum neural networks,
A. Abbaset al., “The power of quantum neural networks,”Nat. Comput. Sci., 2021
work page 2021
-
[4]
Continuous-variable quantum neural networks,
N. Killoranet al., “Continuous-variable quantum neural networks,”Phys. Rev. Res., 2019
work page 2019
-
[5]
Quantum graph neural networks,
G. Verdonet al., “Quantum graph neural networks,”arXiv, 2019
work page 2019
-
[6]
A survey on quantum machine learning: Current trends, challenges, opportunities, and the road ahead,
K. Zamanet al., “A survey on quantum machine learning: Current trends, challenges, opportunities, and the road ahead,”arXiv:2310.10315, 2023
-
[7]
Quantum convolutional neural network based on varia- tional quantum circuits,
L.-H. Gonget al., “Quantum convolutional neural network based on varia- tional quantum circuits,”Opt. Commun., 2024
work page 2024
-
[8]
Quantum neural networks: Concepts, applications, and challenges,
Y . Kwaket al., “Quantum neural networks: Concepts, applications, and challenges,” inICUFN, 2021
work page 2021
-
[9]
A comparative analysis of hybrid-quantum classical neural networks,
K. Zamanet al., “A comparative analysis of hybrid-quantum classical neural networks,” inCSCE, 2024
work page 2024
-
[10]
Next-generation quantum neural networks: Enhancing efficiency, security, and privacy,
N. Innanet al., “Next-generation quantum neural networks: Enhancing efficiency, security, and privacy,” inIOLTS, 2025
work page 2025
-
[11]
Computational advantage in hybrid quantum neural networks: Myth or reality?
M. Kashif, A. Marchisio, and M. Shafique, “Computational advantage in hybrid quantum neural networks: Myth or reality?” inDAC, 2025
work page 2025
-
[12]
Barren plateaus in quantum neural network training landscapes,
J. R. McCleanet al., “Barren plateaus in quantum neural network training landscapes,”Nat. Commun., 2018
work page 2018
-
[13]
Barren plateaus in variational quantum computing,
M. Laroccaet al., “Barren plateaus in variational quantum computing,”Nat. Rev. Phys., 2025
work page 2025
-
[14]
Quantum architecture search: A survey,
D. Martyniuket al., “Quantum architecture search: A survey,”arXiv, 2024
work page 2024
-
[15]
Noise-induced barren plateaus in variational quantum algorithms,
S. Wanget al., “Noise-induced barren plateaus in variational quantum algorithms,”Nat. Commun., 2021
work page 2021
-
[16]
A comparative analysis and noise robustness evaluation in quantum neural networks,
T. Ahmedet al., “A comparative analysis and noise robustness evaluation in quantum neural networks,”Scientific Reports, 2025
work page 2025
-
[17]
An initialization strategy for addressing barren plateaus in parametrized quantum circuits,
E. Grantet al., “An initialization strategy for addressing barren plateaus in parametrized quantum circuits,”Quantum, 2019
work page 2019
-
[18]
Neural architecture search: Insights from 1000 papers,
C. Whiteet al., “Neural architecture search: Insights from 1000 papers,” arXiv, 2023
work page 2023
-
[19]
Neural architecture search with reinforcement learning,
B. Zoph and Q. V . Le, “Neural architecture search with reinforcement learning,”arXiv, 2017
work page 2017
-
[20]
Multi-objective hardware aware neural architecture search using hardware cost diversity,
N. Sinhaet al., “Multi-objective hardware aware neural architecture search using hardware cost diversity,” inCVPRW, 2024
work page 2024
-
[21]
Continuous evolution for efficient quantum architecture search,
Q. Maet al., “Continuous evolution for efficient quantum architecture search,”EPJ Quantum Technol., 2024
work page 2024
-
[22]
Noisy hqnns: A comprehensive analysis of noise robust- ness in hybrid quantum neural networks,
T. Ahmedet al., “Noisy hqnns: A comprehensive analysis of noise robust- ness in hybrid quantum neural networks,” inIJCNN, 2025
work page 2025
-
[23]
Designing robust quantum neural networks via optimized circuit metrics,
W. El Maouakiet al., “Designing robust quantum neural networks via optimized circuit metrics,”Advanced Quantum Technologies, 2025
work page 2025
-
[24]
Understanding and estimating the execution time of quantum circuits,
N. Ma and H. Li, “Understanding and estimating the execution time of quantum circuits,”arXiv, 2025
work page 2025
-
[25]
Is circuit depth accurate for comparing quantum circuit runtimes?
M. Trembaet al., “Is circuit depth accurate for comparing quantum circuit runtimes?”arXiv, 2025
work page 2025
-
[26]
Cutting is all you need: Execution of large-scale quantum neural networks on limited-qubit devices,
A. Marchisioet al., “Cutting is all you need: Execution of large-scale quantum neural networks on limited-qubit devices,” inQAI
-
[27]
Studying the impact of quantum-specific hyperparameters on hybrid quantum-classical neural networks,
K. Zamanet al., “Studying the impact of quantum-specific hyperparameters on hybrid quantum-classical neural networks,” inCSCE, 2024
work page 2024
-
[28]
A comprehensive survey of neural architecture search: Challenges and solutions,
P. Renet al., “A comprehensive survey of neural architecture search: Challenges and solutions,”ACM Comput. Surv., 2021
work page 2021
-
[29]
A survey on evolutionary neural architecture search,
Y . Liuet al., “A survey on evolutionary neural architecture search,”IEEE Trans. Neural Netw. Learn. Syst., 2021
work page 2021
-
[30]
Neural architecture search: A survey,
T. Elskenet al., “Neural architecture search: A survey,”JMLR, 2019
work page 2019
-
[31]
Single path one-shot neural architecture search with uniform sampling,
Z. Guoet al., “Single path one-shot neural architecture search with uniform sampling,” inECCV, 2020
work page 2020
-
[32]
HW-NAS-Bench: Hardware-aware neural architecture search benchmark,
C. Liet al., “HW-NAS-Bench: Hardware-aware neural architecture search benchmark,”arXiv, 2025
work page 2025
-
[33]
A. Marchisioet al., “Nascaps: A framework for neural architecture search to optimize the accuracy and hardware efficiency of convolutional capsule networks,” inICCAD, 2020
work page 2020
-
[34]
FAQNAS: FLOPs-aware hybrid quantum neural architec- ture search using genetic algorithm,
M. Kashifet al., “FAQNAS: FLOPs-aware hybrid quantum neural architec- ture search using genetic algorithm,”arXiv, 2025
work page 2025
-
[35]
M. Kashif, A. Marchisio, and M. Shafique, “Closing the loop: Resource- aware hybrid nas guided by analytical and hardware-calibrated quantum cost modeling,” inDAC, 2026
work page 2026
-
[36]
Gat-qnn: Genetic algorithm-based training of hybrid quantum neural networks,
T. Ahmedet al., “Gat-qnn: Genetic algorithm-based training of hybrid quantum neural networks,” inIJCNN, 2026
work page 2026
-
[37]
A fast and elitist multiobjective genetic algorithm: NSGA- II,
K. Debet al., “A fast and elitist multiobjective genetic algorithm: NSGA- II,”IEEE Trans. Evol. Comput., 2002
work page 2002
-
[38]
Neural optimization: Understanding trade-offs with pareto theory,
F. Pallasdieset al., “Neural optimization: Understanding trade-offs with pareto theory,”Curr. Opin. Neurobiol., 2021
work page 2021
-
[39]
Evolutionary computation and convergence to a pareto front,
D. A. V . Veldhuizenet al., “Evolutionary computation and convergence to a pareto front,” inGP, 1998
work page 1998
-
[40]
An improved pareto front modeling algorithm for large- scale many-objective optimization,
A. Panichella, “An improved pareto front modeling algorithm for large- scale many-objective optimization,” inGECCO, 2022
work page 2022
-
[41]
Hierarchy ranking method for multimodal multiobjective optimization with local pareto fronts,
W. Liet al., “Hierarchy ranking method for multimodal multiobjective optimization with local pareto fronts,”IEEE Trans. Evol. Comput., 2022
work page 2022
-
[42]
Practical and efficient quantum circuit synthesis and transpiling with reinforcement learning,
D. Kremeret al., “Practical and efficient quantum circuit synthesis and transpiling with reinforcement learning,”arXiv, 2024
work page 2024
-
[43]
Qasmtrans: A QASM quantum transpiler framework for NISQ devices,
F. Huaet al., “Qasmtrans: A QASM quantum transpiler framework for NISQ devices,” inSC, 2023
work page 2023
-
[44]
The MNIST database of handwritten digit images for machine learning research,
L. Deng, “The MNIST database of handwritten digit images for machine learning research,”IEEE Signal Process. Mag., 2012
work page 2012
-
[45]
Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms,
H. Xiaoet al., “Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms,”arXiv, 2017
work page 2017
-
[46]
R. A. Fisher, “Iris,” UCI Machine Learning Repository, 1936, DOI: https://doi.org/10.24432/C56C76
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.