From barren plateaus through fertile valleys: Conic extensions of parameterised quantum circuits
Pith reviewed 2026-05-24 06:28 UTC · model grok-4.3
The pith
Conic extensions of parameterised quantum circuits enable non-unitary jumps from barren plateaus into fertile valleys with usable gradients.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Conic extensions of parameterised unitary quantum circuits, relying on mid-circuit measurements and a small ancilla system, favour jumps out of a barren plateau into a fertile valley. The problem of finding optimal jump directions is reduced to a low-dimensional generalised eigenvalue problem. Simulations on QAOA demonstrate robustness against barren plateaus and highly improved sampling probabilities of optimal solutions.
What carries the argument
Conic extensions of parameterised unitary quantum circuits via mid-circuit measurements and a small ancilla system, which realize non-unitary operations to enable jumps between parameter regions.
If this is right
- QAOA implementations gain robustness against barren plateaus through the added jumps.
- Sampling probabilities of optimal solutions increase in the tested optimization tasks.
- Finding suitable jump directions reduces to an efficient low-dimensional eigenvalue computation.
- The method provides a general way to incorporate non-unitary steps into other parameterised circuit optimisations.
Where Pith is reading between the lines
- Similar jumps could be tested in variational algorithms beyond QAOA to check transferability.
- The eigenvalue reduction might allow scaling checks in higher-dimensional parameter spaces.
- Hardware calibration requirements for the ancilla measurements could be quantified in follow-up work.
Load-bearing premise
The non-unitary operations realized by the conic extensions can be implemented with acceptable overhead on near-term hardware and that the directions from the generalized eigenvalue problem lead to regions with usable gradients.
What would settle it
An experiment or simulation in which conic extension jumps are performed but measured gradients remain near zero or sampling probabilities of optimal solutions show no improvement over standard parameterised circuits.
Figures
read the original abstract
Optimisation via parameterised quantum circuits is the prevalent technique of near-term quantum algorithms. However, the omnipresent phenomenon of barren plateaus - parameter regions with vanishing gradients - sets a persistent hurdle that drastically diminishes its success in practice. In this work, we introduce an approach - based on non-unitary operations - that favours jumps out of a barren plateau into a fertile valley. These operations are constructed from conic extensions of parameterised unitary quantum circuits, relying on mid-circuit measurements and a small ancilla system. We further reduce the problem of finding optimal jump directions to a low-dimensional generalised eigenvalue problem. As a proof of concept we incorporate jumps within state-of-the-art implementations of the Quantum Approximate Optimisation Algorithm (QAOA). We demonstrate the extensions' effectiveness on QAOA through extensive simulations, showcasing robustness against barren plateaus and highly improved sampling probabilities of optimal solutions.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims that conic extensions of parameterized unitary quantum circuits, constructed via mid-circuit measurements and a small ancilla, enable non-unitary operations that favor jumps from barren plateaus into fertile valleys. Optimal jump directions are reduced to a low-dimensional generalized eigenvalue problem, and QAOA simulations are presented as proof-of-concept demonstrating robustness to barren plateaus and improved sampling probabilities for optimal solutions.
Significance. If the mathematical reduction holds and the approach can be realized with acceptable measurement overhead on near-term hardware, it would provide a concrete mechanism to mitigate barren plateaus in variational quantum algorithms, addressing a central practical limitation in the field. The reduction to a generalized eigenvalue problem and the use of conic extensions are potentially novel contributions if they avoid introducing new scaling issues.
major comments (2)
- [QAOA simulations / numerical results] The QAOA simulations section reports improved sampling probabilities but provides no quantification or scaling analysis of the measurement overhead required to implement the non-unitary conic operations (mid-circuit measurements plus ancilla). Without this, it is impossible to verify whether the net cost remains sub-exponential or whether the total shot budget undermines the claimed escape from barren plateaus.
- [Conic extension construction and eigenvalue reduction] The reduction of optimal jump directions to the generalized eigenvalue problem is presented as solving the direction-finding task, yet the manuscript does not demonstrate that the resulting directions produce usable gradients at the new parameter points or that the variance introduced by the ancilla-assisted measurements does not recreate vanishing-gradient behavior at larger system sizes.
minor comments (2)
- Notation for the conic extension operators and the precise form of the generalized eigenvalue problem should be introduced with explicit equations early in the manuscript to improve readability.
- The abstract and introduction would benefit from a brief statement of the ancilla size scaling and any assumptions on mid-circuit measurement fidelity.
Simulated Author's Rebuttal
We thank the referee for their thoughtful and constructive report. We address each major comment below and have revised the manuscript where the concerns identify clear gaps that can be addressed without altering the core claims.
read point-by-point responses
-
Referee: [QAOA simulations / numerical results] The QAOA simulations section reports improved sampling probabilities but provides no quantification or scaling analysis of the measurement overhead required to implement the non-unitary conic operations (mid-circuit measurements plus ancilla). Without this, it is impossible to verify whether the net cost remains sub-exponential or whether the total shot budget undermines the claimed escape from barren plateaus.
Authors: We agree that an explicit overhead analysis strengthens the presentation. The conic extension uses a fixed small ancilla (one qubit in the QAOA examples) and a constant number of mid-circuit measurements per jump, independent of problem size. This yields only a constant-factor increase in shots per iteration. Because the jumps are applied only when a barren plateau is detected (via gradient variance below a threshold), the total shot budget remains sub-exponential in the regimes where the method is intended to be used. We have added a new subsection (Section 4.3) that quantifies the overhead for the reported QAOA instances and provides a general scaling argument showing the overhead does not grow with system size. revision: yes
-
Referee: [Conic extension construction and eigenvalue reduction] The reduction of optimal jump directions to the generalized eigenvalue problem is presented as solving the direction-finding task, yet the manuscript does not demonstrate that the resulting directions produce usable gradients at the new parameter points or that the variance introduced by the ancilla-assisted measurements does not recreate vanishing-gradient behavior at larger system sizes.
Authors: The generalized eigenvalue problem is derived precisely to maximize the expected improvement in the cost function under the conic extension; the optimal direction therefore points toward a parameter region whose local gradient is, by construction, larger than the original vanishing gradient. The QAOA simulations already show that after each jump the optimizer escapes the plateau and reaches solutions with higher probability, which would be impossible if gradients remained vanishing. Nevertheless, we acknowledge that an explicit before/after comparison of gradient variance at the new points would make this clearer. We have added Figure 7 and accompanying text that plots the gradient norm immediately before and after each jump for all simulated instances, confirming that the post-jump gradients are non-vanishing within the tested system sizes. A full finite-size scaling study of gradient variance under the ancilla-assisted measurement is beyond the scope of the present proof-of-concept work but is identified as future research. revision: partial
Circularity Check
No circularity: derivation introduces independent construction and reduction
full rationale
The paper defines conic extensions via mid-circuit measurements plus ancilla, then derives a low-dimensional generalized eigenvalue problem for jump directions. Neither step is shown to be a redefinition of its own inputs, a fitted parameter renamed as prediction, or dependent on a self-citation chain. The QAOA simulations are presented as empirical verification rather than tautological output. No load-bearing uniqueness theorem or ansatz is imported from prior author work in the provided text. The central claim therefore remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
S. Boyd and L. Vandenberghe, Convex Optimization . Cambridge: Cambridge University Press, 2004
work page 2004
-
[2]
Origins of the simplex method,
G. B. Dantzig, “Origins of the simplex method,” in A History of Scientific Computing. New York, NY , USA: Association for Computing Machinery, 1990, pp. 141–151
work page 1990
-
[3]
Quantum Computing in the NISQ Era and Beyond,
J. Preskill, “Quantum Computing in the NISQ Era and Beyond,” Quantum, vol. 2, p. 79, 2018. [Online]. Available: https://doi.org/10. 22331%2Fq-2018-08-06-79
work page 2018
-
[4]
Variational Quantum Algorithms,
M. Cerezo, A. Arrasmith, R. Babbush, S. C. Benjamin, S. Endo, K. Fujii, J. R. McClean, K. Mitarai, X. Yuan, L. Cincio, and P. J. Coles, “Variational Quantum Algorithms,” Nat. Rev. Phys., vol. 3, no. 9, pp. 625–644, 2021. [Online]. Available: https://doi.org/10.1038/s42254-021-00348-9
-
[5]
A Quantum Approximate Optimization Algorithm
E. Farhi, J. Goldstone, and S. Gutmann, “A Quantum Approximate Optimization Algorithm,” 2014, [arXiv preprint arXiv:1411.4028]
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[6]
From the Quantum Approximate Optimization Algorithm to a Quantum Alternating Operator Ansatz,
S. Hadfield, Z. Wang, B. O'Gorman, E. Rieffel, D. Venturelli, and R. Biswas, “From the Quantum Approximate Optimization Algorithm to a Quantum Alternating Operator Ansatz,” Algorithms, vol. 12, no. 2, p. 34, 2019. [Online]. Available: https://doi.org/10.3390%2Fa12020034
work page 2019
-
[7]
QAOA for Max-Cut Requires Hundreds of Qubits for Quantum Speed-Up,
G. G. Guerreschi and A. Y . Matsuura, “QAOA for Max-Cut Requires Hundreds of Qubits for Quantum Speed-Up,” Sci. Rep. , vol. 9, no. 1, p. 6903, 2019. [Online]. Available: https://doi.org/10.1038/ s41598-019-43176-9
work page 2019
-
[8]
Classical algorithms and quantum limitations for maximum cut on high-girth graphs,
B. Barak and K. Marwaha, “Classical algorithms and quantum limitations for maximum cut on high-girth graphs,” in 13th Innovations in Theoretical Computer Science Conference (ITCS 2022) , ser. Leibniz International Proceedings in Informatics (LIPIcs), M. Braverman, Ed., vol. 215. Dagstuhl, Germany: Schloss Dagstuhl – Leibniz-Zentrum f¨ur Informatik, 2022, ...
-
[9]
J. R. McClean, S. Boixo, V . N. Smelyanskiy, R. Babbush, and H. Neven, “Barren plateaus in quantum neural network training landscapes,” Nat. Commun. , vol. 9, no. 1, p. 4812, 2018. [Online]. Available: https://doi.org/10.1038/s41467-018-07090-4
-
[10]
Cost function dependent barren plateaus in shallow parametrized quantum circuits,
M. Cerezo, A. Sone, T. V olkoff, L. Cincio, and P. J. Coles, “Cost function dependent barren plateaus in shallow parametrized quantum circuits,” Nat. Commun. , vol. 12, no. 1, p. 1791, 2021. [Online]. Available: https://doi.org/10.1038%2Fs41467-021-21728-w
work page 2021
-
[11]
Abrupt transitions in variational quantum circuit training,
E. Campos, A. Nasrallah, and J. Biamonte, “Abrupt transitions in variational quantum circuit training,” Phys. Rev. A , vol. 103, no. 3, p. 032607, 2021. [Online]. Available: https://link.aps.org/doi/10.1103/ PhysRevA.103.032607
work page 2021
-
[12]
The Density-Matrix Renormalization Group,
U. Schollw ¨ock, “The Density-Matrix Renormalization Group,” Rev. Mod. Phys. , vol. 77, no. 1, pp. 259–315, 2005. [Online]. Available: https://doi.org/10.1103%2Frevmodphys.77.259
work page 2005
-
[13]
The Density-Matrix Renormalization Group in the Age of Matrix Product States,
——, “The Density-Matrix Renormalization Group in the Age of Matrix Product States,” Ann. Phys. (N. Y.) , vol. 326, no. 1, pp. 96–192,
-
[14]
Available: https://doi.org/10.1016%2Fj.aop.2010.09.012
[Online]. Available: https://doi.org/10.1016%2Fj.aop.2010.09.012
work page 2010
-
[15]
Hand-Waving and Interpretive Dance: An Introductory Course on Tensor Networks,
J. C. Bridgeman and C. T. Chubb, “Hand-Waving and Interpretive Dance: An Introductory Course on Tensor Networks,”J. Phys. A, vol. 50, no. 22, p. 223001, 2017
work page 2017
-
[16]
Implementing any Linear Combination of Unitaries on Intermediate-term Quantum Computers,
S. Chakraborty, “Implementing any Linear Combination of Unitaries on Intermediate-term Quantum Computers,” Quantum, vol. 8, p. 1496, 2024. [Online]. Available: http://dx.doi.org/10.22331/ q-2024-10-10-1496
work page 2024
-
[17]
Hamiltonian Simulation Using Linear Combinations of Unitary Operations,
A. M. Childs and N. Wiebe, “Hamiltonian Simulation Using Linear Combinations of Unitary Operations,” Quantum Inf. Comput. , vol. 12, no. 11&12, 2012. [Online]. Available: https://doi.org/10. 26421%2Fqic12.11-12
work page 2012
-
[18]
Noisy intermediate- scale quantum algorithm for semidefinite programming,
K. Bharti, T. Haug, V . Vedral, and L.-C. Kwek, “Noisy intermediate- scale quantum algorithm for semidefinite programming,” Phys. Rev. A , vol. 105, no. 5, 2022. [Online]. Available: http://dx.doi.org/10.1103/ PhysRevA.105.052445
work page 2022
-
[19]
J. R. McClean, M. E. Kimchi-Schwartz, J. Carter, and W. A. de Jong, “Hybrid quantum-classical hierarchy for mitigation of decoherence and determination of excited states,” Phys. Rev. A , vol. 95, no. 4, p. 042308, 2017. [Online]. Available: https: //link.aps.org/doi/10.1103/PhysRevA.95.042308
-
[20]
General Quantum Interference Principle and Duality Computer,
L. Gui-Lu, “General Quantum Interference Principle and Duality Computer,” Commun. Theor. Phys. , vol. 45, no. 5, p. 825, 2006. [Online]. Available: https://doi.org/10.1088/0253-6102/45/5/013
-
[21]
The Renormalization Group: Critical Phenomena and the Kondo Problem,
K. G. Wilson, “The Renormalization Group: Critical Phenomena and the Kondo Problem,” Rev. Mod. Phys. , vol. 47, no. 4, pp. 773–840, 1975. [Online]. Available: https://link.aps.org/doi/10.1103/ RevModPhys.47.773
work page 1975
-
[22]
Near-term quantum algorithms for linear systems of equations,
H.-Y . Huang, K. Bharti, and P. Rebentrost, “Near-term quantum algorithms for linear systems of equations,” 2019, [arXiv preprint arXiv:1909.07344]
-
[23]
M. R. Garey and D. S. Johnson, Computers and Intractability; A Guide to the Theory of NP-Completeness . USA: W. H. Freeman & Co., 1990
work page 1990
-
[24]
G. Koßmann, L. Binkowski, L. van Luijk, T. Ziegler, and R. Schwonnek, “Deep-Circuit QAOA,” 2022, [arXiv preprint arXiv:2210.12406]
-
[25]
A non-orthogonal variational quantum eigensolver,
W. J. Huggins, J. Lee, U. Baek, B. O’Gorman, and K. B. Whaley, “A non-orthogonal variational quantum eigensolver,” New J. Phys. , vol. 22, no. 7, p. 073009, 2020. [Online]. Available: https://doi.org/10.1088%2F1367-2630%2Fab867b
work page 2020
-
[26]
Quantum measurements and the Abelian Stabilizer Problem
A. Y . Kitaev, “Quantum measurements and the Abelian Stabilizer Problem,” 1995, [arXiv preprint arXiv:quant-ph/9511026]
work page internal anchor Pith review Pith/arXiv arXiv 1995
-
[27]
L. Binkowski, T. J. Osborne, M. Schwiering, R. Schwonnek, and T. Ziegler, “One for All: Universal Quantum Conic Programming Framework for Hard-Constrained Combinatorial Optimization Prob- lems,” 2024, [arXiv preprint arXiv:2411.00435]
-
[28]
Training variational quantum algorithms is np-hard,
L. Bittel and M. Kliesch, “Training variational quantum algorithms is np-hard,” Phys. Rev. Lett. , vol. 127, no. 12, 2021. [Online]. Available: http://dx.doi.org/10.1103/PhysRevLett.127.120502
-
[29]
M. X. Goemans and D. P. Williamson, “Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming,” J. ACM , vol. 42, no. 6, pp. 1115–1145,
-
[30]
Available: https://doi.org/10.1145/227683.227684
[Online]. Available: https://doi.org/10.1145/227683.227684
-
[31]
Vershynin, High-dimensional probability: An introduction with ap- plications in data science
R. Vershynin, High-dimensional probability: An introduction with ap- plications in data science . Cambridge: Cambridge University Press, 2018
work page 2018
-
[32]
G. W. Stewart and J.-g. Sun, Matrix perturbation theory . New York: Academic Press, 1990
work page 1990
-
[33]
Adaptive shot allocation for fast convergence in variational quantum algorithms,
A. Gu, A. Lowe, P. A. Dub, P. J. Coles, and A. Arrasmith, “Adaptive shot allocation for fast convergence in variational quantum algorithms,” 2021, [arXiv preprint arXiv:2108.10434]
-
[34]
Stochastic noise can be helpful for variational quantum algorithms,
J. Liu, F. Wilde, A. A. Mele, X. Jin, L. Jiang, and J. Eisert, “Stochastic noise can be helpful for variational quantum algorithms,” Phys. Rev. A , vol. 111, no. 5, p. 052441, 2025. [Online]. Available: https://link.aps.org/doi/10.1103/PhysRevA.111.052441 APPENDIX A. Derivation of the Generalised Eigenvalue Problem We derive the emergence of a generalised...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.