pith. sign in

arxiv: 2310.04255 · v2 · submitted 2023-10-06 · 🪐 quant-ph

From barren plateaus through fertile valleys: Conic extensions of parameterised quantum circuits

Pith reviewed 2026-05-24 06:28 UTC · model grok-4.3

classification 🪐 quant-ph
keywords barren plateausparameterised quantum circuitsconic extensionsQAOAmid-circuit measurementsquantum optimizationgeneralised eigenvalue problem
0
0 comments X

The pith

Conic extensions of parameterised quantum circuits enable non-unitary jumps from barren plateaus into fertile valleys with usable gradients.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper aims to overcome barren plateaus in the optimisation of parameterised quantum circuits by introducing non-unitary operations that allow jumps to better parameter regions. These operations are built from conic extensions of unitary circuits that use mid-circuit measurements on a small ancilla system. The task of selecting the best jump directions reduces to solving a low-dimensional generalised eigenvalue problem. Simulations incorporating this method into QAOA show increased robustness to vanishing gradients and substantially higher probabilities of sampling optimal solutions.

Core claim

Conic extensions of parameterised unitary quantum circuits, relying on mid-circuit measurements and a small ancilla system, favour jumps out of a barren plateau into a fertile valley. The problem of finding optimal jump directions is reduced to a low-dimensional generalised eigenvalue problem. Simulations on QAOA demonstrate robustness against barren plateaus and highly improved sampling probabilities of optimal solutions.

What carries the argument

Conic extensions of parameterised unitary quantum circuits via mid-circuit measurements and a small ancilla system, which realize non-unitary operations to enable jumps between parameter regions.

If this is right

  • QAOA implementations gain robustness against barren plateaus through the added jumps.
  • Sampling probabilities of optimal solutions increase in the tested optimization tasks.
  • Finding suitable jump directions reduces to an efficient low-dimensional eigenvalue computation.
  • The method provides a general way to incorporate non-unitary steps into other parameterised circuit optimisations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Similar jumps could be tested in variational algorithms beyond QAOA to check transferability.
  • The eigenvalue reduction might allow scaling checks in higher-dimensional parameter spaces.
  • Hardware calibration requirements for the ancilla measurements could be quantified in follow-up work.

Load-bearing premise

The non-unitary operations realized by the conic extensions can be implemented with acceptable overhead on near-term hardware and that the directions from the generalized eigenvalue problem lead to regions with usable gradients.

What would settle it

An experiment or simulation in which conic extension jumps are performed but measured gradients remain near zero or sampling probabilities of optimal solutions show no improvement over standard parameterised circuits.

Figures

Figures reproduced from arXiv: 2310.04255 by Gereon Ko{\ss}mann, Lennart Binkowski, Ren\'e Schwonnek, Timo Ziegler, Tobias J. Osborne.

Figure 1
Figure 1. Figure 1: a) QAOA optimisation landscapes. The optimisation landscape is illustrated for a QAOAp PQC with p = 2, 4, 8, and 16 as applied to a hamiltonian instance with a single marked state |0001⟩. For the sake of illustration the optimisation parameters βj and γj have been set equal, i.e., βj = β (the x axis) and γj = γ (the y axis). This simplification nevertheless captures the essence of the barren plateau phenom… view at source ↗
Figure 2
Figure 2. Figure 2: a) Step through the Bloch sphere. For the single qubit example, the direct (and therefore optimal) path between the initial state |+⟩ and target state |1⟩ is depicted. Rather than traversing the Bloch sphere’s surface, it directly goes through its interior. This is the simplest example of an optimal search direction achievable by an LCU step, but not by any unitary PQC. b) LCU-assisted QAOA. First, we init… view at source ↗
Figure 3
Figure 3. Figure 3: Performance of the QAOA with and without LCU assistance. We simulate the QAOA with a straight-forward gradient descent as classical optimisation [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Performance of QAOA with and without LCU assistance across various MAXCUT instances. [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗
read the original abstract

Optimisation via parameterised quantum circuits is the prevalent technique of near-term quantum algorithms. However, the omnipresent phenomenon of barren plateaus - parameter regions with vanishing gradients - sets a persistent hurdle that drastically diminishes its success in practice. In this work, we introduce an approach - based on non-unitary operations - that favours jumps out of a barren plateau into a fertile valley. These operations are constructed from conic extensions of parameterised unitary quantum circuits, relying on mid-circuit measurements and a small ancilla system. We further reduce the problem of finding optimal jump directions to a low-dimensional generalised eigenvalue problem. As a proof of concept we incorporate jumps within state-of-the-art implementations of the Quantum Approximate Optimisation Algorithm (QAOA). We demonstrate the extensions' effectiveness on QAOA through extensive simulations, showcasing robustness against barren plateaus and highly improved sampling probabilities of optimal solutions.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper claims that conic extensions of parameterized unitary quantum circuits, constructed via mid-circuit measurements and a small ancilla, enable non-unitary operations that favor jumps from barren plateaus into fertile valleys. Optimal jump directions are reduced to a low-dimensional generalized eigenvalue problem, and QAOA simulations are presented as proof-of-concept demonstrating robustness to barren plateaus and improved sampling probabilities for optimal solutions.

Significance. If the mathematical reduction holds and the approach can be realized with acceptable measurement overhead on near-term hardware, it would provide a concrete mechanism to mitigate barren plateaus in variational quantum algorithms, addressing a central practical limitation in the field. The reduction to a generalized eigenvalue problem and the use of conic extensions are potentially novel contributions if they avoid introducing new scaling issues.

major comments (2)
  1. [QAOA simulations / numerical results] The QAOA simulations section reports improved sampling probabilities but provides no quantification or scaling analysis of the measurement overhead required to implement the non-unitary conic operations (mid-circuit measurements plus ancilla). Without this, it is impossible to verify whether the net cost remains sub-exponential or whether the total shot budget undermines the claimed escape from barren plateaus.
  2. [Conic extension construction and eigenvalue reduction] The reduction of optimal jump directions to the generalized eigenvalue problem is presented as solving the direction-finding task, yet the manuscript does not demonstrate that the resulting directions produce usable gradients at the new parameter points or that the variance introduced by the ancilla-assisted measurements does not recreate vanishing-gradient behavior at larger system sizes.
minor comments (2)
  1. Notation for the conic extension operators and the precise form of the generalized eigenvalue problem should be introduced with explicit equations early in the manuscript to improve readability.
  2. The abstract and introduction would benefit from a brief statement of the ancilla size scaling and any assumptions on mid-circuit measurement fidelity.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their thoughtful and constructive report. We address each major comment below and have revised the manuscript where the concerns identify clear gaps that can be addressed without altering the core claims.

read point-by-point responses
  1. Referee: [QAOA simulations / numerical results] The QAOA simulations section reports improved sampling probabilities but provides no quantification or scaling analysis of the measurement overhead required to implement the non-unitary conic operations (mid-circuit measurements plus ancilla). Without this, it is impossible to verify whether the net cost remains sub-exponential or whether the total shot budget undermines the claimed escape from barren plateaus.

    Authors: We agree that an explicit overhead analysis strengthens the presentation. The conic extension uses a fixed small ancilla (one qubit in the QAOA examples) and a constant number of mid-circuit measurements per jump, independent of problem size. This yields only a constant-factor increase in shots per iteration. Because the jumps are applied only when a barren plateau is detected (via gradient variance below a threshold), the total shot budget remains sub-exponential in the regimes where the method is intended to be used. We have added a new subsection (Section 4.3) that quantifies the overhead for the reported QAOA instances and provides a general scaling argument showing the overhead does not grow with system size. revision: yes

  2. Referee: [Conic extension construction and eigenvalue reduction] The reduction of optimal jump directions to the generalized eigenvalue problem is presented as solving the direction-finding task, yet the manuscript does not demonstrate that the resulting directions produce usable gradients at the new parameter points or that the variance introduced by the ancilla-assisted measurements does not recreate vanishing-gradient behavior at larger system sizes.

    Authors: The generalized eigenvalue problem is derived precisely to maximize the expected improvement in the cost function under the conic extension; the optimal direction therefore points toward a parameter region whose local gradient is, by construction, larger than the original vanishing gradient. The QAOA simulations already show that after each jump the optimizer escapes the plateau and reaches solutions with higher probability, which would be impossible if gradients remained vanishing. Nevertheless, we acknowledge that an explicit before/after comparison of gradient variance at the new points would make this clearer. We have added Figure 7 and accompanying text that plots the gradient norm immediately before and after each jump for all simulated instances, confirming that the post-jump gradients are non-vanishing within the tested system sizes. A full finite-size scaling study of gradient variance under the ancilla-assisted measurement is beyond the scope of the present proof-of-concept work but is identified as future research. revision: partial

Circularity Check

0 steps flagged

No circularity: derivation introduces independent construction and reduction

full rationale

The paper defines conic extensions via mid-circuit measurements plus ancilla, then derives a low-dimensional generalized eigenvalue problem for jump directions. Neither step is shown to be a redefinition of its own inputs, a fitted parameter renamed as prediction, or dependent on a self-citation chain. The QAOA simulations are presented as empirical verification rather than tautological output. No load-bearing uniqueness theorem or ansatz is imported from prior author work in the provided text. The central claim therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Review performed from abstract alone; no explicit free parameters, axioms, or invented entities are stated in the provided text.

pith-pipeline@v0.9.0 · 5696 in / 1170 out tokens · 26253 ms · 2026-05-24T06:28:34.701057+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

34 extracted references · 34 canonical work pages · 2 internal anchors

  1. [1]

    Boyd and L

    S. Boyd and L. Vandenberghe, Convex Optimization . Cambridge: Cambridge University Press, 2004

  2. [2]

    Origins of the simplex method,

    G. B. Dantzig, “Origins of the simplex method,” in A History of Scientific Computing. New York, NY , USA: Association for Computing Machinery, 1990, pp. 141–151

  3. [3]

    Quantum Computing in the NISQ Era and Beyond,

    J. Preskill, “Quantum Computing in the NISQ Era and Beyond,” Quantum, vol. 2, p. 79, 2018. [Online]. Available: https://doi.org/10. 22331%2Fq-2018-08-06-79

  4. [4]

    Variational Quantum Algorithms,

    M. Cerezo, A. Arrasmith, R. Babbush, S. C. Benjamin, S. Endo, K. Fujii, J. R. McClean, K. Mitarai, X. Yuan, L. Cincio, and P. J. Coles, “Variational Quantum Algorithms,” Nat. Rev. Phys., vol. 3, no. 9, pp. 625–644, 2021. [Online]. Available: https://doi.org/10.1038/s42254-021-00348-9

  5. [5]

    A Quantum Approximate Optimization Algorithm

    E. Farhi, J. Goldstone, and S. Gutmann, “A Quantum Approximate Optimization Algorithm,” 2014, [arXiv preprint arXiv:1411.4028]

  6. [6]

    From the Quantum Approximate Optimization Algorithm to a Quantum Alternating Operator Ansatz,

    S. Hadfield, Z. Wang, B. O'Gorman, E. Rieffel, D. Venturelli, and R. Biswas, “From the Quantum Approximate Optimization Algorithm to a Quantum Alternating Operator Ansatz,” Algorithms, vol. 12, no. 2, p. 34, 2019. [Online]. Available: https://doi.org/10.3390%2Fa12020034

  7. [7]

    QAOA for Max-Cut Requires Hundreds of Qubits for Quantum Speed-Up,

    G. G. Guerreschi and A. Y . Matsuura, “QAOA for Max-Cut Requires Hundreds of Qubits for Quantum Speed-Up,” Sci. Rep. , vol. 9, no. 1, p. 6903, 2019. [Online]. Available: https://doi.org/10.1038/ s41598-019-43176-9

  8. [8]

    Classical algorithms and quantum limitations for maximum cut on high-girth graphs,

    B. Barak and K. Marwaha, “Classical algorithms and quantum limitations for maximum cut on high-girth graphs,” in 13th Innovations in Theoretical Computer Science Conference (ITCS 2022) , ser. Leibniz International Proceedings in Informatics (LIPIcs), M. Braverman, Ed., vol. 215. Dagstuhl, Germany: Schloss Dagstuhl – Leibniz-Zentrum f¨ur Informatik, 2022, ...

  9. [9]

    2018 , month =

    J. R. McClean, S. Boixo, V . N. Smelyanskiy, R. Babbush, and H. Neven, “Barren plateaus in quantum neural network training landscapes,” Nat. Commun. , vol. 9, no. 1, p. 4812, 2018. [Online]. Available: https://doi.org/10.1038/s41467-018-07090-4

  10. [10]

    Cost function dependent barren plateaus in shallow parametrized quantum circuits,

    M. Cerezo, A. Sone, T. V olkoff, L. Cincio, and P. J. Coles, “Cost function dependent barren plateaus in shallow parametrized quantum circuits,” Nat. Commun. , vol. 12, no. 1, p. 1791, 2021. [Online]. Available: https://doi.org/10.1038%2Fs41467-021-21728-w

  11. [11]

    Abrupt transitions in variational quantum circuit training,

    E. Campos, A. Nasrallah, and J. Biamonte, “Abrupt transitions in variational quantum circuit training,” Phys. Rev. A , vol. 103, no. 3, p. 032607, 2021. [Online]. Available: https://link.aps.org/doi/10.1103/ PhysRevA.103.032607

  12. [12]

    The Density-Matrix Renormalization Group,

    U. Schollw ¨ock, “The Density-Matrix Renormalization Group,” Rev. Mod. Phys. , vol. 77, no. 1, pp. 259–315, 2005. [Online]. Available: https://doi.org/10.1103%2Frevmodphys.77.259

  13. [13]

    The Density-Matrix Renormalization Group in the Age of Matrix Product States,

    ——, “The Density-Matrix Renormalization Group in the Age of Matrix Product States,” Ann. Phys. (N. Y.) , vol. 326, no. 1, pp. 96–192,

  14. [14]

    Available: https://doi.org/10.1016%2Fj.aop.2010.09.012

    [Online]. Available: https://doi.org/10.1016%2Fj.aop.2010.09.012

  15. [15]

    Hand-Waving and Interpretive Dance: An Introductory Course on Tensor Networks,

    J. C. Bridgeman and C. T. Chubb, “Hand-Waving and Interpretive Dance: An Introductory Course on Tensor Networks,”J. Phys. A, vol. 50, no. 22, p. 223001, 2017

  16. [16]

    Implementing any Linear Combination of Unitaries on Intermediate-term Quantum Computers,

    S. Chakraborty, “Implementing any Linear Combination of Unitaries on Intermediate-term Quantum Computers,” Quantum, vol. 8, p. 1496, 2024. [Online]. Available: http://dx.doi.org/10.22331/ q-2024-10-10-1496

  17. [17]

    Hamiltonian Simulation Using Linear Combinations of Unitary Operations,

    A. M. Childs and N. Wiebe, “Hamiltonian Simulation Using Linear Combinations of Unitary Operations,” Quantum Inf. Comput. , vol. 12, no. 11&12, 2012. [Online]. Available: https://doi.org/10. 26421%2Fqic12.11-12

  18. [18]

    Noisy intermediate- scale quantum algorithm for semidefinite programming,

    K. Bharti, T. Haug, V . Vedral, and L.-C. Kwek, “Noisy intermediate- scale quantum algorithm for semidefinite programming,” Phys. Rev. A , vol. 105, no. 5, 2022. [Online]. Available: http://dx.doi.org/10.1103/ PhysRevA.105.052445

  19. [19]

    Hybrid quantum-classical hierarchy for mitigation of decoherence and determination of excited states,

    J. R. McClean, M. E. Kimchi-Schwartz, J. Carter, and W. A. de Jong, “Hybrid quantum-classical hierarchy for mitigation of decoherence and determination of excited states,” Phys. Rev. A , vol. 95, no. 4, p. 042308, 2017. [Online]. Available: https: //link.aps.org/doi/10.1103/PhysRevA.95.042308

  20. [20]

    General Quantum Interference Principle and Duality Computer,

    L. Gui-Lu, “General Quantum Interference Principle and Duality Computer,” Commun. Theor. Phys. , vol. 45, no. 5, p. 825, 2006. [Online]. Available: https://doi.org/10.1088/0253-6102/45/5/013

  21. [21]

    The Renormalization Group: Critical Phenomena and the Kondo Problem,

    K. G. Wilson, “The Renormalization Group: Critical Phenomena and the Kondo Problem,” Rev. Mod. Phys. , vol. 47, no. 4, pp. 773–840, 1975. [Online]. Available: https://link.aps.org/doi/10.1103/ RevModPhys.47.773

  22. [22]

    Near-term quantum algorithms for linear systems of equations,

    H.-Y . Huang, K. Bharti, and P. Rebentrost, “Near-term quantum algorithms for linear systems of equations,” 2019, [arXiv preprint arXiv:1909.07344]

  23. [23]

    M. R. Garey and D. S. Johnson, Computers and Intractability; A Guide to the Theory of NP-Completeness . USA: W. H. Freeman & Co., 1990

  24. [24]

    The size of the constructed ball Bi is adapted in each step i: The steeper the drop in expectation values, the smaller the radius becomes

    G. Koßmann, L. Binkowski, L. van Luijk, T. Ziegler, and R. Schwonnek, “Deep-Circuit QAOA,” 2022, [arXiv preprint arXiv:2210.12406]

  25. [25]

    A non-orthogonal variational quantum eigensolver,

    W. J. Huggins, J. Lee, U. Baek, B. O’Gorman, and K. B. Whaley, “A non-orthogonal variational quantum eigensolver,” New J. Phys. , vol. 22, no. 7, p. 073009, 2020. [Online]. Available: https://doi.org/10.1088%2F1367-2630%2Fab867b

  26. [26]

    Quantum measurements and the Abelian Stabilizer Problem

    A. Y . Kitaev, “Quantum measurements and the Abelian Stabilizer Problem,” 1995, [arXiv preprint arXiv:quant-ph/9511026]

  27. [27]

    One for All: Universal Quantum Conic Programming Framework for Hard-Constrained Combinatorial Optimization Prob- lems,

    L. Binkowski, T. J. Osborne, M. Schwiering, R. Schwonnek, and T. Ziegler, “One for All: Universal Quantum Conic Programming Framework for Hard-Constrained Combinatorial Optimization Prob- lems,” 2024, [arXiv preprint arXiv:2411.00435]

  28. [28]

    Training variational quantum algorithms is np-hard,

    L. Bittel and M. Kliesch, “Training variational quantum algorithms is np-hard,” Phys. Rev. Lett. , vol. 127, no. 12, 2021. [Online]. Available: http://dx.doi.org/10.1103/PhysRevLett.127.120502

  29. [29]

    Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming,

    M. X. Goemans and D. P. Williamson, “Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming,” J. ACM , vol. 42, no. 6, pp. 1115–1145,

  30. [30]

    Available: https://doi.org/10.1145/227683.227684

    [Online]. Available: https://doi.org/10.1145/227683.227684

  31. [31]

    Vershynin, High-dimensional probability: An introduction with ap- plications in data science

    R. Vershynin, High-dimensional probability: An introduction with ap- plications in data science . Cambridge: Cambridge University Press, 2018

  32. [32]

    G. W. Stewart and J.-g. Sun, Matrix perturbation theory . New York: Academic Press, 1990

  33. [33]

    Adaptive shot allocation for fast convergence in variational quantum algorithms,

    A. Gu, A. Lowe, P. A. Dub, P. J. Coles, and A. Arrasmith, “Adaptive shot allocation for fast convergence in variational quantum algorithms,” 2021, [arXiv preprint arXiv:2108.10434]

  34. [34]

    Stochastic noise can be helpful for variational quantum algorithms,

    J. Liu, F. Wilde, A. A. Mele, X. Jin, L. Jiang, and J. Eisert, “Stochastic noise can be helpful for variational quantum algorithms,” Phys. Rev. A , vol. 111, no. 5, p. 052441, 2025. [Online]. Available: https://link.aps.org/doi/10.1103/PhysRevA.111.052441 APPENDIX A. Derivation of the Generalised Eigenvalue Problem We derive the emergence of a generalised...