A four-player potential game for barren-plateau-aware quantum ansatz design

Rubén Darío Guerrero

arXiv: 2604.21955 · v1 · submitted 2026-04-23 · quant-ph · cs.MA

Pith reviewed 2026-05-09 21:42 UTC · model grok-4.3

keywords: quantum ansatz design · potential game · Nash equilibrium · barren plateau · variational quantum eigensolver · circuit optimization · non-stabilizerness · hardware cost

The pith

Quantum ansatz design cast as a four-player potential game improves balance across trainability, non-stabilizerness, performance, and cost.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper frames the design of parameterized quantum circuits as a four-player potential game on a circuit directed acyclic graph (DAG). Each player pursues one objective: trainability, to sidestep barren plateaus; non-stabilizerness, to keep non-Clifford resources in the circuit; task performance, such as solving MaxCut or finding molecular energies; and hardware cost, measured by gate count and depth. Moves consist of appending, removing, retyping, or rewiring gates, and the search continues until the block-coordinate ε-Nash residual indicates that no player can unilaterally improve its score. In the reported experiments, the resulting circuits reach the highest mean potential on three four-qubit hardware layouts and give a more compact LiH circuit that retains 97.7% of the correlation energy.
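The search loop described above can be illustrated with a toy model. The sketch below is not the paper's implementation: it replaces the circuit DAG with an ordered gate list, uses made-up per-gate scores, and assigns append to the task player because a flat gate list has no wiring to rewire. All names (`SCORES`, `GATE_COST`, `MAX_GATES`, `block_coordinate_search`) are hypothetical. It only shows the mechanics: each player in turn plays its best move from its own restricted action set, and the loop stops once no player can raise the shared potential by more than ε.

```python
from typing import Callable, Dict, List, Tuple

Circuit = Tuple[str, ...]   # toy stand-in for a circuit DAG: an ordered gate list
MAX_GATES = 4               # hypothetical cap so the toy state space stays finite

# Hypothetical per-gate scores (trainability, magic, task); not from the paper.
SCORES: Dict[str, Tuple[float, float, float]] = {
    "H":  (0.1, 0.0, 0.4),
    "RX": (0.5, 0.3, 0.3),
    "T":  (0.0, 1.0, 0.1),
    "CZ": (0.2, 0.0, 0.9),
}
GATE_COST = 0.5             # hypothetical flat hardware cost per gate


def potential(c: Circuit, w=(1.0, 0.3, 1.0, 1.0)) -> float:
    """Shared potential: weighted sum of the four toy objectives."""
    train = sum(SCORES[g][0] for g in c)
    magic = sum(SCORES[g][1] for g in c)
    task = sum(SCORES[g][2] for g in c)
    return w[0] * train + w[1] * magic + w[2] * task - w[3] * GATE_COST * len(c)


def retype(c: Circuit, new: str) -> List[Circuit]:
    return [c[:i] + (new,) + c[i + 1:] for i in range(len(c)) if c[i] != new]


def append(c: Circuit, new: str) -> List[Circuit]:
    return [c + (new,)] if len(c) < MAX_GATES else []


def remove(c: Circuit) -> List[Circuit]:
    return [c[:i] + c[i + 1:] for i in range(len(c))]


# Each player owns a restricted action set (this assignment is illustrative).
ACTIONS: Dict[str, Callable[[Circuit], List[Circuit]]] = {
    "trainability": lambda c: retype(c, "RX"),
    "non_stabilizerness": lambda c: retype(c, "T"),
    "task": lambda c: append(c, "CZ"),
    "hardware": remove,
}


def best_response(c: Circuit, player: str) -> Tuple[Circuit, float]:
    """Best unilateral move for one player and the potential gain it yields."""
    best, gain = c, 0.0
    for cand in ACTIONS[player](c):
        g = potential(cand) - potential(c)
        if g > gain:
            best, gain = cand, g
    return best, gain


def nash_residual(c: Circuit) -> float:
    """Largest gain any single player can still obtain by moving alone."""
    return max(best_response(c, p)[1] for p in ACTIONS)


def block_coordinate_search(c: Circuit, eps: float = 1e-9, max_sweeps: int = 100):
    history = [potential(c)]
    for _ in range(max_sweeps):
        for player in ACTIONS:              # one block (player) at a time
            c, gain = best_response(c, player)
            if gain > 0:
                history.append(potential(c))
        if nash_residual(c) <= eps:         # no player can improve alone
            break
    return c, history


start: Circuit = ("H", "H")
final, history = block_coordinate_search(start)
```

Because every accepted move strictly raises the shared potential and the toy state space is finite, the loop cannot cycle. That is the property a potential game buys over a general multi-player game.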

Core claim

The author reports that treating circuit design as a four-player potential game with objectives for trainability, non-stabilizerness, task performance, and hardware cost, and searching via block-coordinate updates until an ε-Nash equilibrium is reached, yields ansatzes with the highest mean potential on three four-qubit topologies (not statistically significant over five seeds, p ≥ 0.22) and a 48-operation, depth-25 circuit for LiH/STO-3G that retains 97.7% of the correlation energy when seeded from a 58-gate Givens-doubles ansatz.

What carries the argument

A four-player potential game on the circuit DAG, with restricted action sets for append/remove/retype/rewire and a block-coordinate ε-Nash residual that certifies no unilateral improvement is possible.
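For orientation, the two ingredients named here can be written down in their textbook form. The first line is the standard exact-potential-game condition of Monderer and Shapley. The second is one plausible form of a block-coordinate residual over per-player action sets $\mathcal{A}_i(G)$ on a circuit DAG $G$. The source text does not give the paper's exact expression or normalization, so the symbols $u_i$, $\mathcal{A}_i$, and the form of $\delta_\text{Nash}$ below are assumptions.

```latex
\begin{align}
u_i(a_i', a_{-i}) - u_i(a_i, a_{-i})
  &= \Phi(a_i', a_{-i}) - \Phi(a_i, a_{-i})
  \qquad \forall\, i,\ a_i,\ a_i',\ a_{-i}, \\
\delta_\text{Nash}(G)
  &= \max_{i \in \{1,\dots,4\}} \ \max_{G' \in \mathcal{A}_i(G)}
     \max\bigl\{0,\ u_i(G') - u_i(G)\bigr\}, \\
G \text{ is an } \varepsilon\text{-Nash state}
  &\iff \delta_\text{Nash}(G) \le \varepsilon .
\end{align}
```

Under the first condition, any unilateral improvement for a player raises $\Phi$ by the same amount, so $\delta_\text{Nash}(G) = 0$ means $G$ is a local maximum of $\Phi$ with respect to every player's restricted moves.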

Load-bearing premise

The four player objectives and the block-coordinate ε-Nash residual on the circuit DAG accurately reflect the key trade-offs without leaving out critical interactions that would alter the reported frontiers and hardware performance.

What would settle it

A demonstration that simulated annealing or another baseline consistently matches or exceeds the Nash search's mean potential on the same four-qubit topologies, or that the LiH circuit loses more than 2.3% correlation energy, would falsify the reported advantage.
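The 2.3% threshold follows directly from the reported retention. Assuming the usual definition of correlation energy relative to the Hartree-Fock energy $E_\text{HF}$ and the exact energy $E_\text{FCI}$ in the same basis (the source does not state which reference energies the paper uses), the retained fraction $R$ for a circuit energy $E_\text{circ}$ is:

```latex
\begin{align}
R &= \frac{E_\text{HF} - E_\text{circ}}{E_\text{HF} - E_\text{FCI}}, \\
1 - R &= \frac{E_\text{circ} - E_\text{FCI}}{E_\text{HF} - E_\text{FCI}}
       = 1 - 0.977 = 0.023 .
\end{align}
```

A rerun in which the lost fraction $1 - R$ exceeds 0.023 would therefore fall short of the reported 97.7% retention.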

Figures

Figures reproduced from arXiv: 2604.21955 by Rubén Darío Guerrero.

**Figure 1.** Four-player potential game on circuit DAGs. The shared state is a PQC DAG whose nodes carry gate types and parameters. Four players own disjoint action sets: f1 (trainability) retypes a gate to enlarge the dynamical Lie algebra, f2 (non-stabilizerness) retypes into non-Clifford primitives, f3 (task) rewires a two-qubit gate, f4 (hardware) removes a gate. For example, starting from a two-gate H2 circuit on …

**Figure 2.** Pareto frontier for MaxCut on K4. Each point is a Nash-equilibrium circuit at a distinct weight corner (w1, w2); sixteen corners are shown with w3 fixed. The frontier spans from a Clifford endpoint (non-stabilizerness M2/n = 0, ⟨H⟩ = 4.00) to a non-Clifford endpoint (0.48, 3.30). An interior Pareto-knee solution at (0.076, 3.939) is reached by (w1, w2) = (1, 0.3). Results are shown for n = 4 qubits only …

**Figure 3.** Head-to-head comparison at matched budget across three hardware topologies. Bars show five-seed means of the scalar potential Φ; error bars are 95% bootstrap confidence intervals on the mean. Nash achieves the highest mean on every topology with ΔΦ = +0.09 (heavy-hex), +0.15 (2 × 2 grid), and +0.13 (Rydberg). Per-topology paired Wilcoxon tests do not reject the null on five seeds (see …)

**Figure 4.** Scaling and chemistry summary. (a) TFIM ground-state relative error vs. qubit count n for warm-started (QAOA p = 1) and cold-started Nash; five seeds per size; shaded bands are 95% bootstrap confidence intervals on the mean. (b) The same data plotted as a function of Nash outer iteration. (c) Per-iteration wall clock grows approximately linearly at ∼ 4.7 s/qubit. Inset: δ_Nash at final iteration is bimodal …
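The Figure 3 and Figure 4 captions refer to 95% bootstrap confidence intervals on a five-seed mean. A minimal sketch of one common variant, the percentile bootstrap, is given below. Which bootstrap variant the paper uses is not stated in the source, and the per-seed values in `phi` are hypothetical placeholders, not the paper's data.

```python
import random
from statistics import mean
from typing import List, Tuple


def bootstrap_ci_mean(values: List[float], n_resamples: int = 10_000,
                      level: float = 0.95, seed: int = 0) -> Tuple[float, float]:
    """Percentile bootstrap confidence interval for the sample mean."""
    rng = random.Random(seed)              # fixed seed for reproducibility
    n = len(values)
    # Resample with replacement, record the mean of each resample, then sort.
    means = sorted(mean(rng.choices(values, k=n)) for _ in range(n_resamples))
    lo_idx = int((1 - level) / 2 * n_resamples)
    hi_idx = int((1 + level) / 2 * n_resamples) - 1
    return means[lo_idx], means[hi_idx]


# Hypothetical per-seed potentials for one topology (NOT the paper's data).
phi = [3.6, 3.9, 4.1, 3.8, 4.1]
lo, hi = bootstrap_ci_mean(phi)
```

With only five values per topology the resampled means take few distinct values, so such intervals are coarse and should be read alongside the non-significant Wilcoxon results.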

Original abstract

We cast the design of parameterized quantum circuits as a four-player potential game whose state is a circuit directed acyclic graph (DAG) and whose players encode trainability, non-stabilizerness, task performance, and hardware cost. Per-player restricted action sets factorize the move space into append, remove, retype, and rewire operations; a block-coordinate $\varepsilon$-Nash residual $\delta_\text{Nash}$ certifies that no single player can improve unilaterally. A single weight sweep on MaxCut $K_4$ traces a Pareto frontier from a Clifford endpoint $(M_2/n,\langle H\rangle)=(0,4.00)$ to a non-Clifford endpoint $(0.48,3.30)$. On three four-qubit hardware topologies (heavy-hex, $2\times 2$ grid, Rydberg all-to-all), Nash search achieves the highest mean potential; on the $2\times 2$ grid Nash reaches the theoretical ceiling $\Phi_\text{max}=4.10$ on two of five seeds while the simulated-annealing baseline does so on one; paired Wilcoxon tests over five seeds cannot reject the null on any single topology ($p\ge 0.22$). On LiH/STO-3G, seeding Nash from a 58-gate Givens-doubles ansatz produces a 48-operation, depth-25 circuit retaining $97.7\%$ of the correlation energy while simultaneously reducing gate count, increasing non-stabilizerness, and controlling trainability. The framework is complementary to energy-only searches such as ADAPT-VQE and k-UpCCGSD, which reach chemical accuracy with fewer operations but do not optimize the other three axes.
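The Pareto-frontier language in the abstract can be made concrete with a small non-dominated filter. The sketch below assumes both coordinates, $M_2/n$ and $\langle H\rangle$, are treated as "larger is better" (an assumption consistent with the two endpoints trading one for the other). The three reported points come from the abstract and the Figure 2 caption; the fourth point is hypothetical and is included only to show a dominated circuit being filtered out.

```python
from typing import List, Tuple

Point = Tuple[float, float]   # (M2/n, <H>), both assumed "larger is better"


def pareto_front(points: List[Point]) -> List[Point]:
    """Keep the points that no other point matches or beats on both axes."""
    def dominated(p: Point) -> bool:
        return any(q != p and q[0] >= p[0] and q[1] >= p[1] for q in points)
    return [p for p in points if not dominated(p)]


# Clifford endpoint, Pareto knee, and non-Clifford endpoint as reported.
reported = [(0.0, 4.00), (0.076, 3.939), (0.48, 3.30)]
hypothetical = (0.05, 3.50)   # made-up point, dominated by the knee (0.076, 3.939)

front = pareto_front(reported + [hypothetical])
```

In a weight sweep, each weight corner yields one equilibrium circuit, and a filter of this kind is one way to decide which of those circuits belong on the plotted frontier.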

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

The four-player game on circuit DAGs is a fresh multi-objective angle, but the hardware superiority claims rest on five seeds with non-significant tests.

The paper frames ansatz design as a four-player potential game on a circuit DAG, with players handling trainability, non-stabilizerness, task performance, and hardware cost through restricted moves: append, remove, retype, and rewire. A block-coordinate ε-Nash residual stops the search when no player can improve alone. This setup differs from energy-only methods such as ADAPT-VQE or k-UpCCGSD because it explicitly balances the four axes instead of optimizing energy alone.

Referee Report

3 major / 2 minor

Summary. The manuscript casts parameterized quantum circuit design as a four-player potential game on a circuit DAG, with players encoding trainability, non-stabilizerness, task performance, and hardware cost. Restricted actions (append/remove/retype/rewire) and a block-coordinate ε-Nash residual are used to search for equilibria. A weight sweep produces a Pareto frontier on MaxCut K4; on three 4-qubit topologies Nash search reports the highest mean potential (reaching Φ_max=4.10 on 2/5 seeds for the 2×2 grid); a single LiH/STO-3G run yields a 48-gate circuit retaining 97.7% correlation energy while improving the other axes.

Significance. If the multi-objective framework and its empirical outcomes hold, the work supplies a principled game-theoretic method for balancing barren-plateau avoidance, expressivity, accuracy, and hardware cost in ansatz design. The explicit Pareto trace from Clifford to non-Clifford regimes and the complementarity to energy-only methods such as ADAPT-VQE are potentially valuable for NISQ-era circuit optimization.

major comments (3)

[Abstract / hardware results] Abstract and hardware-topology results: the claim that Nash search achieves the highest mean potential on heavy-hex, 2×2 grid, and Rydberg topologies is based on only five random seeds per topology, with paired Wilcoxon tests yielding p≥0.22 on every topology. This sample size is insufficient to support a superiority conclusion; the observed differences are consistent with stochastic variation in the DAG search process itself.
[LiH/STO-3G experiment] LiH experiment: the 97.7% correlation-energy retention, gate-count reduction, and simultaneous improvement on the other three axes are reported from a single run seeded from a 58-gate Givens-doubles ansatz. No error bars, multiple independent seeds, or statistical comparison to the baseline are provided, so the result cannot be assessed for robustness.
[Method / potential game definition] Potential-function construction: the four player objectives and the overall potential Φ are described at a high level but lack explicit formulas, normalization details, and the precise definition of the block-coordinate ε-Nash residual δ_Nash. Without these, independent verification of the reported Nash residuals and Pareto frontier is not possible.
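The first major comment can be sharpened with a short calculation. For an exact paired Wilcoxon signed-rank test with n untied, nonzero differences, each of the 2^n sign patterns is equally likely under the null, so the smallest achievable two-sided p-value is 2/2^n. The sketch below enumerates the sign patterns to confirm this; the function name `exact_two_sided_p` is illustrative, and it assumes the exact (not normal-approximation) test, which the source does not specify.

```python
from itertools import product


def exact_two_sided_p(w_plus_obs: int, n: int) -> float:
    """Exact two-sided Wilcoxon signed-rank p-value for n untied, nonzero
    paired differences, given the observed sum of positive ranks."""
    ranks = range(1, n + 1)
    w_max = sum(ranks)                               # n * (n + 1) / 2
    # Distance of the observed statistic from the nearer extreme (0 or w_max).
    obs_tail = min(w_plus_obs, w_max - w_plus_obs)
    count = 0
    total = 0
    for signs in product((0, 1), repeat=n):          # all 2**n sign patterns
        w_plus = sum(r for r, s in zip(ranks, signs) if s)
        if min(w_plus, w_max - w_plus) <= obs_tail:  # at least as extreme
            count += 1
        total += 1
    return count / total


# Most extreme outcome with five seeds: every difference favors one method.
p_min_5 = exact_two_sided_p(15, 5)   # 2 / 32
p_min_6 = exact_two_sided_p(21, 6)   # 2 / 64
```

With five seeds the floor is 0.0625, so under these assumptions no outcome could reach the conventional 0.05 level; six paired seeds is the minimum at which a two-sided rejection becomes possible at all.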

minor comments (2)

[Hardware results] The theoretical ceiling Φ_max=4.10 is stated for the 2×2 grid but its derivation (maximum attainable value of the four-player potential) is not shown; a short derivation or reference to the relevant equation would improve clarity.
[Abstract] The abstract reports concrete numerical outcomes (means, 97.7% retention, p-values) without accompanying standard deviations or confidence intervals; adding these would strengthen the presentation of all empirical claims.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the careful and constructive review. We address each major comment below and have revised the manuscript to improve statistical qualification, experimental transparency, and methodological detail.

Referee: [Abstract / hardware results] Abstract and hardware-topology results: the claim that Nash search achieves the highest mean potential on heavy-hex, 2×2 grid, and Rydberg topologies is based on only five random seeds per topology, with paired Wilcoxon tests yielding p≥0.22 on every topology. This sample size is insufficient to support a superiority conclusion; the observed differences are consistent with stochastic variation in the DAG search process itself.

Authors: We agree that five seeds yield insufficient power and that p≥0.22 precludes any claim of statistical superiority. We have revised the abstract and results section to state only that Nash search reports the highest mean potential on each topology, while explicitly noting the small sample, the non-significant p-values, and the stochastic character of the DAG search. A sentence recommending larger-scale validation has been added.

Revision: partial

Referee: [LiH/STO-3G experiment] LiH experiment: the 97.7% correlation-energy retention, gate-count reduction, and simultaneous improvement on the other three axes are reported from a single run seeded from a 58-gate Givens-doubles ansatz. No error bars, multiple independent seeds, or statistical comparison to the baseline are provided, so the result cannot be assessed for robustness.

Authors: We concur that a single run precludes robustness assessment. Because of the substantial computational cost of the Nash search for LiH, we cannot supply additional independent seeds in this revision. We have rewritten the LiH paragraph to present the outcome strictly as an illustrative single-run example, removed any language implying generality, and added an explicit limitations statement in the conclusions.

Revision: partial

Referee: [Method / potential game definition] Potential-function construction: the four player objectives and the overall potential Φ are described at a high level but lack explicit formulas, normalization details, and the precise definition of the block-coordinate ε-Nash residual δ_Nash. Without these, independent verification of the reported Nash residuals and Pareto frontier is not possible.

Authors: We thank the referee for identifying this gap. A new subsection has been inserted in the Methods that supplies the exact mathematical definitions of each player's objective, the potential Φ, all normalization constants, and the precise expression for the block-coordinate ε-Nash residual δ_Nash, together with the algorithmic steps used to compute it. These additions enable independent verification.

Revision: yes

Circularity Check

0 steps flagged

No significant circularity in the potential-game framework or empirical claims

The paper defines a four-player potential game whose state is an explicit circuit DAG and whose four objectives (trainability, non-stabilizerness, task performance, hardware cost) are stated as independent player utilities. The block-coordinate ε-Nash residual δ_Nash is a derived certification quantity computed from those utilities; it does not redefine or presuppose the search outcome. The weight sweep that traces the Pareto frontier is an explicit design choice over the same fixed objectives, not a fit to the final hardware or LiH results. The reported performance comparisons (mean potential on three topologies, fraction of seeds reaching Φ_max=4.10, LiH gate reduction) are empirical outcomes of running the defined search procedure against a simulated-annealing baseline; they are not algebraically forced by the paper's own equations or by any self-citation chain. No uniqueness theorem, ansatz smuggling, or renaming of known results appears in the derivation. The framework is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameter · 2 axioms · 1 invented entity

The framework rests on domain assumptions about game-theoretic modeling of circuit objectives and the existence of a potential function; limited information is available from the abstract alone.

free parameters (1)

weight parameter for Pareto frontier sweep
A single weight sweep traces the frontier from Clifford to non-Clifford endpoints, indicating a tunable scalar that controls the trade-off.

axioms (2)

domain assumption: The circuit design problem admits a potential game formulation with the four stated player objectives.
Core modeling premise that enables use of Nash equilibrium concepts.
standard math: Block-coordinate ε-Nash residual certifies that no unilateral improvement is possible.
Standard concept from potential game theory invoked to validate equilibria.

invented entities (1)

Four-player potential game on circuit DAG (no independent evidence)
purpose: To jointly optimize trainability, non-stabilizerness, performance, and cost
New modeling construct introduced for ansatz design.

Reference graph

Works this paper leans on

15 extracted references · 15 canonical work pages

[1] M. Cerezo et al., Variational quantum algorithms, Nat. Rev. Phys. 3, 625 (2021)
[2] K. Bharti et al., Noisy intermediate-scale quantum algorithms, Rev. Mod. Phys. 94, 015004 (2022)
[3] J. R. McClean, S. Boixo, V. N. Smelyanskiy, R. Babbush, and H. Neven, Barren plateaus in quantum neural network training landscapes, Nat. Commun. 9, 4812 (2018)
[4] M. Ragone et al., A Lie algebraic theory of barren plateaus for deep parameterized quantum circuits, Nat. Commun. 15, 7172 (2024)
[5] M. Cerezo et al., Does provable absence of barren plateaus imply classical simulability?, Nat. Commun. 16, 7907 (2025)
[6] A. Kandala et al., Hardware-efficient variational quantum eigensolver for small molecules and quantum magnets, Nature 549, 242 (2017)
[7] H. R. Grimsley, S. E. Economou, E. Barnes, and N. J. Mayhall, An adaptive variational algorithm for exact molecular simulations on a quantum computer, Nat. Commun. 10, 3007 (2019)
[8] J. Lee, W. J. Huggins, M. Head-Gordon, and K. B. Whaley, Generalized unitary coupled cluster wave functions for quantum computation, J. Chem. Theory Comput. 15, 311 (2019)
[9] S.-X. Zhang, C.-Y. Hsieh, S. Zhang, and H. Yao, Differentiable quantum architecture search, Quantum Sci. Technol. 7, 045023 (2022)
[10] M. Hein, J. Eisert, and H. J. Briegel, Multiparty entanglement in graph states, Phys. Rev. A 69, 062311 (2004)
[11] R. Raussendorf and H. J. Briegel, A one-way quantum computer, Phys. Rev. Lett. 86, 5188 (2001)
[12] L. Leone, S. F. E. Oliviero, and A. Hamma, Stabilizer Rényi entropy, Phys. Rev. Lett. 128, 050402 (2022)
[13] S. Bravyi, G. Smith, and J. A. Smolin, Trading classical and quantum computational resources, Phys. Rev. X 6, 021043 (2016)
[14] D. Monderer and L. S. Shapley, Potential games, Games Econ. Behav. 14, 124 (1996)
[15] S.-X. Zhang et al., TensorCircuit: a quantum software framework for the NISQ era, Quantum 7, 912 (2023)
