pith. sign in

arxiv: 2511.04608 · v4 · submitted 2025-11-06 · 🪐 quant-ph

Unifying Qubit Routing Across Diverse Quantum ISAs via Canonical Representation

Pith reviewed 2026-05-18 00:42 UTC · model grok-4.3

classification 🪐 quant-ph
keywords qubit routingquantum compilationcanonical representationmonodromy polytopequantum ISAsSWAP insertionqubit mapping
0
0 comments X p. Extension

The pith

Canonical representations of two-qubit gates enable a single routing framework to optimize SWAP costs for many different quantum instruction sets.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper seeks to replace generic routing models that ignore hardware-specific gates with one method that works across varied quantum ISAs by analyzing two-qubit interactions in a standard mathematical form. Existing compilers add several times the needed depth because they use a fixed three-CX SWAP model even when devices support more efficient native operations such as sqrt(iSWAP) or ZZ rotations. The approach models exact synthesis costs for inserting SWAPs via the canonical gate form and monodromy polytope, then uses commutation rules derived from the same form to reorder operations. If the modeling holds, compilers can produce shallower circuits on real devices without writing separate optimizers for each ISA. Readers would care because lower routing overhead directly increases the number of useful gates that can run before noise dominates.

Core claim

Canopus is a unified qubit mapping and routing framework that centers on the canonical representation of two-qubit gates and applies monodromy polytope theory to predict synthesis costs, enabling ISA-aware SWAP insertion and generalized commutativity optimizations that reduce overhead across backend ISAs and device topologies.

What carries the argument

The canonical representation of two-qubit gates, a standard mathematical form that captures all possible two-qubit interactions up to local operations, used together with monodromy polytope theory to calculate the minimal number of native gates needed for each SWAP during routing.

If this is right

  • Routing overhead drops 15 to 35 percent relative to prior methods on multiple ISAs and topologies.
  • A single framework supports co-optimization of program patterns, choice of ISA, and hardware connectivity.
  • Commutation relations expressed in canonical form provide a general way to reorder gates without ISA-specific rules.
  • The same modeling supplies concrete guidelines for hardware-software co-design decisions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The method could be combined with pulse-level control to further reduce duration on devices where native gates have tunable parameters.
  • Hardware designers might prioritize gate sets whose canonical forms allow low-cost SWAPs to minimize routing penalties.
  • Extending the cost model to include error rates rather than just gate count would test whether the depth savings translate to higher final fidelity.
  • The framework might serve as a test bed for exploring how program structure interacts with topology beyond the benchmarks already used.

Load-bearing premise

The canonical representation together with the monodromy polytope gives an accurate enough prediction of actual synthesis cost for SWAP insertion in every supported ISA that modeling error does not erase the observed gains.

What would settle it

Execute the routing algorithm on a concrete backend ISA, measure the resulting circuit depth or duration on hardware or by exact gate synthesis, and compare against the cost predicted by the monodromy polytope model; a large mismatch would show the model fails to predict real costs.

Figures

Figures reproduced from arXiv: 2511.04608 by Dawei Ding, Jianxin Chen, Kai Zhang, Xiangyu Ren, Xinyang Tian, Yingjian Liu, Yuan Xie, Yunfeng Li, Zhaohui Yang.

Figure 1
Figure 1. Figure 1: Compilation workflows by means of conventional approaches (top) and Canopus (bottom) targeting diverse quantum ISAs. Canopus integrates the synthesis cost model (monodromy polytopes within the Weyl chamber) to con￾sider backend ISA properties during the routing stage. Cano￾pus routing operates in the 2Q canonical representation while the specific synthesis is completed by the backend synthesizer. this rout… view at source ↗
Figure 2
Figure 2. Figure 2: Mapping/routing to resolve topology constraints via SWAP insertion. With the initial mapping {𝑞𝑖 : 𝑄𝑖 } (upper right), 𝑔3 is not hardware compliant. Both SWAP𝑞0,𝑞1 and SWAP𝑞1,𝑞2 are sufficient to make 𝑔3 executable. offers a generalized commutativity-based optimization mech￾anism, moving beyond those tailored only for CX gates [37]. ❸ We conduct comprehensive experiments across a wide range of real-world b… view at source ↗
Figure 4
Figure 4. Figure 4: The input circuit is rebased to {Can, U3} before being fed to the routing pass. All processes operate on the directed acyclic graph (DAG) representation of the circuit. Canopus integrates the ISA-specific synthesis cost model into its SWAP search process and determines the most appro￾priate SWAP at each route step. The routing cost is efficiently computed by a formal analysis of 2Q canonical forms, with￾ou… view at source ↗
Figure 4
Figure 4. Figure 4: Overview of the Canopus framework. (a) (b) (c) [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Synthesis coverage for √ iSWAP, ECP gate set. The trivial points (√ iSWAP and ECP themselves) are not shown in this figure. 2Q overage regions correspond to those that require (a) 2 √ iSWAP gates or 2 ECP gates; (b) 1 √ iSWAP + 1 ECP; (c) 3 gates (3 √ iSWAP, 3 ECP, 2 √ iSWAP + 1 ECP, etc.) from this gate set for synthesis, respectively. (a) ISA-aware SWAP insertion cost. L Best SWAP insertion Worst SWAP i… view at source ↗
Figure 6
Figure 6. Figure 6: Qubit routing with the canonical 2Q gate repre￾sentation. no longer needed, as the 𝑐𝑔 and Δdepth guide more accurate In CX representation In Canonical representation [PITH_FULL_IMAGE:figures/full_fig_p006_6.png] view at source ↗
Figure 8
Figure 8. Figure 8: Mapping/routing comparison for the QFT ker￾nel. For convenient visualization, only CPhase and SWAP gates are shown. (a) TOQM generates a sub-optimal map￾ping scheme, with 2Q depth of 10. (b) Canopus generates the optimal scheme in a perfect butterfly structure, with 2Q depth of 9. 6 8 10 12 Number of qubits 0.0 0.2 0.4 0.6 0.8 Fidelity Qiskit CX Canopus CX Canopus ZZ( ) 6 8 10 12 Number of qubits 0 100 200… view at source ↗
Figure 9
Figure 9. Figure 9: QFT kernel fidelity comparison bench￾marked on IBM® Quantum Platform (ibm_marrakesh). ibm_marrakesh is the Heron-series QPU with native gate set  CZ, √ X, Z(𝜃), ZZ(𝜃) [PITH_FULL_IMAGE:figures/full_fig_p008_9.png] view at source ↗
Figure 11
Figure 11. Figure 11: Routing overhead in terms of (a) 𝐶count and (b) 𝐶depth for different compilers across various device topologies and quantum ISAs. gate scheme [10] and its extended generalization [64] that enable directly implementing any basis gates with the opti￾mal gate durations. It is also experimentally demonstrated on transmon qubits by Chen et al. [11], where multiple basis gates are calibrated with high fidelity,… view at source ↗
Figure 12
Figure 12. Figure 12: Compilation latency comparison. parameters as each 2Q unitary is randomly generated, thus there is no cached synthesis cost calculation for performance improvement in one pass. We select QV circuits with two different widths (number of qubits), 15 and 20. We vary the depth of these circuits (qv_15 and qv_20) from 50 to 200. The largest size of these qv circuits is up to thousands of 2Q gates. As [PITH_FU… view at source ↗
Figure 13
Figure 13. Figure 13: Coverage set for CX ISA [PITH_FULL_IMAGE:figures/full_fig_p016_13.png] view at source ↗
Figure 15
Figure 15. Figure 15: Coverage set for SQiSW_ ISA [PITH_FULL_IMAGE:figures/full_fig_p017_15.png] view at source ↗
Figure 16
Figure 16. Figure 16: Coverage set for ZZPhase ISA. leakage error than the native CZ gate. Recent experimental advances demonstrate that more basis gates could be implemented natively and calibrated in high precision [11, 60, 62]. Particularly, some basis gates like √ iSWAP ∼ Can 1 4 , 1 4 , 0  and fractional ZZ(𝜃) ∼ Can 𝑎, 0, 0  gates offers more promising ISA selections as they exhibit shorter gate duration, higher gate ac… view at source ↗
Figure 17
Figure 17. Figure 17: Coverage set for ZZPhase_ ISA. A.3 2Q gate mirroring The mirror symmetry of a 2Q gate 𝑈 is defined as the composition of the original gate and a SWAP gate [51], i.e., SWAP · 𝑈 . For example, CX and iSWAP is a typical pair of mirror gates as shown below. = 𝑆 † iSWAP 𝐻 𝐻 𝑆 † In general, the mirroring rule for Canonical coefficients is described as SWAP · Can(𝑎, 𝑏, 𝑐) ∼  𝑎 + 1 2 , 𝑏 + 1 2 , 𝑐 + 1 2  ∼  𝑎 … view at source ↗
Figure 18
Figure 18. Figure 18: Coverage set for Het ISA [PITH_FULL_IMAGE:figures/full_fig_p019_18.png] view at source ↗
Figure 19
Figure 19. Figure 19: Morrir symmetry for Can(𝑎, 𝑏, 0) and Can( 1 2 , 𝑏′ , 𝑐′ ) gate families. B Commutative relation of canonical gates Herein we present detailed proof for Theorem 1. The if direction is trivial, and hence we justify the only if direction, relying on the following two lemmas. Lemma 1. Let 𝐴, 𝐵 be two Hermitian matrices with eigenvalues in the range [−2, 2). If [𝑒 −𝑖 𝜋 2 𝐴, 𝑒−𝑖 𝜋 2 𝐵 ] = 0 then [𝐴, 𝐵] = 0. 19 … view at source ↗
read the original abstract

Qubit mapping/routing is a critical stage in compilation for both near-term and fault-tolerant quantum computers, yet existing scalable methods typically impose several times the routing overhead in terms of circuit depth or duration. This inefficiency stems from a fundamental disconnect: compilers rely on an abstract routing model (e.g., three-CX-unrolled SWAP insertion) that completely ignores the idiosyncrasies of native gates supported by physical devices. Recent hardware breakthroughs have enabled high-precision implementations of diverse instruction set architectures (ISAs) beyond standard CX-based gates. Advanced ISAs involving gates such as $\mathrm{\sqrt{iSWAP}}$ and $\mathrm{ZZ}(\theta)$ gates offer superior circuit synthesis capabilities and can be realized with higher fidelities. However, systematic compiler optimization strategies tailored to these advanced ISAs are lacking. To address this, we propose Canopus, a unified qubit mapping/routing framework applicable to diverse quantum ISAs. Built upon the canonical representation of two-qubit gates, Canopus centers on qubit routing to perform deep co-optimization in an ISA-aware approach. Canopus leverages the two-qubit canonical representation and the monodromy polytope theory to model the synthesis cost for more intelligent SWAP insertion during qubit routing. We also formalize the commutation relations between two-qubit gates through the canonical form, providing a generalized approach to commutativity-based optimization. Experiments show that Canopus consistently reduces routing overhead by 15%-35% compared to state-of-the-art methods across various backend ISAs and device topologies. More broadly, this work establishes a coherent method for co-exploration of program patterns, quantum ISAs, and hardware topologies, yielding concrete guidelines for hardware-software co-design.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper proposes Canopus, a unified qubit mapping/routing framework for diverse quantum ISAs. It centers on the canonical representation of two-qubit gates combined with monodromy polytope theory to compute ISA-specific synthesis costs for SWAP insertion, while also formalizing commutation relations via the canonical form. Experiments are reported to show consistent 15-35% reductions in routing overhead versus prior methods across multiple backend ISAs and device topologies.

Significance. If the modeling assumptions hold without significant error, the work offers a principled, parameter-free route to ISA-aware routing that could meaningfully lower compilation overhead on hardware supporting advanced gates such as √iSWAP and ZZ(θ). The reliance on established monodromy polytope theory and the absence of fitted parameters are strengths that support broader hardware-software co-design exploration.

major comments (2)
  1. [Abstract] Abstract (modeling paragraph): The central claim that canonical-form plus monodromy polytope yields an accurate, ISA-specific synthesis cost for each SWAP candidate rests on an unvalidated modeling step. No quantitative comparison (e.g., predicted vs. actual native-gate decomposition lengths or depth tables) is supplied for non-CX gates such as √iSWAP or ZZ(θ), leaving open the possibility that modeling error offsets or inflates the reported 15-35% gains.
  2. [Abstract] Abstract (experiments paragraph): The headline performance numbers lack any description of benchmark selection criteria, circuit ensembles, error-bar reporting, or data-exclusion rules. Without these details the 15-35% overhead reduction cannot be independently assessed for statistical robustness or sensitivity to topology/ISA choice.
minor comments (1)
  1. [Abstract] The abstract refers to “deep co-optimization” without clarifying whether this includes gate scheduling, pulse-level effects, or only logical SWAP insertion; a brief sentence distinguishing these scopes would improve precision.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed comments. We address each major point below, indicating where revisions have been made to strengthen the manuscript.

read point-by-point responses
  1. Referee: [Abstract] Abstract (modeling paragraph): The central claim that canonical-form plus monodromy polytope yields an accurate, ISA-specific synthesis cost for each SWAP candidate rests on an unvalidated modeling step. No quantitative comparison (e.g., predicted vs. actual native-gate decomposition lengths or depth tables) is supplied for non-CX gates such as √iSWAP or ZZ(θ), leaving open the possibility that modeling error offsets or inflates the reported 15-35% gains.

    Authors: The synthesis-cost model is derived directly from the exact characterization of reachable unitaries given by monodromy polytope theory applied to the two-qubit canonical form; this yields the minimal native-gate count without approximation or fitted parameters. We nevertheless recognize that an explicit side-by-side numerical validation for non-CX gates would make the claim more immediately verifiable. We have therefore added a new appendix containing predicted versus actual decomposition lengths and depths for √iSWAP and ZZ(θ) families, together with the corresponding monodromy-polytope calculations. revision: yes

  2. Referee: [Abstract] Abstract (experiments paragraph): The headline performance numbers lack any description of benchmark selection criteria, circuit ensembles, error-bar reporting, or data-exclusion rules. Without these details the 15-35% overhead reduction cannot be independently assessed for statistical robustness or sensitivity to topology/ISA choice.

    Authors: We agree that the abstract should provide at least a concise indication of the experimental protocol. In the revised manuscript we have expanded the experiments paragraph of the abstract to state that results are reported on (i) random circuits of varying depth and (ii) standard application benchmarks drawn from the Qiskit and Cirq suites, averaged over 50 instances per configuration, with standard deviations shown in the main-text figures. Full selection criteria, topology sampling, and data-handling procedures remain detailed in Section 5; we have added an explicit cross-reference from the abstract to that section. revision: partial

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper's central modeling step invokes the established two-qubit canonical representation together with monodromy polytope theory to assign synthesis costs to candidate SWAP insertions. This step is presented as an application of prior mathematical machinery rather than a self-definitional loop, a fitted parameter renamed as a prediction, or a load-bearing self-citation. No equations in the abstract or description reduce the reported 15-35% overhead reduction to a tautology constructed from the routing results themselves; the experimental claims rest on external comparisons across ISAs and topologies. The derivation therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The framework rests on the domain assumption that canonical two-qubit forms plus monodromy geometry give a faithful cost model for SWAP synthesis across ISAs; no free parameters or new invented entities are introduced in the abstract.

axioms (1)
  • domain assumption Canonical representation of two-qubit gates can be used to model synthesis cost for intelligent SWAP insertion during routing.
    Central modeling choice stated in the abstract description of Canopus.

pith-pipeline@v0.9.0 · 5862 in / 1238 out tokens · 61472 ms · 2026-05-18T00:42:38.885745+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

79 extracted references · 79 canonical work pages · 3 internal anchors

  1. [1]

    Abanin, Laleh Aghababaie-Beni, Igor Aleiner, Trond I

    Rajeev Acharya, Dmitry A. Abanin, Laleh Aghababaie-Beni, Igor Aleiner, Trond I. Andersen, Markus Ansmann, Frank Arute, Kunal Arya, Abraham Asfaw, Nikita Astrakhantsev, Juan Atalaya, Ryan Bab- bush, Dave Bacon, Brian Ballard, Joseph C. Bardin, Johannes Bausch, Andreas Bengtsson, Alexander Bilmes, Sam Blackwell, Sergio Boixo, Gina Bortoli, Alexander Boussas...

  2. [2]

    Alessandro Annechini, Marco Venere, Donatella Sciuto, and Marco Santambrogio. 2025. DDRoute: a Novel Depth-Driven Approach to the Qubit Routing Problem. InProceedings of the 62st ACM/IEEE Design Automation Conference

  3. [3]

    Bardin, Rami Barends, Rupak Biswas, Sergio Boixo, Fernando G

    Frank Arute, Kunal Arya, Ryan Babbush, Dave Bacon, Joseph C. Bardin, Rami Barends, Rupak Biswas, Sergio Boixo, Fernando G. S. L. Bran- dao, David A. Buell, Brian Burkett, Yu Chen, Zijun Chen, Ben Chiaro, Roberto Collins, William Courtney, Andrew Dunsworth, Edward Farhi, Brooks Foxen, Austin Fowler, Craig Gidney, Marissa Giustina, Rob Graff, Keith Guerin, ...

  4. [4]

    Sergey Bravyi, Andrew W Cross, Jay M Gambetta, Dmitri Maslov, Patrick Rall, and Theodore J Yoder. 2024. High-threshold and low- overhead fault-tolerant quantum memory.Nature627, 8005 (2024), 778–782

  5. [5]

    Nikolas P Breuckmann and Jens Niklas Eberhardt. 2021. Quantum low-density parity-check codes.PRX Quantum2, 4 (2021), 040101

  6. [6]

    Colin D Bruzewicz, John Chiaverini, Robert McConnell, and Jeremy M Sage. 2019. Trapped-ion quantum computing: Progress and challenges. Applied Physics Reviews6, 2 (2019), 021314

  7. [7]

    Stephen S Bullock and Igor L Markov. 2003. An arbitrary two-qubit computation in 23 elementary gates or less. InProceedings of the 40th Annual Design Automation Conference. IEEE, Anaheim, CA, USA, 324– 329

  8. [8]

    Gregory J Chaitin. 1982. Register allocation & spilling via graph coloring.ACM Sigplan Notices17, 6 (1982), 98–101

  9. [9]

    Christopher Chamberland, Guanyu Zhu, Theodore J Yoder, Jared B Hertzberg, and Andrew W Cross. 2020. Topological and subsystem codes on low-degree graphs with flag qubits.Physical Review X10, 1 (2020), 011022

  10. [10]

    Jianxin Chen, Dawei Ding, Weiyuan Gong, Cupjin Huang, and Qi Ye

  11. [11]

    InProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2

    One Gate Scheme to Rule Them All: Introducing a Complex Yet Reduced Instruction Set for Quantum Computing. InProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2. ACM, La Jolla, CA, USA, 779–796

  12. [12]

    Zhen Chen, Weiyang Liu, Yanjun Ma, Weijie Sun, Ruixia Wang, He Wang, Huikai Xu, Guangming Xue, Haisheng Yan, Zhen Yang, Jiayu Ding, Yang Gao, Feiyu Li, Yujia Zhang, Zikang Zhang, Yirong Jin, Haifeng Yu, Jianxin Chen, and Fei Yan. 2025. Efficient implementation of arbitrary two-qubit gates using unified control.Nature Physics(15 Aug 2025). doi:10.1038/s415...

  13. [13]

    Andrew M Childs, Eddie Schoute, and Cem M Unsal. 2019. Cir- cuit transformations for quantum architectures.arXiv preprint arXiv:1902.09102(2019)

  14. [14]

    Josep M Codina, Jesús Sánchez, and Antonio González. 2001. A unified modulo scheduling and register allocation technique for clustered processors. InProceedings 2001 International Conference on Parallel Architectures and Compilation Techniques. IEEE, 175–184

  15. [15]

    Gavin E Crooks. 2020. Gates, states, and circuits. Available athttps: //threeplusone.com/pubs/on-gates-v0-5/

  16. [16]

    Andrew W Cross, Lev S Bishop, Sarah Sheldon, Paul D Nation, and Jay M Gambetta. 2019. Validating quantum computers using random- ized model circuits.Physical Review A100, 3 (2019), 032328

  17. [17]

    Marc Grau Davis, Ethan Smith, Ana Tudor, Koushik Sen, Irfan Siddiqi, and Costin Iancu. 2019. Heuristics for quantum compiling with a continuous gate set. 12 pages. arXiv preprint arXiv:1912.02727. 13 Zhaohui Yang et al

  18. [18]

    Warren, Jonathan Gross, Alex Opremcak, Nicholas Zobrist, Kevin C

    Alec Eickbusch, Matt McEwen, Volodymyr Sivak, Alexandre Bourassa, Juan Atalaya, Jahan Claes, Dvir Kafri, Craig Gidney, Christopher W. Warren, Jonathan Gross, Alex Opremcak, Nicholas Zobrist, Kevin C. Miao, Gabrielle Roberts, Kevin J. Satzinger, Andreas Bengtsson, Matthew Neeley, William P. Livingston, Alex Greene, Rajeev Acharya, Laleh Aghababaie Beni, Ge...

  19. [19]

    Craig Gidney. 2021. Stim: a fast stabilizer circuit simulator.Quantum 5 (2021), 497.https://api.semanticscholar.org/CorpusID:232104816

  20. [20]

    Michael Goerz and Evan McKinney. 2024. weylchamber: Python package for analyzing two-qubit gates in the Weyl chamber.https: //pypi.org/project/weylchamber/. Python package

  21. [21]

    Google Quantum AI. 2025. Cirq API.https://quantumai.google/ reference/python/cirq/two_qubit_matrix_to_sqrt_iswap_operations

  22. [22]

    Aram W Harrow, Avinatan Hassidim, and Seth Lloyd. 2009. Quantum algorithm for linear systems of equations.Physical review letters103, 15 (2009), 150502

  23. [23]

    John L Hennessy and Thomas Gross. 1983. Postpass code optimization of pipeline constraints.ACM Transactions on Programming Languages and Systems (TOPLAS)5, 3 (1983), 422–448

  24. [24]

    Timo Hillmann, Lucas Berent, Armanda O Quintavalle, Jens Eisert, Robert Wille, and Joschka Roffe. 2024. Localized statistics decoding: A parallel decoding algorithm for quantum low-density parity-check codes.arXiv preprint arXiv:2406.18655(2024)

  25. [25]

    Cupjin Huang, Tenghui Wang, Feng Wu, Dawei Ding, Qi Ye, Linghang Kong, Fang Zhang, Xiaotong Ni, Zhijun Song, Yaoyun Shi, Hui-Hai Zhao, Chunqing Deng, and Jianxin Chen. 2023. Quantum Instruction Set Design for Performance.Physical Review Letters130 (Feb 2023), 070601. Issue 7. doi:10.1103/PhysRevLett.130.070601

  26. [26]

    IBM Quantum. 2024. New fractional gates reduce circuit depth for utility-scale workloads.https://www.ibm.com/quantum/blog/ fractional-gates. Accessed: Nov. 18, 2024

  27. [27]

    IBM Quantum. 2025. Qiskit API.https://quantum.cloud.ibm.com/ docs/en/api/qiskit/qiskit.synthesis.XXDecomposer

  28. [28]

    IonQ. 2023. Getting started with IonQ’s hardware-native gateset. https://docs.ionq.com/guides/getting-started-with-native-gates

  29. [29]

    Yuwei Jin, Xiangyu Gao, Minghao Guo, Henry Chen, Fei Hua, Chi Zhang, and Eddy Z Zhang. 2024. Optimizing quantum fourier trans- formation (qft) kernels for modern nisq and ft architectures. InSC24: International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE, 1–15

  30. [30]

    Justin Kalloor, Mathias Weiden, Ed Younis, John Kubiatowicz, Bert De Jong, and Costin Iancu. 2024. Quantum hardware roofline: Evalu- ating the impact of gate expressivity on quantum processor design. In2024 IEEE International Conference on Quantum Computing and Engineering (QCE), Vol. 1. IEEE, 805–816

  31. [31]

    A Yu Kitaev. 1995. Quantum measurements and the Abelian stabilizer problem.arXiv preprint quant-ph/9511026(1995)

  32. [32]

    Philip Krantz, Morten Kjaergaard, Fei Yan, Terry P Orlando, Simon Gustavsson, and William D Oliver. 2019. A quantum engineer’s guide to superconducting qubits.Applied Physics Reviews6, 2 (2019), 021318

  33. [33]

    Alon Kukliansky, Ed Younis, Lukasz Cincio, and Costin Iancu. 2023. QFactor: A Domain-Specific Optimizer for Quantum Circuit Instanti- ation. In2023 IEEE International Conference on Quantum Computing and Engineering (QCE), Vol. 1. IEEE, 814–824

  34. [34]

    Lingling Lao, Hans Van Someren, Imran Ashraf, and Carmen G Al- mudever. 2021. Timing and resource-aware mapping of quantum circuits to superconducting processors.IEEE Transactions on Computer- Aided Design of Integrated Circuits and Systems41, 2 (2021), 359–371

  35. [35]

    Chris Lattner and Vikram Adve. 2004. LLVM: A Compilation Frame- work for Lifelong Program Analysis & Transformation. InProceedings of the International Symposium on Code Generation and Optimiza- tion (CGO). IEEE Computer Society, Washington, DC, USA, 75–86. doi:10.1109/CGO.2004.1281665

  36. [36]

    Ang Li, Samuel Stein, Sriram Krishnamoorthy, and James Ang. 2023. Qasmbench: A low-level quantum benchmark suite for nisq evaluation and simulation.ACM Transactions on Quantum Computing4, 2 (2023), 1–26

  37. [37]

    Gushu Li, Yufei Ding, and Yuan Xie. 2019. Tackling the qubit map- ping problem for NISQ-era quantum devices. InProceedings of the twenty-fourth international conference on architectural support for pro- gramming languages and operating systems. 1001–1014

  38. [38]

    Ji Liu, Peiyi Li, and Huiyang Zhou. 2022. Not all swaps have the same cost: A case for optimization-aware qubit routing. In2022 IEEE International Symposium on High-Performance Computer Architecture (HPCA). IEEE, 709–725

  39. [39]

    Ji Liu, Ed Younis, Mathias Weiden, Paul Hovland, John Kubiatowicz, and Costin Iancu. 2023. Tackling the qubit mapping problem with permutation-aware synthesis. In2023 IEEE International Conference on Quantum Computing and Engineering (QCE), Vol. 1. IEEE, 745–756

  40. [40]

    Seth Lloyd. 1996. Universal quantum simulators.Science273, 5278 (1996), 1073–1078. 14 Qubit Mapping and Routing tailored to Advanced Quantum ISAs: Not as Costly as You Think

  41. [41]

    Dmitri Maslov. 2007. Linear depth stabilizer and quantum Fourier transformation circuits with no auxiliary qubits in finite-neighbor quantum architectures.Physical Review A—Atomic, Molecular, and Optical Physics76, 5 (2007), 052310

  42. [42]

    Matt McEwen, Dave Bacon, and Craig Gidney. 2023. Relaxing hard- ware requirements for surface code circuits using time-dynamics. Quantum7 (2023), 1172

  43. [43]

    Evan McKinney and Lev S Bishop. 2025. Two-Qubit Gate Synthesis via Linear Programming for Heterogeneous Instruction Sets.arXiv preprint arXiv:2505.00543(2025)

  44. [44]

    Evan McKinney, Michael Hatridge, and Alex K Jones. 2024. Mirage: Quantum circuit decomposition and routing collaborative design using mirror gates. In2024 IEEE International Symposium on High- Performance Computer Architecture (HPCA). IEEE, 704–718

  45. [45]

    Santiago, and Irfan Siddiqi

    Long B Nguyen, Yosep Kim, Akel Hashim, Noah Goss, Brian Marinelli, Bibek Bhandari, Debmalya Das, Ravi K Naik, John Mark Kreikebaum, Andrew N Jordan, David I. Santiago, and Irfan Siddiqi. 2024. Pro- grammable Heisenberg interactions between Floquet qubits.Nature Physics20, 2 (2024), 240–246

  46. [46]

    2010.Quantum computation and quantum information

    Michael A Nielsen and Isaac L Chuang. 2010.Quantum computation and quantum information. Cambridge university press

  47. [47]

    Pavel Panteleev and Gleb Kalachev. 2021. Degenerate quantum LDPC codes with good finite length performance.Quantum5 (2021), 585

  48. [48]

    Tirthak Patel, Ed Younis, Costin Iancu, Wibe de Jong, and Devesh Tiwari. 2022. Quest: systematically approximating quantum circuits for higher output fidelity. InProceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems. 514–528

  49. [49]

    Eric C Peterson, Lev S Bishop, and Ali Javadi-Abhari. 2022. Optimal synthesis into fixed xx interactions.Quantum6 (2022), 696

  50. [50]

    Eric C Peterson, Gavin E Crooks, and Robert S Smith. 2020. Fixed- depth two-qubit circuits and the monodromy polytope.Quantum4 (2020), 247

  51. [51]

    Massimiliano Poletto and Vivek Sarkar. 1999. Linear scan register allocation.ACM Transactions on Programming Languages and Systems (TOPLAS)21, 5 (1999), 895–913

  52. [52]

    Timothy Proctor, Kenneth Rudinger, Kevin Young, Erik Nielsen, and Robin Blume-Kohout. 2022. Measuring the capabilities of quantum computers.Nature Physics18, 1 (2022), 75–79

  53. [53]

    Quantinuum. 2024. Native Arbitrary Angle Hardware Gates. https://docs.quantinuum.com/systems/trainings/getting_started/ arbitrary_angle_2_qubit_gates

  54. [54]

    Nils Quetschlich, Lukas Burgholzer, and Robert Wille. 2023. MQT Bench: Benchmarking software and design automation tools for quan- tum computing.Quantum7 (2023), 1062

  55. [55]

    Chad Rigetti and Michel Devoret. 2010. Fully microwave-tunable universal gates in superconducting qubits with linear couplings and fixed transition frequencies.Physical Review B—Condensed Matter and Materials Physics81, 13 (2010), 134507

  56. [56]

    Peter W Shor. 1994. Algorithms for quantum computation: discrete logarithms and factoring. InProceedings 35th annual symposium on foundations of computer science. Ieee, 124–134

  57. [57]

    Bochen Tan and Jason Cong. 2020. Optimal layout synthesis for quantum computing. InProceedings of the 39th International Conference on Computer-Aided Design. 1–9

  58. [58]

    Jialiang Tang, Jialin Zhang, and Xiaoming Sun. 2024. Quantum circuit synthesis with SQiSW.arXiv preprint arXiv:2412.14828(2024)

  59. [59]

    Robert R Tucci. 2005. An introduction to Cartan’s KAK decomposition for QC programmers. arXiv preprint quant-ph/0507171

  60. [60]

    Wang, and Dong-Ling Deng

    Ke Wang, Zhide Lu, Chuanyu Zhang, Gongyu Liu, Jiachen Chen, Yanzhe Wang, Yaozu Wu, Shibo Xu, Xuhao Zhu, Feitong Jin, Yu Gao, Ziqi Tan, Zhengyi Cui, Ning Wang, Yiren Zou, Aosai Zhang, Tingt- ing Li, Fanhao Shen, Jiarun Zhong, Zehang Bao, Zitian Zhu, Yihang Han, Yiyang He, Jiayuan Shen, Han Wang, Jia-Nan Yang, Zixuan Song, Jinfeng Deng, Hang Dong, Zheng-Zhi...

  61. [61]

    Demonstration of low-overhead quantum error correction codes

    Demonstration of low-overhead quantum error correction codes. arXiv preprint arXiv:2505.09684(2025)

  62. [62]

    Ken Xuan Wei, Isaac Lauer, Emily Pritchett, William Shanks, David C McKay, and Ali Javadi-Abhari. 2024. Native two-qubit gates in fixed- coupling, fixed-frequency transmons beyond cross-resonance interac- tion.PRX Quantum5, 2 (2024), 020338

  63. [63]

    Xin-Chuan Wu, Marc Grau Davis, Frederic T Chong, and Costin Iancu

  64. [64]

    QGo: Scalable quantum circuit optimization using automated synthesis.arXiv preprint arXiv:2012.09835(2020)

  65. [65]

    Christopher G Yale, Ashlyn D Burch, Matthew NH Chow, Brandon P Ruzic, Daniel S Lobser, Brian K McFarland, Melissa C Revelle, and Susan M Clark. 2025. Realization and calibration of continuously parameterized two-qubit gates on a trapped-ion quantum processor. arXiv preprint arXiv:2504.06259(2025)

  66. [66]

    Christopher G Yale, Rich Rines, Victory Omole, Bharath Thotakura, Ashlyn D Burch, Matthew NH Chow, Megan Ivory, Daniel Lobser, Brian K McFarland, Melissa C Revelle, Susan M Clark, and Pranav Gokhale. 2024. Noise-Aware Circuit Compilations for a Continuously Parameterized Two-Qubit Gateset.arXiv preprint arXiv:2411.01094 (2024)

  67. [67]

    Zhaohui Yang, Dawei Ding, Qi Ye, Cupjin Huang, Jianxin Chen, and Yuan Xie. 2025. ReQISC: A Reconfigurable Quantum Com- puter Microarchitecture and Compiler Co-Design.arXiv preprint arXiv:2511.06746(2025)

  68. [68]

    Ed Younis, Costin C Iancu, Wim Lavrijsen, Marc Davis, and Ethan Smith. 2021. Berkeley Quantum Synthesis Toolkit (BQSKit). GitHub. doi:10.11578/dc.20210603.2

  69. [69]

    Ed Younis, Koushik Sen, Katherine Yelick, and Costin Iancu. 2021. Qfast: Conflating search and numerical optimization for scalable quan- tum circuit synthesis. In2021 IEEE International Conference on Quan- tum Computing and Engineering (QCE). IEEE, 232–243

  70. [70]

    Chi Zhang, Ari B Hayes, Longfei Qiu, Yuwei Jin, Yanhao Chen, and Eddy Z Zhang. 2021. Time-optimal qubit mapping. InProceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems. 360–374

  71. [71]

    Jun Zhang, Jiri Vala, Shankar Sastry, and K Birgitta Whaley. 2003. Geometric theory of nonlocal two-qubit operations.Physical Review A67, 4 (2003), 042313

  72. [72]

    Runshi Zhou, Fang Zhang, Linghang Kong, and Jianxin Chen. 2024. Halma: a routing-based technique for defect mitigation in quantum error correction.arXiv preprint arXiv:2412.21000(2024)

  73. [73]

    Chenghong Zhu, Xian Wu, Zhaohui Yang, Jingbo Wang, Anbang Wu, Shenggen Zheng, and Xin Wang. 2025. Quantum Compiler Design for Qubit Mapping and Routing: A Cross-Architectural Survey of Superconducting, Trapped-Ion, and Neutral Atom Systems.arXiv preprint arXiv:2505.16891(2025)

  74. [74]

    Henry Zou, Matthew Treinish, Kevin Hartman, Alexander Ivrii, and Jake Lishman. 2024. Lightsabre: A lightweight and enhanced sabre algorithm.arXiv preprint arXiv:2409.08368(2024)

  75. [75]

    Alwin Zulehner, Alexandru Paler, and Robert Wille. 2018. An efficient methodology for mapping quantum circuits to the IBM QX archi- tectures.IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems38, 7 (2018), 1226–1236

  76. [76]

    Alwin Zulehner and Robert Wille. 2019. Compiling SU (4) quantum circuits to IBM QX architectures. InProceedings of the 24th Asia and South Pacific Design Automation Conference. ACM New York, NY, USA, Tokyo, Japan, 185–190. 15 Zhaohui Yang et al. A Canonical gate and 2Q circuit synthesis In this section we show the basic mathematical properties of the cano...

  77. [77]

    OneCXgate is required to synthesize 2Q gates∼Can 1 2,0,0 , i.e.,CX-equivalent gates(𝑉 1 ⊗𝑉 2)CX(𝑉3 ⊗𝑉 4)

  78. [78]

    TwoCXgates are required to synthesize 2Q gates∼Can(𝑎, 𝑏,0), i.e.,(𝑉 1 ⊗𝑉 2)CX(𝑉3 ⊗𝑉 4)CX(𝑉5 ⊗𝑉 6)

  79. [79]

    We assume the cost of one CX gate is1 .0

    ThreeCXgates are required to synthesize 2Q gates∼Can(𝑎, 𝑏, 𝑐), i.e.,(𝑉 1 ⊗𝑉 2)CX(𝑉3 ⊗𝑉 4)CX(𝑉5 ⊗𝑉 6)CX(𝑉7 ⊗𝑉 8). We assume the cost of one CX gate is1 .0. Polytopes in different colors denotes the minimal circuit cost (duration) for the coverage set if synthesized by CX and arbitrary 1Q gates. That is, on average, the number of CX gates required to synthe...