Convolutional neural network based decoders for surface codes
Pith reviewed 2026-05-24 05:09 UTC · model grok-4.3
The pith
Convolutional neural network decoders achieve good performance on surface codes and adapt to different noise models.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors report that convolutional neural network decoders for surface codes deliver good performance on error syndrome data drawn from different code distances and from multiple noise models. They further show that the same network architecture can be retrained or adapted to handle shifts in the underlying noise statistics. Explainable machine learning analysis is used to map which input features drive the decoder's outputs and where mistakes arise, providing a route to iterative improvements in robustness.
What carries the argument
convolutional neural network trained to map surface-code syndrome patterns to error corrections, with the network weights adjusted to different noise statistics
If this is right
- Decoding time no longer scales with classical algorithm complexity when code distance grows.
- A single decoder architecture can be reused across runs that experience different noise environments.
- Explainable analysis of network decisions can guide targeted retraining to reduce specific error classes.
- Surface-code logical qubits can be operated at higher effective rates if syndrome processing keeps pace with hardware cycle times.
Where Pith is reading between the lines
- Hardware implementations could embed the network weights directly in control electronics to remove data transfer latency between quantum device and classical processor.
- Periodic retraining on fresh calibration data might allow the decoder to track slow drifts in qubit parameters without interrupting computation.
- The same convolutional approach could be tested on other topological codes whose syndrome graphs share local structure with the surface code.
Load-bearing premise
The simulated noise models and training distributions match the actual error processes that occur on physical quantum hardware.
What would settle it
Running the trained CNN decoder on a real quantum device and measuring logical error rates that deviate sharply from the simulated predictions under the same nominal noise parameters.
Figures
read the original abstract
The decoding of error syndromes of surface codes with classical algorithms may slow down quantum computation. To overcome this problem it is possible to implement decoding algorithms based on artificial neural networks. This work reports a study of decoders based on convolutional neural networks, tested on different code distances and noise models. The results show that decoders based on convolutional neural networks have good performance and can adapt to different noise models. Moreover, explainable machine learning techniques have been applied to the neural network of the decoder to better understand the behaviour and errors of the algorithm, in order to produce a more robust and performing algorithm.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript investigates convolutional neural network (CNN) based decoders for surface code error syndromes. It evaluates performance on varying code distances and noise models (including depolarizing and bit-flip), asserts that the decoders achieve good performance and adapt to different noise models, and applies explainable machine learning techniques to analyze network behavior and errors for improved robustness.
Significance. If the performance and adaptation results are substantiated by rigorous quantitative benchmarks against established decoders, this could support development of efficient, flexible decoders relevant to fault-tolerant quantum computation. The application of explainable ML techniques is a positive feature that may aid interpretability and refinement of ML-based quantum decoders.
major comments (2)
- [Abstract and results] Abstract and results sections: performance and adaptation claims are stated without reported numerical error rates, thresholds, baselines (e.g., MWPM or union-find decoder comparisons), error bars, or training/validation split details. This is load-bearing for the central claim that the CNN decoders 'have good performance and can adapt to different noise models'.
- [Results and discussion] Results and discussion on noise models: adaptability is demonstrated only within fixed phenomenological or circuit-level simulated models; no transfer metrics, cross-model generalization tests, or validation against hardware-calibrated or non-Markovian noise distributions are provided. This directly affects the hardware relevance of the adaptation result.
minor comments (2)
- [Methods] Provide explicit description of the CNN architecture, layer dimensions, activation functions, loss function, and optimizer in the methods section for reproducibility.
- [Figures] Ensure all figures include clear legends, axis labels, and captions that allow direct comparison of CNN performance to baselines across code distances.
Simulated Author's Rebuttal
We thank the referee for the detailed review and valuable feedback. We address each major comment below and outline the revisions we will make to improve clarity and substantiation of the claims.
read point-by-point responses
-
Referee: [Abstract and results] Abstract and results sections: performance and adaptation claims are stated without reported numerical error rates, thresholds, baselines (e.g., MWPM or union-find decoder comparisons), error bars, or training/validation split details. This is load-bearing for the central claim that the CNN decoders 'have good performance and can adapt to different noise models'.
Authors: We agree that explicit numerical reporting strengthens the manuscript. The performance comparisons are visualized in the results figures (logical error rates vs. physical error rate for varying distances and noise models), but we will add a dedicated table in the revised results section listing specific error rates at key points, estimated thresholds where observable, direct numerical comparisons to MWPM and union-find baselines, standard error bars from repeated training runs, and the precise training/validation/test split ratios used. This addresses the load-bearing aspect of the central claim without altering the underlying data. revision: yes
-
Referee: [Results and discussion] Results and discussion on noise models: adaptability is demonstrated only within fixed phenomenological or circuit-level simulated models; no transfer metrics, cross-model generalization tests, or validation against hardware-calibrated or non-Markovian noise distributions are provided. This directly affects the hardware relevance of the adaptation result.
Authors: The manuscript demonstrates adaptation by retraining the same CNN architecture independently on each noise model (phenomenological depolarizing and bit-flip) and obtaining competitive performance in each case. We acknowledge that this does not include cross-model transfer metrics or tests on hardware-calibrated/non-Markovian noise. In revision we will expand the discussion section to explicitly state the scope (simulated Markovian models only), add a limitations paragraph on the absence of transfer learning experiments, and note that hardware validation remains future work. No new experiments will be added at this stage. revision: partial
Circularity Check
No circularity: standard empirical ML evaluation on simulated data
full rationale
The paper applies convolutional neural networks to surface-code decoding via supervised training on simulated syndromes. No derivation chain, equations, or first-principles claims exist that reduce to self-definition, fitted inputs renamed as predictions, or self-citation load-bearing steps. Performance and adaptability results are obtained by training on one noise model and evaluating on held-out data or different models, which is externally falsifiable against the simulations and does not collapse by construction. The central claims rest on empirical benchmarks rather than any internal redefinition.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Surface-code syndromes contain sufficient information to identify correctable errors under the assumed noise models
- domain assumption Supervised training on simulated data produces a decoder that generalizes to the target noise distribution
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The results show that decoders based on convolutional neural networks have good performance and can adapt to different noise models.
-
IndisputableMonolith/Foundation/ArithmeticFromLogic.leanLogicNat recovery theorems unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
explainable machine learning techniques have been applied to the neural network of the decoder
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Maximilian Schlosshauer. “Quantum de- coherence”. In: Physics Reports 831 (2019), pp. 1–57. issn: 0370-1573. doi: https : / / doi . org / 10 . 1016 / j . physrep . 2019 . 10 . 001. url: https : //www.sciencedirect.com/science/ article/pii/S0370157319303084
work page 2019
-
[2]
G. Benenti, G. Casati, and G. Strini. Principles Of Quantum Computation And Information - Volume II: Basic Tools And Special Topics . Italy: World Scientific Publishing Company, 2007. isbn: 9789814365550. url: https : / / books . google . it / books ? id = Its7DQAAQBAJ
work page 2007
-
[3]
Quantum error correc- tion: an introductory guide
Joschka Roffe. “Quantum error correc- tion: an introductory guide”. In: Con- temporary Physics 60.3 (July 2019), pp. 226–245. issn: 1366-5812. doi: 10 . 1080 / 00107514 . 2019 . 1667078. url: http : / / dx . doi . org / 10 . 1080 / 00107514.2019.1667078
-
[4]
Theory of Quantum Error Correction for General Noise
Emanuel Knill, Raymond Laflamme, and Lorenza Viola. “Theory of Quantum Error Correction for General Noise”. In: Physical Review Letters 84.11 (Mar. 2000), pp. 2525–2528. issn: 1079-7114. doi: 10 . 1103 / physrevlett . 84 . 2525. url: http : / / dx . doi . org / 10 . 1103 / PhysRevLett.84.2525
work page 2000
-
[5]
Fault-tolerant quantum computation by anyons
A.Yu. Kitaev. “Fault-tolerant quantum computation by anyons”. In: Annals of Physics 303.1 (Jan. 2003), pp. 2–30. issn: 0003-4916. doi: 10.1016/s0003- 4916(02 ) 00018 - 0. url: http : / / dx . doi . org / 10 . 1016 / S0003 - 4916(02 ) 00018-0
-
[6]
Surface codes: Towards practical large-scale quantum computation
Austin G. Fowler et al. “Surface codes: Towards practical large-scale quantum computation”. In: Physical Review A 86.3 (Sept. 2012). issn: 1094-1622. doi: 10 . 1103 / physreva . 86 . 032324. url: http : / / dx . doi . org / 10 . 1103 / PhysRevA.86.032324
work page 2012
-
[7]
Decoding surface code with a distributed neural network based decoder
Savvas Varsamopoulos, Koen Bertels, and Carmen G. Almudever. “Decoding surface code with a distributed neu- ral network based decoder”. In: (2019). arXiv: 1901.10847 [quant-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[8]
Com- paring Neural Network Based Decoders for the Surface Code
Savvas Varsamopoulos, Koen Bertels, and Carmen Garcia Almudever. “Com- paring Neural Network Based Decoders for the Surface Code”. In: IEEE Trans- actions on Computers 69.2 (Feb. 2020), pp. 300–311. issn: 2326-3814. doi: 10 . 1109 / tc . 2019 . 2948612. url: http : / / dx . doi . org / 10 . 1109 / TC . 2019 . 2948612
work page 2020
-
[9]
Michael A. Nielsen and Isaac L. Chuang. Quantum Computation and Quantum Information: 10th Anniversary Edition . 10th. USA: Cambridge University Press,
-
[10]
Surface code quan- tum computing by lattice surgery
Clare Horsman et al. “Surface code quan- tum computing by lattice surgery”. In: New Journal of Physics 14.12 (Dec. 2012), p. 123011. issn: 1367-2630. doi: 10 . 1088 / 1367 - 2630 / 14 / 12 / 123011. url: http : / / dx . doi . org / 10 . 1088 / 1367-2630/14/12/123011
work page 2012
-
[11]
Scaling of the quantum approximate optimization algorithm on superconducting qubit based hardware,
Daniel Litinski. “A Game of Surface Codes: Large-Scale Quantum Comput- ing with Lattice Surgery”. In: Quantum 3 (Mar. 2019), p. 128. issn: 2521-327X. doi: 10 . 22331 / q - 2019 - 03 - 05 - 128. url: http://dx.doi.org/10.22331/q- 2019-03-05-128
work page doi:10.22331/q- 2019
-
[12]
Quantum error cor- rection for quantum memories
Barbara M. Terhal. “Quantum error cor- rection for quantum memories”. In: Re- views of Modern Physics 87.2 (Apr. 2015), pp. 307–346. issn: 1539-0756. doi: 10 . 1103 / revmodphys . 87 . 307. url: 20 http : / / dx . doi . org / 10 . 1103 / RevModPhys.87.307
work page 2015
-
[13]
Analytic asymptotic performance of topological codes
Austin G. Fowler. “Analytic asymptotic performance of topological codes”. In: Physical Review A 87.4 (Apr. 2013). issn: 1094-1622. doi: 10 . 1103 / physreva . 87 . 040301. url: http : / / dx . doi . org / 10 . 1103 / PhysRevA . 87 . 040301
work page 2013
-
[14]
Threshold error rates for the toric and surface codes
D. S. Wang et al. “Threshold error rates for the toric and surface codes”. In: (2009). arXiv: 0905.0531 [quant-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2009
-
[15]
Low- distance surface codes under realistic quantum noise
Yu Tomita and Krysta M. Svore. “Low- distance surface codes under realistic quantum noise”. In: Physical Review A 90.6 (Dec. 2014). issn: 1094-1622. doi: 10 . 1103 / physreva . 90 . 062320. url: http : / / dx . doi . org / 10 . 1103 / PhysRevA.90.062320
work page 2014
-
[16]
Stabilizer Codes and Quantum Error Correction
Daniel Gottesman. “Stabilizer Codes and Quantum Error Correction”. In: (1997). arXiv: 9705052 [quant-ph]
work page 1997
-
[17]
Adaptive Weight Estimator for Quantum Error Correction in a Time-Dependent Environment
Stephen T. Spitz et al. “Adaptive Weight Estimator for Quantum Error Correction in a Time-Dependent Environment”. In: Advanced Quantum Technologies 1.1 (July 2018). issn: 2511-9044. doi: 10 . 1002 / qute . 201800012. url: http : / / dx.doi.org/10.1002/qute.201800012
-
[18]
Austin G. Fowler. “Minimum weight per- fect matching of fault-tolerant topologi- cal quantum error correction in average O(1) parallel time”. In: (2014). arXiv: 1307.1740 [quant-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[19]
Com- puting Minimum-Weight Perfect Match- ings
William Cook and Andr´ e Rohe. “Com- puting Minimum-Weight Perfect Match- ings”. In: INFORMS Journal on Com- puting 11.2 (1999), pp. 138–148. doi: 10. 1287/ijoc.11.2.138 . url: https:// doi.org/10.1287/ijoc.11.2.138
-
[20]
Available: https://link.aps.org/doi/10.1103/PhysRevA
Sergey Bravyi, Martin Suchara, and Alexander Vargo. “Efficient algorithms for maximum likelihood decoding in the surface code”. In: Physical Review A 90.3 (Sept. 2014). doi: 10.1103/physreva. 90.032326. url: https://doi.org/10. 11032Fphysreva.90.032326
-
[21]
A renormalization group decod- ing algorithm for topological quantum codes
Guillaume Duclos-Cianci and David Poulin. “A renormalization group decod- ing algorithm for topological quantum codes”. In: (2010). arXiv: 1006 . 1362 [quant-ph]
work page 2010
-
[22]
Fast Decoders for Topological Quantum Codes
Guillaume Duclos-Cianci and David Poulin. “Fast Decoders for Topological Quantum Codes”. In: Physical Review Letters 104.5 (Feb. 2010). issn: 1079-
work page 2010
-
[23]
doi: 10.1103/physrevlett.104. 050504. url: http://dx.doi.org/10. 1103/PhysRevLett.104.050504
-
[24]
Cellular- automaton decoders for topological quantum memories
Michael Herold et al. “Cellular- automaton decoders for topological quantum memories”. In: npj Quantum Information 1.1 (Oct. 2015). issn: 2056-
work page 2015
-
[25]
doi: 10 . 1038 / npjqi . 2015 . 10. url: http : / / dx . doi . org / 10 . 1038 / npjqi.2015.10
work page 2015
-
[26]
Efficient algorithms for maximum likelihood decoding in the surface code
Sergey Bravyi, Martin Suchara, and Alexander Vargo. “Efficient algorithms for maximum likelihood decoding in the surface code”. In: Physical Review A 90.3 (Sept. 2014). issn: 1094-1622. doi: 10. 1103/physreva.90.032326. url: http: //dx.doi.org/10.1103/PhysRevA.90. 032326
-
[27]
Efficient Markov chain Monte Carlo algorithm for the surface code
Adrian Hutter, James R. Wootton, and Daniel Loss. “Efficient Markov chain Monte Carlo algorithm for the surface code”. In: Physical Review A 89.2 (Feb. 2014). issn: 1094-1622. doi: 10 . 1103 / physreva . 89 . 022326. url: http : / / dx . doi . org / 10 . 1103 / PhysRevA . 89 . 022326. 21
work page 2014
-
[28]
Decoding small sur- face codes with feedforward neural net- works
Savvas Varsamopoulos, Ben Criger, and Koen Bertels. “Decoding small sur- face codes with feedforward neural net- works”. In: Quantum Science and Tech- nology 3.1 (Nov. 2017). issn: 2058-9565. doi: 10.1088/2058-9565/aa955a . url: http://dx.doi.org/10.1088/2058- 9565/aa955a
-
[29]
Optimal complexity correction of correlated errors in the surface code
Austin G. Fowler. “Optimal complexity correction of correlated errors in the sur- face code”. In: (2013). arXiv: 1310.0863 [quant-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2013
-
[30]
A decoding algorithm for CSS codes using the X/Z correlations
Nicolas Delfosse and Jean-Pierre Tillich. “A decoding algorithm for CSS codes using the X/Z correlations”. In: 2014 IEEE International Symposium on In- formation Theory (June 2014). doi: 10. 1109/isit.2014.6874997 . url: http: //dx.doi.org/10.1109/ISIT.2014. 6874997
-
[31]
Machine- learning-assisted correction of correlated qubit errors in a topological code
Paul Baireuther et al. “Machine- learning-assisted correction of correlated qubit errors in a topological code”. In: Quantum 2 (Jan. 2018), p. 48. issn: 2521-327X. doi: 10.22331/q-2018-01- 29- 48. url: http://dx.doi.org/10. 22331/q-2018-01-29-48
-
[32]
Giacomo Torlai and Roger G. Melko. “Neural Decoder for Topological Codes”. In: Physical Review Letters 119.3 (July 2017). issn: 1079-7114. doi: 10 . 1103 / physrevlett.119.030501 . url: http: //dx.doi.org/10.1103/PhysRevLett. 119.030501
-
[33]
Deep Neural Network Probabilistic De- coder for Stabilizer Codes
Stefan Krastanov and Liang Jiang. “Deep Neural Network Probabilistic De- coder for Stabilizer Codes”. In: Scientific Reports 7 (May 2017). doi: 10 . 1038 / s41598-017-11266-1
work page 2017
-
[34]
Neural ensemble de- coding for topological quantum error- correcting codes
Milap Sheth, Sara Zafar Jafarzadeh, and Vlad Gheorghiu. “Neural ensemble de- coding for topological quantum error- correcting codes”. In: Physical Review A 101.3 (Mar. 2020). issn: 2469-9934. doi: 10 . 1103 / physreva . 101 . 032338. url: http : / / dx . doi . org / 10 . 1103 / PhysRevA.101.032338
work page 2020
-
[35]
Efficient De- coding of Surface Code Syndromes for Error Correction in Quantum Comput- ing
Debasmita Bhoumik et al. “Efficient De- coding of Surface Code Syndromes for Error Correction in Quantum Comput- ing”. In: (2021). arXiv: 2110 . 10896 [quant-ph]
work page 2021
-
[36]
Tech- niques for combining fast local decoders with global decoders under circuit-level noise
Christopher Chamberland et al. “Tech- niques for combining fast local decoders with global decoders under circuit-level noise”. In: Quantum Science and Tech- nology 8.4 (July 2023), p. 045011. issn: 2058-9565. doi: 10 . 1088 / 2058 - 9565 / ace64d. url: http://dx.doi.org/10. 1088/2058-9565/ace64d
work page 2023
-
[37]
Deep neural decoders for near term fault-tolerant experiments
Christopher Chamberland and Pooya Ronagh. “Deep neural decoders for near term fault-tolerant experiments”. In: Quantum Science and Technology 3.4 (July 2018), p. 044002. issn: 2058-9565. doi: 10.1088/2058-9565/aad1f7 . url: http://dx.doi.org/10.1088/2058- 9565/aad1f7
-
[38]
Scalable Neural Decoder for Topological Surface Codes
Kai Meinerz, Chae-Yeun Park, and Si- mon Trebst. “Scalable Neural Decoder for Topological Surface Codes”. In: (2021). arXiv: 2101.07285 [quant-ph]
-
[39]
Neural Network Decoders for Large-Distance 2D Toric Codes
Xiaotong Ni. “Neural Network Decoders for Large-Distance 2D Toric Codes”. In: Quantum 4 (Aug. 2020), p. 310. issn: 2521-327X. doi: 10.22331/q-2020-08- 24-310. url: http://dx.doi.org/10. 22331/q-2020-08-24-310
-
[40]
Amarsanaa Davaasuren et al. “Gen- eral framework for constructing fast and near-optimal machine-learning-based de- coder of the topological stabilizer codes”. In: Physical Review Research 2.3 (Sept. 2020). issn: 2643-1564. doi: 10 . 1103 / 22 physrevresearch . 2 . 033399 . url: http : / / dx . doi . org / 10 . 1103 / PhysRevResearch.2.033399
work page 2020
-
[41]
Austin G. Fowler et al. “Topological Code Autotune”. In: Physical Review X 2.4 (Oct. 2012). doi: 10 . 1103 / physrevx . 2 . 041003. url: https : / / doi . org / 10 . 11032Fphysrevx . 2 . 041003
work page 2012
-
[42]
Multi- Scale Context Aggregation by Dilated Convolutions
Fisher Yu and Vladlen Koltun. “Multi- Scale Context Aggregation by Dilated Convolutions”. In: (2016). arXiv: 1511. 07122 [cs.CV]
work page 2016
-
[43]
Object Detectors Emerge in Deep Scene CNNs
Bolei Zhou et al. “Object Detectors Emerge in Deep Scene CNNs”. In: (2015). arXiv: 1412.6856 [cs.CV]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[44]
Grad- CAM: Visual Explanations from Deep Networks via Gradient-Based Localiza- tion
Ramprasaath R. Selvaraju et al. “Grad- CAM: Visual Explanations from Deep Networks via Gradient-Based Localiza- tion”. In: International Journal of Com- puter Vision 128.2 (Oct. 2019), pp. 336–
work page 2019
-
[45]
issn: 1573-1405. doi: 10 . 1007 / s11263 - 019 - 01228 - 7. url: http : / / dx . doi . org / 10 . 1007 / s11263 - 019 - 01228-7
-
[46]
Visualizing and Understanding Convolutional Networks
Matthew D Zeiler and Rob Fergus. “Visualizing and Understanding Convo- lutional Networks”. In: (2013). arXiv: 1311.2901 [cs.CV]. 23
work page internal anchor Pith review Pith/arXiv arXiv 2013
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.