Graph Neural Networks for Enhanced Decoding of Quantum LDPC Codes

Anqi Gong; Joseph M. Renes; Sebastian Cammerer

arxiv: 2310.17758 · v2 · pith:Y5YGUV5Unew · submitted 2023-10-26 · 🪐 quant-ph

Graph Neural Networks for Enhanced Decoding of Quantum LDPC Codes

Anqi Gong , Sebastian Cammerer , Joseph M. Renes This is my paper

Pith reviewed 2026-05-24 06:27 UTC · model grok-4.3

classification 🪐 quant-ph

keywords quantum LDPC codesgraph neural networksbelief propagationerror floorquantum error correctiondifferentiable decoderpost-processing methodstrapping sets

0 comments

The pith

A hybrid decoder interleaves graph neural network layers with belief propagation on the same graph to compensate for trapping sets in quantum LDPC codes.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a fully differentiable decoder that runs classical belief propagation stages separated by graph neural network layers operating on the identical sparse graph. The GNN uses information from a prior BP run that has stalled to reinitialize the next run, allowing the system to learn around short cycles and trapping sets that arise from quantum code design rules. If the approach works, it reduces the error floor while requiring fewer post-processing attempts than random perturbation, enhanced feedback, augmentation, or ordered-statistics decoding. The entire pipeline stays differentiable so that gradient descent can tune the GNN component directly from decoding performance.

Core claim

By placing GNN layers between consecutive BP runs on the shared sparse decoding graph, the decoder learns to extract and apply knowledge from previous iterations that have become trapped, thereby compensating for the sub-optimal graphs forced by quantum LDPC construction constraints and producing a measurable drop in the error floor.

What carries the argument

Hybrid architecture of belief propagation stages interleaved with graph neural network layers defined over the identical sparse decoding graph.

If this is right

The decoder scales to large codes because both BP and GNN components remain sparse.
Full differentiability permits direct optimization of the GNN weights via gradient descent on decoding error rates.
The method outperforms random perturbation, enhanced feedback, augmentation, and OSD while using significantly fewer post-processing attempts.
Error-floor reduction occurs specifically by learning compensation for trapping sets and short cycles.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Similar interleaving of learned layers with classical iterative steps could be tested on classical LDPC codes that suffer from comparable graph defects.
Training across multiple code instances might produce a single decoder usable across an entire family of quantum LDPC codes.
The approach suggests that learned components can augment rather than replace existing message-passing algorithms when the underlying graph is fixed by code design.

Load-bearing premise

Gradient-descent training produces a GNN that generalizes to error patterns and code instances outside the training set rather than overfitting to the specific codes or noise models used.

What would settle it

Performance measurements on quantum LDPC codes and error patterns withheld from training show no lowering of the error floor relative to plain BP or the listed post-processing baselines.

Figures

Figures reproduced from arXiv: 2310.17758 by Anqi Gong, Joseph M. Renes, Sebastian Cammerer.

**Figure 1.** Figure 1: Block diagram of the proposed decoder architecture consisting of trainable GNN layers (orange) and classical BP iterations (yellow). The same GNN is sandwiched between block BP runs of iteration p64, 16, 16, 16q. pends on the unsatisfied checks. The goal of our work is to learn those perturbations using a GNN. The GNN follows the concept of [12] and acts as an intermediate layer between independent BP r… view at source ↗

**Figure 2.** Figure 2: Unrolled feedback GNN operating on the Tanner graph, showing the inside of the orange boxes in Fig. (1). The VN feature is initialized using Λpost from the previous BP run and the CN feature is calculated using Eq (9,10). Each edge message is calculated using the features of its two endpoints. After that, each variable node aggregates the incoming X (red) and Ztype (blue) messages and then uses them toge… view at source ↗

**Figure 3.** Figure 3: Logical error rate of ¹1270, 28,ď 46º codes using feedback GNNs on depolarizing channel. Comparison of the performance of the coarse and the refined GNN trained on easy and mixed samples respectively. the coarse GNN has an error floor of a logical error rate around 10´6 while the finetuned version shows a significantly lower error floor. 5 Simulation The message passing (MP) decoder used in this work is… view at source ↗

**Figure 4.** Figure 4: Logical error rate of the ¹1270, 28,ď 46º and the ¹882, 24,ď 24º codes using various post-processing methods on depolarizing channel. Na is the maximum number of attempts. For our feedback GNNs, only the first block run of BP4 needs 64 iterations, while 16 iterations are enough for the post-processing block BP4 run. For example, three attempts will involve 64 ` 16 ˆ 3 “ 112 iterations of flooding BP in tot… view at source ↗

read the original abstract

In this work, we propose a fully differentiable iterative decoder for quantum low-density parity-check (LDPC) codes. The proposed algorithm is composed of classical belief propagation (BP) decoding stages and intermediate graph neural network (GNN) layers. Both component decoders are defined over the same sparse decoding graph enabling a seamless integration and scalability to large codes. The core idea is to use the GNN component between consecutive BP runs, so that the knowledge from the previous BP run, if stuck in a local minima caused by trapping sets or short cycles in the decoding graph, can be leveraged to better initialize the next BP run. By doing so, the proposed decoder can learn to compensate for sub-optimal BP decoding graphs that result from the design constraints of quantum LDPC codes. Since the entire decoder remains differentiable, gradient descent-based training is possible. We compare the error rate performance of the proposed decoder against various post-processing methods such as random perturbation, enhanced feedback, augmentation, and ordered-statistics decoding (OSD) and show that a carefully designed training process lowers the error-floor significantly. As a result, our proposed decoder outperforms the former three methods using significantly fewer post-processing attempts. The source code of our experiments is available online.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper interleaves differentiable GNN layers with BP on the same quantum LDPC graph to handle trapping sets, but the reported gains rest on unverified generalization to new codes and error patterns.

read the letter

The main takeaway is a hybrid decoder that runs classical BP, feeds the output into GNN layers on the identical sparse graph, then feeds the GNN output back to initialize the next BP round. The whole pipeline stays differentiable, so it trains end-to-end. This is a concrete new arrangement for quantum LDPC codes; prior work has used neural networks for decoding but not this tight interleaving on the same graph with explicit compensation for short cycles and trapping sets. The code release is also useful for anyone who wants to reproduce or extend it. The comparisons to random perturbation, enhanced feedback, augmentation, and OSD are the right baselines to check against. The soft spot is exactly the one the stress-test flags. The abstract claims the trained GNN lowers the error floor and beats the other methods with fewer post-processing steps, yet it gives no numbers on block lengths, code families, training versus test split, or whether test instances come from larger or different codes. If the experiments only evaluate on the same codes or noise realizations used in training, the outperformance could be overfitting rather than a general fix for sub-optimal BP graphs. Without those details the central claim stays provisional. This paper is for people already working on decoders for quantum LDPC codes who need practical improvements over plain BP. It deserves peer review because the architecture is well-specified, the differentiability argument is clean, and the code is public; a referee can verify the training protocol and ask for the missing generalization checks. I would send it out rather than desk-reject.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes a fully differentiable hybrid decoder for quantum LDPC codes that interleaves classical belief propagation (BP) iterations with graph neural network (GNN) layers defined on the same sparse Tanner graph. The GNN component is inserted between BP runs to use information from a prior run (potentially trapped by short cycles or trapping sets) to reinitialize the next BP run. The entire architecture is trained end-to-end via gradient descent; the authors claim that a suitably trained model lowers the error floor and outperforms random perturbation, enhanced feedback, augmentation, and ordered-statistics decoding (OSD) while requiring significantly fewer post-processing attempts. Source code is stated to be available online.

Significance. If the reported error-rate improvements hold under proper train/test separation, the work would demonstrate a practical, scalable route to mitigating the impact of sub-optimal decoding graphs that arise from the CSS construction constraints of quantum LDPC codes. The seamless BP-GNN integration and end-to-end differentiability are technically attractive, and the public release of code is a clear reproducibility strength.

major comments (2)

[Abstract / Evaluation (implied)] The central performance claim (lower error floor and outperformance versus the listed baselines) rests on the GNN generalizing to error patterns and code instances outside the training distribution. The abstract and evaluation description do not state whether the test codes, block lengths, or noise models are drawn from a disjoint family or larger instances than those used for training; without this information the reported gains cannot be assessed for overfitting versus genuine compensation for trapping sets.
[Abstract] No quantitative error-rate curves, code parameters (n,k,d), training-set sizes, or statistical significance tests are supplied in the abstract; the manuscript must include these data (with explicit train/test separation) to substantiate the claim that the hybrid decoder outperforms the baselines with fewer attempts.

minor comments (1)

[Abstract] The abstract asserts that the decoder 'outperforms the former three methods using significantly fewer post-processing attempts' but does not define what constitutes a 'post-processing attempt' for the GNN component; a precise operational definition would aid comparison.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback highlighting the need for clearer statements on generalization and quantitative details. We address each point below and will revise the manuscript to strengthen these aspects.

read point-by-point responses

Referee: [Abstract / Evaluation (implied)] The central performance claim (lower error floor and outperformance versus the listed baselines) rests on the GNN generalizing to error patterns and code instances outside the training distribution. The abstract and evaluation description do not state whether the test codes, block lengths, or noise models are drawn from a disjoint family or larger instances than those used for training; without this information the reported gains cannot be assessed for overfitting versus genuine compensation for trapping sets.

Authors: We agree that explicit confirmation of train/test separation is necessary to support the generalization claims. The full manuscript describes experiments using code instances, block lengths, and noise realizations drawn from distributions disjoint from the training set (see evaluation sections). To make this evident at a glance, we will revise the abstract to state that performance is reported on unseen test codes and error patterns. revision: yes
Referee: [Abstract] No quantitative error-rate curves, code parameters (n,k,d), training-set sizes, or statistical significance tests are supplied in the abstract; the manuscript must include these data (with explicit train/test separation) to substantiate the claim that the hybrid decoder outperforms the baselines with fewer attempts.

Authors: The current abstract is concise by design, but we acknowledge that including key quantitative elements would better substantiate the claims. We will expand the abstract to report representative code parameters (n,k,d), training-set sizes, the number of post-processing attempts, and an explicit reference to the disjoint train/test regime while preserving brevity. revision: yes

Circularity Check

0 steps flagged

No circularity in hybrid BP-GNN decoder claims or training

full rationale

The paper's central contribution is an empirically trained hybrid decoder combining belief propagation stages with intermediate GNN layers on the same sparse graph. Performance is evaluated via direct simulation against external baselines (random perturbation, enhanced feedback, augmentation, OSD) on quantum LDPC codes, with the training process described as standard end-to-end gradient descent. No equations, uniqueness theorems, or ansatzes reduce by construction to fitted parameters or self-citations; the generalization claim rests on experimental results rather than definitional equivalence. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review reveals no explicit free parameters, axioms, or invented entities; the method relies on standard BP message passing, standard GNN layers, and differentiability for training.

pith-pipeline@v0.9.0 · 5746 in / 1093 out tokens · 31802 ms · 2026-05-24T06:27:43.283989+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

21 extracted references · 21 canonical work pages · 1 internal anchor

[1]

Bal- anced product quantum codes

N. P . Breuckmann and J. N. Eberhardt, “Bal- anced product quantum codes”, IEEE Transac- tions on Information Theory 67 (2021)

work page 2021
[2]

Quantum LDPC codes with almost linear minimum distance

P . Panteleev and G. Kalachev, “Quantum LDPC codes with almost linear minimum distance”, IEEE Transactions on Information Theory 68 (2021)

work page 2021
[3]

Asymptotically good quantum and locally testable classical LDPC codes

P . Panteleev and G. Kalachev, “Asymptotically good quantum and locally testable classical LDPC codes”, in Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Com- puting (2022)

work page 2022
[4]

Quantum Tanner codes

A. Leverrier and G. Zémor , “Quantum Tanner codes”, in IEEE 63rd Annual Symposium on Foundations of Computer Science (2022)

work page 2022
[5]

High-threshold and low-overhead fault-tolerant quantum memory

S. Bravyi, A. W . Cross, J. M. Gambetta, D. Maslov, P . Rall, and T . J. Yoder , “High-threshold and low-overhead fault-tolerant quantum memory”, arXiv preprint arXiv:2308.07915 (2023)

work page arXiv 2023
[6]

Degenerate quantum LDPC codes With good ﬁnite length performance

P . Panteleev and G. Kalachev, “Degenerate quantum LDPC codes With good ﬁnite length performance”, Quantum 5 (2021),

work page 2021
[7]

Fifteen years of quantum LDPC coding and improved decoding strategies

Z. Babar, P . Botsinis, D. Alanis, S. X. Ng, and L. Hanzo, “Fifteen years of quantum LDPC coding and improved decoding strategies”, IEEE Access 3 (2015)

work page 2015
[8]

On the iterative decod- ing of sparse quantum codes

D. Poulin and Y . Chung, “On the iterative decod- ing of sparse quantum codes”, Quantum Infor- mation and Computation 8 (2008),

work page 2008
[9]

Trapping sets of quantum LDPC codes

N. Raveendran and B. V asi ´c, “Trapping sets of quantum LDPC codes”, Quantum 5 (2021)

work page 2021
[10]

Stabi- lizer inactivation for message-passing decoding of quantum LDPC codes

J. Du Crest, M. Mhalla, and V . Savin, “Stabi- lizer inactivation for message-passing decoding of quantum LDPC codes”, in IEEE Information Theory Workshop (2022)

work page 2022
[11]

Enhanced feedback iterative decoding of sparse quantum codes

Y .-J. Wang, B. C. Sanders, B.-M. Bai, and X.-M. Wang, “Enhanced feedback iterative decoding of sparse quantum codes”, IEEE Transactions on Information Theory 58 (2012),

work page 2012
[12]

Graph neural networks for channel de- coding

S. Cammerer , J. Hoydis, F . A. Aoudia, and A. Keller, “Graph neural networks for channel de- coding”, in IEEE Globecom Workshops (2022)

work page 2022
[13]

On deep learning-based channel decod- ing

T . Gruber , S. Cammerer , J. Hoydis, and S. ten Brink, “On deep learning-based channel decod- ing”, in IEEE Conference on Information Sci- ences and Systems (2017)

work page 2017
[14]

Good quan- tum error-correcting codes exist

A. R. Calderbank and P . W . Shor, “Good quan- tum error-correcting codes exist”, Physical Re- view A 54 (1996),

work page 1996
[15]

Multiple-particle interference and quantum error correction

A. Steane, “Multiple-particle interference and quantum error correction”, Proceedings of the Royal Society A 452 (1996),

work page 1996
[16]

Modiﬁed belief propagation decoders for quantum low- density parity-check codes

A. Rigby, J. C. Olivier, and P . Jarvis, “Modiﬁed belief propagation decoders for quantum low- density parity-check codes”, Physical Review A 100 (2019),

work page 2019
[17]

Reﬁned belief propaga- tion decoding of sparse-graph quantum codes

K.-Y . Kuo and C.-Y . Lai, “Reﬁned belief propaga- tion decoding of sparse-graph quantum codes”, IEEE Journal on Selected Areas in Information Theory 1 (2020)

work page 2020
[18]

Log-domain decod- ing of quantum LDPC codes over binary ﬁnite ﬁelds

C.-Y . Lai and K.-Y . Kuo, “Log-domain decod- ing of quantum LDPC codes over binary ﬁnite ﬁelds”, IEEE Transactions on Quantum Engi- neering 2 (2021),

work page 2021
[19]

Low-density parity-check codes

R. Gallager , “Low-density parity-check codes”, IRE Transactions on Information Theory 8 (1962)

work page 1962
[20]

Neural belief- propagation decoders for quantum error- correcting codes

Y .-H. Liu and D. Poulin, “Neural belief- propagation decoders for quantum error- correcting codes”, Physical Review Letters 122 (2019),

work page 2019
[21]

Graph Neural Networks for Enhanced Decoding of Quantum LDPC Codes

J. Du Crest, F . Garcia-Herrero, M. Mhalla, V . Savin, and J. V alls, “Layered decoding of quan- tum LDPC codes”, in IEEE 12th International Symposium on Topics in Coding (2023). 8 This figure "pattern_200.png" is available in "png" format from: http://arxiv.org/ps/2310.17758v2

work page internal anchor Pith review Pith/arXiv arXiv 2023

[1] [1]

Bal- anced product quantum codes

N. P . Breuckmann and J. N. Eberhardt, “Bal- anced product quantum codes”, IEEE Transac- tions on Information Theory 67 (2021)

work page 2021

[2] [2]

Quantum LDPC codes with almost linear minimum distance

P . Panteleev and G. Kalachev, “Quantum LDPC codes with almost linear minimum distance”, IEEE Transactions on Information Theory 68 (2021)

work page 2021

[3] [3]

Asymptotically good quantum and locally testable classical LDPC codes

P . Panteleev and G. Kalachev, “Asymptotically good quantum and locally testable classical LDPC codes”, in Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Com- puting (2022)

work page 2022

[4] [4]

Quantum Tanner codes

A. Leverrier and G. Zémor , “Quantum Tanner codes”, in IEEE 63rd Annual Symposium on Foundations of Computer Science (2022)

work page 2022

[5] [5]

High-threshold and low-overhead fault-tolerant quantum memory

S. Bravyi, A. W . Cross, J. M. Gambetta, D. Maslov, P . Rall, and T . J. Yoder , “High-threshold and low-overhead fault-tolerant quantum memory”, arXiv preprint arXiv:2308.07915 (2023)

work page arXiv 2023

[6] [6]

Degenerate quantum LDPC codes With good ﬁnite length performance

P . Panteleev and G. Kalachev, “Degenerate quantum LDPC codes With good ﬁnite length performance”, Quantum 5 (2021),

work page 2021

[7] [7]

Fifteen years of quantum LDPC coding and improved decoding strategies

Z. Babar, P . Botsinis, D. Alanis, S. X. Ng, and L. Hanzo, “Fifteen years of quantum LDPC coding and improved decoding strategies”, IEEE Access 3 (2015)

work page 2015

[8] [8]

On the iterative decod- ing of sparse quantum codes

D. Poulin and Y . Chung, “On the iterative decod- ing of sparse quantum codes”, Quantum Infor- mation and Computation 8 (2008),

work page 2008

[9] [9]

Trapping sets of quantum LDPC codes

N. Raveendran and B. V asi ´c, “Trapping sets of quantum LDPC codes”, Quantum 5 (2021)

work page 2021

[10] [10]

Stabi- lizer inactivation for message-passing decoding of quantum LDPC codes

J. Du Crest, M. Mhalla, and V . Savin, “Stabi- lizer inactivation for message-passing decoding of quantum LDPC codes”, in IEEE Information Theory Workshop (2022)

work page 2022

[11] [11]

Enhanced feedback iterative decoding of sparse quantum codes

Y .-J. Wang, B. C. Sanders, B.-M. Bai, and X.-M. Wang, “Enhanced feedback iterative decoding of sparse quantum codes”, IEEE Transactions on Information Theory 58 (2012),

work page 2012

[12] [12]

Graph neural networks for channel de- coding

S. Cammerer , J. Hoydis, F . A. Aoudia, and A. Keller, “Graph neural networks for channel de- coding”, in IEEE Globecom Workshops (2022)

work page 2022

[13] [13]

On deep learning-based channel decod- ing

T . Gruber , S. Cammerer , J. Hoydis, and S. ten Brink, “On deep learning-based channel decod- ing”, in IEEE Conference on Information Sci- ences and Systems (2017)

work page 2017

[14] [14]

Good quan- tum error-correcting codes exist

A. R. Calderbank and P . W . Shor, “Good quan- tum error-correcting codes exist”, Physical Re- view A 54 (1996),

work page 1996

[15] [15]

Multiple-particle interference and quantum error correction

A. Steane, “Multiple-particle interference and quantum error correction”, Proceedings of the Royal Society A 452 (1996),

work page 1996

[16] [16]

Modiﬁed belief propagation decoders for quantum low- density parity-check codes

A. Rigby, J. C. Olivier, and P . Jarvis, “Modiﬁed belief propagation decoders for quantum low- density parity-check codes”, Physical Review A 100 (2019),

work page 2019

[17] [17]

Reﬁned belief propaga- tion decoding of sparse-graph quantum codes

K.-Y . Kuo and C.-Y . Lai, “Reﬁned belief propaga- tion decoding of sparse-graph quantum codes”, IEEE Journal on Selected Areas in Information Theory 1 (2020)

work page 2020

[18] [18]

Log-domain decod- ing of quantum LDPC codes over binary ﬁnite ﬁelds

C.-Y . Lai and K.-Y . Kuo, “Log-domain decod- ing of quantum LDPC codes over binary ﬁnite ﬁelds”, IEEE Transactions on Quantum Engi- neering 2 (2021),

work page 2021

[19] [19]

Low-density parity-check codes

R. Gallager , “Low-density parity-check codes”, IRE Transactions on Information Theory 8 (1962)

work page 1962

[20] [20]

Neural belief- propagation decoders for quantum error- correcting codes

Y .-H. Liu and D. Poulin, “Neural belief- propagation decoders for quantum error- correcting codes”, Physical Review Letters 122 (2019),

work page 2019

[21] [21]

Graph Neural Networks for Enhanced Decoding of Quantum LDPC Codes

J. Du Crest, F . Garcia-Herrero, M. Mhalla, V . Savin, and J. V alls, “Layered decoding of quan- tum LDPC codes”, in IEEE 12th International Symposium on Topics in Coding (2023). 8 This figure "pattern_200.png" is available in "png" format from: http://arxiv.org/ps/2310.17758v2

work page internal anchor Pith review Pith/arXiv arXiv 2023