Variational Diffusion Channel Decoder

Chengwei Zhang; Siyu Liao; Yifan Du

arxiv: 2605.18902 · v1 · pith:Z5RUTGDNnew · submitted 2026-05-17 · 💻 cs.IT · cs.LG· math.IT

Variational Diffusion Channel Decoder

Chengwei Zhang , Yifan Du , Siyu Liao This is my paper

Pith reviewed 2026-05-20 13:06 UTC · model grok-4.3

classification 💻 cs.IT cs.LGmath.IT

keywords neural channel decodervariational diffusionbelief propagationerror correctionlow-complexity decodingwireless communicationdiffusion models

0 comments

The pith

A channel decoder fuses belief propagation with variational diffusion to match top error-correction accuracy while cutting model size and compute sharply.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out to build a neural decoder for error-correcting codes that keeps the high accuracy gains of recent learning-based methods but avoids their heavy storage and run-time demands. It does so by embedding the iterative message-passing steps of classical belief propagation inside a variational diffusion framework. A reader would care because practical communication and storage links need reliable decoding that can run on modest hardware without long delays. If the integration works as claimed, the resulting decoder becomes small enough and fast enough for real-time use while still outperforming prior neural alternatives on error rates.

Core claim

The variational diffusion channel decoder integrates the domain-specific belief propagation process into the modern diffusion model. By reaping the low-cost benefits of belief propagation and the strong learning capability of the diffusion model, the decoder simultaneously achieves very low cost and high error-correcting performance. Experimental results show that, compared with the state-of-the-art neural channel decoders, the model provides a feasible solution for practical deployment via achieving the best decoding performance with significantly reduced computational cost and model size.

What carries the argument

Variational diffusion model that folds in belief-propagation message passing, letting each diffusion step reuse low-cost iterative updates while learning noise patterns from data.

If this is right

The decoder reaches the highest error-correcting rates among neural methods while using substantially fewer parameters and fewer operations per codeword.
The same architecture supplies a practical path for deploying advanced decoders inside latency-sensitive or memory-limited systems such as wireless modems and flash controllers.
The hybrid construction preserves the interpretability and low per-iteration cost of belief propagation while gaining the generative flexibility of diffusion steps.
Training and inference remain stable enough to produce consistent gains across the tested channel conditions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same fusion pattern could be tried on other classical iterative algorithms such as turbo or LDPC message passing to reduce their neural counterparts' overhead.
Because the diffusion steps are now anchored by domain-specific updates, the model may generalize more readily to unseen channel statistics than a pure diffusion decoder.
Hardware implementations could exploit the sparse, iterative nature of the belief-propagation component to achieve further speed-ups on FPGA or ASIC targets.

Load-bearing premise

That inserting belief propagation into the variational diffusion process keeps the efficiency of belief propagation and the learning power of diffusion without adding instabilities or forcing performance trade-offs.

What would settle it

A benchmark run on standard LDPC or polar codes that shows the new decoder either matches or exceeds prior neural decoders in bit-error rate only when its parameter count and FLOPs are comparable to those earlier models.

Figures

Figures reproduced from arXiv: 2605.18902 by Chengwei Zhang, Siyu Liao, Yifan Du.

**Figure 1.** Figure 1: Standard BER curves for LDPC (121, 60). TABLE III COMPLEXITY COMPARISON (LDPC 121,60 LIGHTEST SETTING) Model Total FLOPs Model Size BP 316.4 K 0 B HGN 1.6 G 1.6 MB DDECC-Max 140.3 G 226.3 KB Ours-20 377.6 K 264.0 B decoding latency depends heavily on hardware, FLOPs serve as a deterministic proxy. Our VCDC achieves up to 5 orders of magnitude reduction in FLOPs compared to the strong baseline DDECC. Moreov… view at source ↗

read the original abstract

Neural channel decoder, as a data-driven channel decoding strategy, has shown very promising improvement on error-correcting capability over the classical methods. However, the success of those deep learning-based decoder comes at the cost of drastically increased model storage and computational complexity, hindering their practical adoptions in real-world time-sensitive resource-sensitive communication and storage systems. To address this challenge, we propose an efficient variational diffusion model-based channel decoder, which effectively integrates the domain-specific belief propagation process to the modern diffusion model. By reaping the low-cost benefits of belief propagation and strong learning capability of diffusion model, our proposed neural decoder simultaneously achieves very low cost and high error-correcting performance. Experimental results show that, compared with the state-of-the-art neural channel decoders, our model provides a feasible solution for practical deployment via achieving the best decoding performance with significantly reduced computational cost and model size.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This decoder mixes variational diffusion with belief propagation and reports better performance at lower cost than other neural decoders.

read the letter

Colleague, The main point is this decoder mixes variational diffusion with belief propagation by putting the BP steps into the reverse diffusion process. Experiments show it gets top decoding performance with much lower cost and size than other neural ones. They adapt the diffusion framework so that each reverse step includes a round of belief propagation messages based on the code's graph. This lets the model use the known structure instead of learning it all. The paper explains the changes to the variational training objective and then gives results against recent neural decoders. Those results use standard error rate measures and show the hybrid coming out ahead on accuracy while being lighter on resources. The soft spots are not big. It is possible the added BP steps could affect the diffusion's ability to explore or add some overhead, but the reported numbers don't reflect that and the method is laid out without contradictions. The experiments follow common practices in the area. This is for engineers and researchers trying to deploy neural decoders in hardware with tight constraints. It shows one way to make them more efficient. I would send this to peer review. It has a clear contribution and enough detail to review properly.

Referee Report

1 major / 3 minor

Summary. The paper proposes a variational diffusion model for channel decoding that embeds domain-specific belief propagation iterations into the diffusion reverse process. It claims this hybrid construction delivers superior error-correction performance relative to existing neural decoders while using substantially lower computational cost and smaller model size, thereby offering a practical solution for resource-constrained systems.

Significance. If the reported gains hold under the experimental conditions, the work demonstrates a viable route to practical neural decoders by retaining the low per-iteration cost of belief propagation while leveraging the representational power of diffusion models. This addresses a central barrier to deployment of learned decoders in time-sensitive communication and storage applications and may encourage further domain-knowledge injection into generative models for coding problems.

major comments (1)

§4.2, complexity analysis: the claim of 'significantly reduced computational cost' is supported by reported FLOPs and parameter counts, yet the scaling with block length and diffusion steps is only shown for a single code length; an explicit complexity expression or additional curves for longer codes would be needed to substantiate the practical-deployment conclusion.

minor comments (3)

Abstract: the statement 'best decoding performance' is not accompanied by any numerical delta; adding the key BER or BLER improvement figures would strengthen the abstract.
§3.3, training objective: the modified ELBO that incorporates the BP messages is presented clearly, but the weighting schedule between the diffusion loss and the BP consistency term is not stated explicitly; a short equation or table entry would remove ambiguity.
Figure 4: the legend does not distinguish the proposed model from the 'BP-Diffusion' ablation; a clearer marker or caption would improve readability.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the positive evaluation and the recommendation for minor revision. The single major comment is addressed below; we will incorporate the requested additions to strengthen the complexity analysis.

read point-by-point responses

Referee: [—] §4.2, complexity analysis: the claim of 'significantly reduced computational cost' is supported by reported FLOPs and parameter counts, yet the scaling with block length and diffusion steps is only shown for a single code length; an explicit complexity expression or additional curves for longer codes would be needed to substantiate the practical-deployment conclusion.

Authors: We agree that an explicit complexity expression and scaling results for additional block lengths would better substantiate the practical-deployment claims. In the revised manuscript we will add a closed-form complexity expression (in terms of block length N and diffusion steps T) to §4.2 and include supplementary curves for N = 128 and N = 256 that report FLOPs and parameter counts versus T, confirming the favorable scaling relative to competing neural decoders. revision: yes

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper presents a hybrid architecture that embeds domain-specific belief propagation steps inside a variational diffusion reverse process for channel decoding. All load-bearing claims about superior error-correction performance and reduced model size/complexity are framed as outcomes of empirical experiments on standard metrics, not as quantities derived by algebraic identity or parameter fitting from the model equations themselves. No self-citation is used to justify uniqueness or to close the derivation loop, and the integration is described as a constructive design choice rather than a redefinition of inputs as outputs. The manuscript therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Review performed on abstract only; no equations, training objectives, or architectural details are available to identify free parameters, axioms, or invented entities. The model presumably contains standard diffusion hyperparameters and channel-model assumptions, but none are stated.

pith-pipeline@v0.9.0 · 5672 in / 1219 out tokens · 53646 ms · 2026-05-20T13:06:39.872917+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

integrates the domain-specific belief propagation process to the modern diffusion model... VDM’s unconstrained αt/σt parameterization naturally accommodates the AWGN channel as the forward process
IndisputableMonolith/Foundation/ArithmeticFromLogic.lean LogicNat induction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

limit our reverse process up to 20 timesteps... early stopping when parity-check errors reach zero

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

28 extracted references · 28 canonical work pages · 1 internal anchor

[1]

Statistical pruning for near-maximum likelihood decoding,

R. Gowaikar and B. Hassibi, “Statistical pruning for near-maximum likelihood decoding,”IEEE Transactions on Signal Processing, vol. 55, no. 6, pp. 2661–2675, 2007

work page 2007
[2]

A 58.6/91.3 pj/b dual-mode belief-propagation decoder for ldpc and polar codes in the 5g communi- cations standard,

B.-S. Su, C.-H. Lee, and T.-D. Chiueh, “A 58.6/91.3 pj/b dual-mode belief-propagation decoder for ldpc and polar codes in the 5g communi- cations standard,”IEEE Solid-State Circuits Letters, vol. 5, pp. 98–101, 2022

work page 2022
[3]

Constructing free-energy approximations and generalized belief propagation algorithms,

J. S. Yedidia, W. T. Freeman, and Y . Weiss, “Constructing free-energy approximations and generalized belief propagation algorithms,”IEEE Transactions on information theory, vol. 51, no. 7, pp. 2282–2312, 2005

work page 2005
[4]

Learning to decode linear codes using deep learning,

E. Nachmani, Y . Be’ery, and D. Burshtein, “Learning to decode linear codes using deep learning,” in2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton). IEEE, 2016, pp. 341–346

work page 2016
[5]

Hyper-graph-network decoders for block codes,

E. Nachmani and L. Wolf, “Hyper-graph-network decoders for block codes,” inAdvances in Neural Information Processing Systems, 2019, pp. 2326–2336

work page 2019
[6]

Deep learning for decoding of linear codes-a syndrome-based approach,

A. Bennatan, Y . Choukroun, and P. Kisilev, “Deep learning for decoding of linear codes-a syndrome-based approach,” in2018 IEEE International Symposium on Information Theory (ISIT). IEEE, 2018, pp. 1595–1599

work page 2018
[7]

Error correction code transformer,

Y . Choukroun and L. Wolf, “Error correction code transformer,”Ad- vances in Neural Information Processing Systems, vol. 35, pp. 38 695– 38 705, 2022

work page 2022
[8]

Attention is all you need,

A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, “Attention is all you need,”Advances in neural information processing systems, vol. 30, 2017

work page 2017
[9]

Denoising diffusion error correction codes,

Y . Choukroun and L. Wolf, “Denoising diffusion error correction codes,” inThe Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. [Online]. Available: https://openreview.net/pdf?id=rLwC0 MG-4w

work page 2023
[10]

A survey on low latency towards 5g: Ran, core network and caching solutions,

I. Parvez, A. Rahmati, I. Guvenc, A. I. Sarwat, and H. Dai, “A survey on low latency towards 5g: Ran, core network and caching solutions,” IEEE Communications Surveys & Tutorials, vol. 20, no. 4, pp. 3098– 3130, 2018

work page 2018
[11]

A survey of end-to-end solutions for reliable low-latency communications in 5g networks,

D. Rico and P. Merino, “A survey of end-to-end solutions for reliable low-latency communications in 5g networks,”IEEE Access, vol. 8, pp. 192 808–192 834, 2020

work page 2020
[12]

Variational diffusion models,

D. Kingma, T. Salimans, B. Poole, and J. Ho, “Variational diffusion models,”Advances in neural information processing systems, vol. 34, pp. 21 696–21 707, 2021

work page 2021
[13]

Neural offset min-sum decoding,

L. Lugosch and W. J. Gross, “Neural offset min-sum decoding,” in2017 IEEE International Symposium on Information Theory (ISIT). IEEE, 2017, pp. 1361–1365

work page 2017
[14]

An iterative bp-cnn architecture for chan- nel decoding,

F. Liang, C. Shen, and F. Wu, “An iterative bp-cnn architecture for chan- nel decoding,”IEEE Journal of Selected Topics in Signal Processing, vol. 12, no. 1, pp. 144–159, 2018

work page 2018
[15]

Doubly residual neural decoder: Towards low-complexity high-performance channel decoding,

S. Liao, C. Deng, M. Yin, and B. Yuan, “Doubly residual neural decoder: Towards low-complexity high-performance channel decoding,” inProceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 10, 2021, pp. 8574–8582

work page 2021
[16]

On deep learning- based channel decoding,

T. Gruber, S. Cammerer, J. Hoydis, and S. Ten Brink, “On deep learning- based channel decoding,” in2017 51st annual conference on information sciences and systems (CISS). IEEE, 2017, pp. 1–6

work page 2017
[17]

Denoising diffusion implicit models,

J. Song, C. Meng, and S. Ermon, “Denoising diffusion implicit models,” in9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net, 2021. [Online]. Available: https://openreview.net/forum?id=St1giarCHLP

work page 2021
[18]

Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps,

C. Lu, Y . Zhou, F. Bao, J. Chen, C. Li, and J. Zhu, “Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps,”Advances in Neural Information Processing Systems, vol. 35, pp. 5775–5787, 2022

work page 2022
[19]

Progressive distillation for fast sampling of diffusion models,

T. Salimans and J. Ho, “Progressive distillation for fast sampling of diffusion models,” inThe Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29,

work page 2022
[20]

[Online]

OpenReview.net, 2022. [Online]. Available: https://openreview. net/forum?id=TIdIXIpzhoI

work page 2022
[21]

Lin and D

S. Lin and D. Costello,Error Control Coding: Fundamentals and Appli- cations, ser. Pearson education. Pearson-Prentice Hall, 2004. [Online]. Available: https://books.google.com/books?id=ENwdtAEACAAJ

work page 2004
[22]

Efficient implementations of the sum-product algorithm for decoding ldpc codes,

X.-Y . Hu, E. Eleftheriou, D.-M. Arnold, and A. Dholakia, “Efficient implementations of the sum-product algorithm for decoding ldpc codes,” inGLOBECOM’01. IEEE Global Telecommunications Conference (Cat. No. 01CH37270), vol. 2. IEEE, 2001, pp. 1036–1036E

work page 2001
[23]

Denoising diffusion probabilistic models,

J. Ho, A. Jain, and P. Abbeel, “Denoising diffusion probabilistic models,” Advances in neural information processing systems, vol. 33, pp. 6840– 6851, 2020

work page 2020
[24]

Deep unsupervised learning using nonequilibrium thermodynamics,

J. Sohl-Dickstein, E. Weiss, N. Maheswaranathan, and S. Ganguli, “Deep unsupervised learning using nonequilibrium thermodynamics,” inInternational conference on machine learning. PMLR, 2015, pp. 2256–2265

work page 2015
[25]

Channel polarization: A method for constructing capacity- achieving codes for symmetric binary-input memoryless channels,

E. Arikan, “Channel polarization: A method for constructing capacity- achieving codes for symmetric binary-input memoryless channels,”IEEE Transactions on information Theory, vol. 55, no. 7, pp. 3051–3073, 2009

work page 2009
[26]

Low-density parity-check codes,

R. Gallager, “Low-density parity-check codes,”IRE Transactions on information theory, vol. 8, no. 1, pp. 21–28, 1962

work page 1962
[27]

Database of Channel Codes and ML Simulation Results,

M. Helmling, S. Scholl, F. Gensheimer, T. Dietz, K. Kraft, S. Ruzika, and N. Wehn, “Database of Channel Codes and ML Simulation Results,” www.uni-kl.de/channel-codes, 2019

work page 2019
[28]

Adam: A Method for Stochastic Optimization

D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” in3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings, Y . Bengio and Y . LeCun, Eds., 2015. [Online]. Available: http://arxiv.org/abs/1412.6980

work page internal anchor Pith review Pith/arXiv arXiv 2015

[1] [1]

Statistical pruning for near-maximum likelihood decoding,

R. Gowaikar and B. Hassibi, “Statistical pruning for near-maximum likelihood decoding,”IEEE Transactions on Signal Processing, vol. 55, no. 6, pp. 2661–2675, 2007

work page 2007

[2] [2]

A 58.6/91.3 pj/b dual-mode belief-propagation decoder for ldpc and polar codes in the 5g communi- cations standard,

B.-S. Su, C.-H. Lee, and T.-D. Chiueh, “A 58.6/91.3 pj/b dual-mode belief-propagation decoder for ldpc and polar codes in the 5g communi- cations standard,”IEEE Solid-State Circuits Letters, vol. 5, pp. 98–101, 2022

work page 2022

[3] [3]

Constructing free-energy approximations and generalized belief propagation algorithms,

J. S. Yedidia, W. T. Freeman, and Y . Weiss, “Constructing free-energy approximations and generalized belief propagation algorithms,”IEEE Transactions on information theory, vol. 51, no. 7, pp. 2282–2312, 2005

work page 2005

[4] [4]

Learning to decode linear codes using deep learning,

E. Nachmani, Y . Be’ery, and D. Burshtein, “Learning to decode linear codes using deep learning,” in2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton). IEEE, 2016, pp. 341–346

work page 2016

[5] [5]

Hyper-graph-network decoders for block codes,

E. Nachmani and L. Wolf, “Hyper-graph-network decoders for block codes,” inAdvances in Neural Information Processing Systems, 2019, pp. 2326–2336

work page 2019

[6] [6]

Deep learning for decoding of linear codes-a syndrome-based approach,

A. Bennatan, Y . Choukroun, and P. Kisilev, “Deep learning for decoding of linear codes-a syndrome-based approach,” in2018 IEEE International Symposium on Information Theory (ISIT). IEEE, 2018, pp. 1595–1599

work page 2018

[7] [7]

Error correction code transformer,

Y . Choukroun and L. Wolf, “Error correction code transformer,”Ad- vances in Neural Information Processing Systems, vol. 35, pp. 38 695– 38 705, 2022

work page 2022

[8] [8]

Attention is all you need,

A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, “Attention is all you need,”Advances in neural information processing systems, vol. 30, 2017

work page 2017

[9] [9]

Denoising diffusion error correction codes,

Y . Choukroun and L. Wolf, “Denoising diffusion error correction codes,” inThe Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. [Online]. Available: https://openreview.net/pdf?id=rLwC0 MG-4w

work page 2023

[10] [10]

A survey on low latency towards 5g: Ran, core network and caching solutions,

I. Parvez, A. Rahmati, I. Guvenc, A. I. Sarwat, and H. Dai, “A survey on low latency towards 5g: Ran, core network and caching solutions,” IEEE Communications Surveys & Tutorials, vol. 20, no. 4, pp. 3098– 3130, 2018

work page 2018

[11] [11]

A survey of end-to-end solutions for reliable low-latency communications in 5g networks,

D. Rico and P. Merino, “A survey of end-to-end solutions for reliable low-latency communications in 5g networks,”IEEE Access, vol. 8, pp. 192 808–192 834, 2020

work page 2020

[12] [12]

Variational diffusion models,

D. Kingma, T. Salimans, B. Poole, and J. Ho, “Variational diffusion models,”Advances in neural information processing systems, vol. 34, pp. 21 696–21 707, 2021

work page 2021

[13] [13]

Neural offset min-sum decoding,

L. Lugosch and W. J. Gross, “Neural offset min-sum decoding,” in2017 IEEE International Symposium on Information Theory (ISIT). IEEE, 2017, pp. 1361–1365

work page 2017

[14] [14]

An iterative bp-cnn architecture for chan- nel decoding,

F. Liang, C. Shen, and F. Wu, “An iterative bp-cnn architecture for chan- nel decoding,”IEEE Journal of Selected Topics in Signal Processing, vol. 12, no. 1, pp. 144–159, 2018

work page 2018

[15] [15]

Doubly residual neural decoder: Towards low-complexity high-performance channel decoding,

S. Liao, C. Deng, M. Yin, and B. Yuan, “Doubly residual neural decoder: Towards low-complexity high-performance channel decoding,” inProceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 10, 2021, pp. 8574–8582

work page 2021

[16] [16]

On deep learning- based channel decoding,

T. Gruber, S. Cammerer, J. Hoydis, and S. Ten Brink, “On deep learning- based channel decoding,” in2017 51st annual conference on information sciences and systems (CISS). IEEE, 2017, pp. 1–6

work page 2017

[17] [17]

Denoising diffusion implicit models,

J. Song, C. Meng, and S. Ermon, “Denoising diffusion implicit models,” in9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net, 2021. [Online]. Available: https://openreview.net/forum?id=St1giarCHLP

work page 2021

[18] [18]

Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps,

C. Lu, Y . Zhou, F. Bao, J. Chen, C. Li, and J. Zhu, “Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps,”Advances in Neural Information Processing Systems, vol. 35, pp. 5775–5787, 2022

work page 2022

[19] [19]

Progressive distillation for fast sampling of diffusion models,

T. Salimans and J. Ho, “Progressive distillation for fast sampling of diffusion models,” inThe Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29,

work page 2022

[20] [20]

[Online]

OpenReview.net, 2022. [Online]. Available: https://openreview. net/forum?id=TIdIXIpzhoI

work page 2022

[21] [21]

Lin and D

S. Lin and D. Costello,Error Control Coding: Fundamentals and Appli- cations, ser. Pearson education. Pearson-Prentice Hall, 2004. [Online]. Available: https://books.google.com/books?id=ENwdtAEACAAJ

work page 2004

[22] [22]

Efficient implementations of the sum-product algorithm for decoding ldpc codes,

X.-Y . Hu, E. Eleftheriou, D.-M. Arnold, and A. Dholakia, “Efficient implementations of the sum-product algorithm for decoding ldpc codes,” inGLOBECOM’01. IEEE Global Telecommunications Conference (Cat. No. 01CH37270), vol. 2. IEEE, 2001, pp. 1036–1036E

work page 2001

[23] [23]

Denoising diffusion probabilistic models,

J. Ho, A. Jain, and P. Abbeel, “Denoising diffusion probabilistic models,” Advances in neural information processing systems, vol. 33, pp. 6840– 6851, 2020

work page 2020

[24] [24]

Deep unsupervised learning using nonequilibrium thermodynamics,

J. Sohl-Dickstein, E. Weiss, N. Maheswaranathan, and S. Ganguli, “Deep unsupervised learning using nonequilibrium thermodynamics,” inInternational conference on machine learning. PMLR, 2015, pp. 2256–2265

work page 2015

[25] [25]

Channel polarization: A method for constructing capacity- achieving codes for symmetric binary-input memoryless channels,

E. Arikan, “Channel polarization: A method for constructing capacity- achieving codes for symmetric binary-input memoryless channels,”IEEE Transactions on information Theory, vol. 55, no. 7, pp. 3051–3073, 2009

work page 2009

[26] [26]

Low-density parity-check codes,

R. Gallager, “Low-density parity-check codes,”IRE Transactions on information theory, vol. 8, no. 1, pp. 21–28, 1962

work page 1962

[27] [27]

Database of Channel Codes and ML Simulation Results,

M. Helmling, S. Scholl, F. Gensheimer, T. Dietz, K. Kraft, S. Ruzika, and N. Wehn, “Database of Channel Codes and ML Simulation Results,” www.uni-kl.de/channel-codes, 2019

work page 2019

[28] [28]

Adam: A Method for Stochastic Optimization

D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” in3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings, Y . Bengio and Y . LeCun, Eds., 2015. [Online]. Available: http://arxiv.org/abs/1412.6980

work page internal anchor Pith review Pith/arXiv arXiv 2015