Optimization of Sparse VLSF Codes for Short-Packet Transmission via Saddlepoint Methods

Guodong Sun; Jean-Marie Gorce; Philippe Mary; Samir M. Perlaza

arxiv: 2604.16049 · v1 · submitted 2026-04-17 · 💻 cs.IT · math.IT

Optimization of Sparse VLSF Codes for Short-Packet Transmission via Saddlepoint Methods

Guodong Sun , Samir M. Perlaza , Philippe Mary , Jean-Marie Gorce This is my paper

Pith reviewed 2026-05-10 07:49 UTC · model grok-4.3

classification 💻 cs.IT math.IT

keywords VLSF codessaddlepoint approximationshort-packet transmissionstop-feedbackdecoding optimizationachievability boundsmemoryless channelsgradient optimization

0 comments

The pith

Saddlepoint approximations enable gradient-based optimization of decoding parameters in sparse VLSF codes, tightening achievability bounds with a refined rule.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper develops an optimization framework for sparse variable-length stop-feedback codes in short-packet communications. It uses a saddlepoint approximation to make the error probability tractable, allowing efficient joint optimization of decoding thresholds via gradients on standard channels like AWGN, BSC, and BEC. The authors also introduce a refined decoding rule that goes beyond the usual fixed threshold, producing tighter bounds on what rates are achievable. A sympathetic reader would care because short-packet transmission with feedback is key for low-latency applications, and better optimization at low cost could improve reliability without heavy computation.

Core claim

The paper claims that a saddlepoint approximation of the relevant information densities or error probabilities allows analytical gradient computation for optimizing the decoding configuration parameters in sparse VLSF codes. This framework finds near-optimal parameters at low cost for memoryless channels. Additionally, replacing the fixed-threshold decoding rule with a refined version extends the achievable region and shows that the conventional rule is restrictive.

What carries the argument

The saddlepoint approximation applied to the cumulative distribution of information densities in the VLSF decoding process, which provides a differentiable expression for the error probability used in gradient descent optimization of the thresholds.

If this is right

Decoding configurations close to optimal can be obtained efficiently without exhaustive search.
Achievability bounds for VLSF codes can be improved by using the refined decoding rule.
The fixed-threshold rule limits performance and can be outperformed by more flexible rules.
The approach applies directly to AWGN, BSC, and BEC with low computational overhead.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This optimization could be adapted for other feedback coding schemes or non-memoryless channels if similar approximations hold.
It implies that saddlepoint methods might replace simulation-based optimization in designing practical short-packet systems.
The results suggest potential gains in real-world low-latency communications by tightening theoretical bounds.

Load-bearing premise

The saddlepoint approximation remains accurate enough across the parameter space to guide reliable gradient-based optimization of the decoding configuration for the target channels.

What would settle it

Running Monte Carlo simulations or exact computations of the block error rate for the optimized parameters and comparing them to the saddlepoint predictions on the same channels would reveal if the optimization is misled by approximation errors.

Figures

Figures reproduced from arXiv: 2604.16049 by Guodong Sun, Jean-Marie Gorce, Philippe Mary, Samir M. Perlaza.

**Figure 1.** Figure 1: This continuous correction ensures that the saddlepoint [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗

**Figure 4.** Figure 4: Achievable rate of sparse VLSF codes versus number of decoding [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗

**Figure 5.** Figure 5: Achievable rate with respect to SNR. The target decoding error is set [PITH_FULL_IMAGE:figures/full_fig_p005_5.png] view at source ↗

read the original abstract

In this work, we present an optimization framework for sparse variable-length stop-feedback (VLSF) codes based on a saddlepoint approximation, which jointly optimizes the decoding configuration parameters. Thanks to the analytical tractability of the saddlepoint approximation, the framework enables efficient gradient-based optimization of such parameters for common memoryless channels, including the additive white Gaussian noise, binary symmetric, and binary erasure channels. We further propose a refined decoding rule that extends the conventional fixed-threshold rule and leads to a tighter achievability bound. Numerical results demonstrate that our framework provides near-optimal decoding configurations at low computational cost. Moreover, the results from our refined rule demonstrate that the fixed-threshold decoding rule is restrictive and that achievability bounds can be further tightened.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives a workable saddlepoint method to jointly tune decoding parameters for sparse VLSF codes and shows a refined rule that beats fixed thresholds, but the accuracy of the approximation for both optimization and bound claims needs direct checks.

read the letter

The main thing to know is that the authors use saddlepoint approximations to make gradient-based optimization feasible for the decoding parameters of sparse variable-length stop-feedback codes, and they introduce a refined decoding rule that produces tighter achievability bounds than the usual fixed-threshold version on AWGN, BSC, and BEC channels. Their numerics indicate the optimized configs perform near the limits at modest compute cost and that fixed thresholds leave room for improvement. This combination of joint optimization and the refined rule is the concrete advance over prior VLSF work. They do well on tractability. The saddlepoint expressions let them avoid brute-force or full Monte Carlo searches during tuning, which matters for short-packet design where parameters interact. The refined rule relaxes the decision boundary in a way that the numerics show improves the bound without much extra complexity. Credit to the authors for keeping the method analytical enough to support efficient search across common memoryless channels. The soft spot is the saddlepoint approximation itself. It drives both the parameter search and the reported performance comparisons, so any regime-dependent bias could make the configs look better than they are under exact evaluation and could exaggerate the tightening from the refined rule. The paper would be stronger with at least a few spot-checks against exact error probabilities or high-precision simulation for the final optimized points, especially at the short lengths and low error rates of interest. Without that, the near-optimality claim rests on the approximation holding uniformly. This paper is aimed at people who already work on finite-blocklength coding with feedback and need practical design tools for low-latency links. A reader comfortable with saddlepoint methods and VLSF constructions will extract the most value. It deserves a serious referee because the approach is explicit, the claims are testable, and the numerical evidence is presented even if the validation of the approximation needs tightening. I would send it to review with a request for those cross-checks.

Referee Report

2 major / 2 minor

Summary. The manuscript develops a saddlepoint-approximation-based optimization framework for sparse variable-length stop-feedback (VLSF) codes that jointly tunes decoding configuration parameters via gradient descent for AWGN, BSC, and BEC channels. It additionally introduces a refined decoding rule extending the conventional fixed-threshold rule and presents numerical results asserting that the framework yields near-optimal configurations at low cost while the refined rule produces tighter achievability bounds.

Significance. If the saddlepoint approximation remains sufficiently accurate in the short-blocklength, low-error regime, the work supplies a computationally efficient design tool for VLSF codes that could be useful for URLLC applications. The explicit use of the approximation's differentiability to enable gradient-based search is a clear methodological contribution, and the refined rule's potential to improve finite-blocklength bounds merits further study.

major comments (2)

[Numerical Results] Numerical Results section: the claim that the framework produces 'near-optimal decoding configurations' is supported only by comparisons performed under the same saddlepoint approximation used for optimization. No error analysis, comparison against exact tail probabilities, or Monte Carlo validation of the optimized parameters is reported, leaving open the possibility that approximation bias varies across the searched parameter space and produces configurations that are only apparently near-optimal.
[Refined decoding rule] Section describing the refined decoding rule and bound comparisons: the assertion that the refined rule yields tighter achievability bounds than fixed-threshold decoding rests entirely on saddlepoint-evaluated quantities. Without an independent check (e.g., exact computation for small parameters or comparison against known converses), it is impossible to determine whether the reported tightening is genuine or an artifact of the approximation.

minor comments (2)

[Abstract] The abstract states that the framework applies to 'common memoryless channels' but lists only three; an explicit enumeration at the first mention would improve readability.
Notation for the cumulant-generating function and saddlepoint parameters should be introduced with a single consistent definition rather than scattered across the optimization and bound sections.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. The comments correctly identify the need for stronger validation of the saddlepoint approximation when claiming near-optimality and bound improvements. We address each major comment below and outline the revisions we will incorporate.

read point-by-point responses

Referee: [Numerical Results] Numerical Results section: the claim that the framework produces 'near-optimal decoding configurations' is supported only by comparisons performed under the same saddlepoint approximation used for optimization. No error analysis, comparison against exact tail probabilities, or Monte Carlo validation of the optimized parameters is reported, leaving open the possibility that approximation bias varies across the searched parameter space and produces configurations that are only apparently near-optimal.

Authors: We appreciate this observation. The optimization framework minimizes the saddlepoint-approximated error probability, and the near-optimality claim is made with respect to this differentiable objective. Prior literature supports the accuracy of the saddlepoint approximation for tail probabilities in the short-blocklength regime for AWGN, BSC, and BEC. Nevertheless, we agree that potential variation in approximation bias across the parameter space warrants explicit checking. In the revised manuscript we will add Monte Carlo simulations for representative optimized parameter sets on each channel, comparing the approximated error rates against empirical ones, together with a short discussion of the approximation error in the operating regime of interest. revision: yes
Referee: [Refined decoding rule] Section describing the refined decoding rule and bound comparisons: the assertion that the refined rule yields tighter achievability bounds than fixed-threshold decoding rests entirely on saddlepoint-evaluated quantities. Without an independent check (e.g., exact computation for small parameters or comparison against known converses), it is impossible to determine whether the reported tightening is genuine or an artifact of the approximation.

Authors: We thank the referee for raising this point. The refined rule is constructed by allowing the decision threshold to vary with the number of received symbols, which is expected to relax the fixed-threshold restriction and produce tighter bounds. While all numerical comparisons in the current manuscript rely on the saddlepoint method for consistency with the optimization, we acknowledge the value of independent verification. In the revision we will add comparisons against known finite-blocklength converse bounds available in the literature for the three channels. Where feasible, we will also include exact computations for small parameter values to directly illustrate the improvement over fixed-threshold decoding. revision: yes

Circularity Check

0 steps flagged

No significant circularity; saddlepoint approximation applied as external analytical tool

full rationale

The paper's core derivation applies the standard saddlepoint approximation to obtain tractable expressions for gradient-based optimization of VLSF decoding parameters and for a refined decoding rule. This is an external mathematical technique whose cumulant-generating-function form is independent of the specific code parameters being optimized. Numerical demonstrations of near-optimality and bound tightening are presented as empirical outcomes of the framework rather than tautological re-statements of fitted inputs. No self-definitional reduction, fitted-parameter-renamed-as-prediction, or load-bearing self-citation chain is exhibited in the provided abstract or described structure; the central claims remain independently verifiable against exact benchmarks outside the approximation.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Insufficient detail in the abstract to enumerate specific free parameters, axioms, or invented entities; the work relies on standard saddlepoint approximations and memoryless channel models from prior literature.

pith-pipeline@v0.9.0 · 5430 in / 1097 out tokens · 47931 ms · 2026-05-10T07:49:06.584667+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

19 extracted references · 19 canonical work pages

[1]

Exponential error bounds for erasure, list, and decision feedback schemes,

G. Forney, “Exponential error bounds for erasure, list, and decision feedback schemes,”IEEE Transactions on Information Theory, vol. 14, no. 2, pp. 206–220, 1968

work page 1968
[2]

Feedback in the non- asymptotic regime,

Y . Polyanskiy, H. V . Poor, and S. Verdú, “Feedback in the non- asymptotic regime,”IEEE Transactions on Information Theory, vol. 57, no. 8, pp. 4903–4925, 2011

work page 2011
[3]

Variable-length channel codes with probabilistic delay guarantees,

Y . Altu ˘g, H. V . Poor, and S. Verdú, “Variable-length channel codes with probabilistic delay guarantees,” in2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp. 642–649, IEEE, 2015

work page 2015
[4]

Short-packet transmission via variable-length codes in the presence of noisy stop feedback,

J. Östman, R. Devassy, G. Durisi, and E. G. Ström, “Short-packet transmission via variable-length codes in the presence of noisy stop feedback,”IEEE Transactions on Wireless Communications, vol. 20, no. 1, pp. 214–227, 2020

work page 2020
[5]

Variable-length convolutional coding for short blocklengths with decision feedback,

A. R. Williamson, T.-Y . Chen, and R. D. Wesel, “Variable-length convolutional coding for short blocklengths with decision feedback,” IEEE Transactions on Communications, vol. 63, no. 7, pp. 2389–2403, 2015

work page 2015
[6]

Dahlman, S

E. Dahlman, S. Parkvall, and J. Skold,5G NR: The next generation wireless access technology. Academic Press, 2020

work page 2020
[7]

Variable-length feedback codes under a strict delay constraint,

S. H. Kim, D. K. Sung, and T. Le-Ngoc, “Variable-length feedback codes under a strict delay constraint,”IEEE Communications Letters, vol. 19, no. 4, pp. 513–516, 2015

work page 2015
[8]

Opti- mizing transmission lengths for limited feedback with nonbinary LDPC examples,

K. Vakilinia, S. V . Ranganathan, D. Divsalar, and R. D. Wesel, “Opti- mizing transmission lengths for limited feedback with nonbinary LDPC examples,”IEEE Transactions on Communications, vol. 64, no. 6, pp. 2245–2257, 2016

work page 2016
[9]

Variable-length sparse feedback codes for point-to-point, multiple access, and random access channels,

R. C. Yavas, V . Kostina, and M. Effros, “Variable-length sparse feedback codes for point-to-point, multiple access, and random access channels,” IEEE Transactions on Information Theory, vol. 70, no. 4, pp. 2367– 2394, 2023

work page 2023
[10]

Variable-length stop-feedback codes with finite optimal decoding times for BI-AWGN channels,

H. Yang, R. C. Yavas, V . Kostina, and R. D. Wesel, “Variable-length stop-feedback codes with finite optimal decoding times for BI-AWGN channels,” in2022 IEEE International Symposium on Information The- ory (ISIT), pp. 1527–1532, IEEE, 2022

work page 2022
[11]

Incremental redundancy with ACK/NACK feedback at a few optimal decoding times,

H. Yang, R. C. Yavas, V . Kostina, and R. D. Wesel, “Incremental redundancy with ACK/NACK feedback at a few optimal decoding times,”arXiv preprint arXiv:2205.15399, 2022

work page arXiv 2022
[12]

Hall,The bootstrap and Edgeworth expansion

P. Hall,The bootstrap and Edgeworth expansion. Springer Science & Business Media, 2013

work page 2013
[13]

V . V . Petrov,Sums of independent random variables, vol. 82. Springer Science & Business Media, 2012

work page 2012
[14]

Channel coding rate in the finite blocklength regime,

Y . Polyanskiy, H. V . Poor, and S. Verdú, “Channel coding rate in the finite blocklength regime,”IEEE Transactions on Information Theory, vol. 56, no. 5, pp. 2307–2359, 2010

work page 2010
[15]

R. W. Butler,Saddlepoint approximations with applications, vol. 22. Cambridge University Press, 2007

work page 2007
[16]

Juniper: An open- source nonlinear branch-and-bound solver in Julia,

O. Kröger, C. Coffrin, H. Hijazi, and H. Nagarajan, “Juniper: An open- source nonlinear branch-and-bound solver in Julia,” inInternational conference on the integration of constraint programming, artificial intelligence, and operations research, pp. 377–386, Springer, 2018

work page 2018
[17]

Saddle point approximation for the distribu- tion of the sum of independent random variables,

R. Lugannani and S. Rice, “Saddle point approximation for the distribu- tion of the sum of independent random variables,”Advances in applied probability, vol. 12, no. 2, pp. 475–490, 1980

work page 1980
[18]

MolavianJazi,A unified approach to Gaussian channels with finite blocklength

E. MolavianJazi,A unified approach to Gaussian channels with finite blocklength. University of Notre Dame, 2014

work page 2014
[19]

A. C. Rencher and G. B. Schaalje,Linear models in statistics. John Wiley & Sons, 2008

work page 2008

[1] [1]

Exponential error bounds for erasure, list, and decision feedback schemes,

G. Forney, “Exponential error bounds for erasure, list, and decision feedback schemes,”IEEE Transactions on Information Theory, vol. 14, no. 2, pp. 206–220, 1968

work page 1968

[2] [2]

Feedback in the non- asymptotic regime,

Y . Polyanskiy, H. V . Poor, and S. Verdú, “Feedback in the non- asymptotic regime,”IEEE Transactions on Information Theory, vol. 57, no. 8, pp. 4903–4925, 2011

work page 2011

[3] [3]

Variable-length channel codes with probabilistic delay guarantees,

Y . Altu ˘g, H. V . Poor, and S. Verdú, “Variable-length channel codes with probabilistic delay guarantees,” in2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp. 642–649, IEEE, 2015

work page 2015

[4] [4]

Short-packet transmission via variable-length codes in the presence of noisy stop feedback,

J. Östman, R. Devassy, G. Durisi, and E. G. Ström, “Short-packet transmission via variable-length codes in the presence of noisy stop feedback,”IEEE Transactions on Wireless Communications, vol. 20, no. 1, pp. 214–227, 2020

work page 2020

[5] [5]

Variable-length convolutional coding for short blocklengths with decision feedback,

A. R. Williamson, T.-Y . Chen, and R. D. Wesel, “Variable-length convolutional coding for short blocklengths with decision feedback,” IEEE Transactions on Communications, vol. 63, no. 7, pp. 2389–2403, 2015

work page 2015

[6] [6]

Dahlman, S

E. Dahlman, S. Parkvall, and J. Skold,5G NR: The next generation wireless access technology. Academic Press, 2020

work page 2020

[7] [7]

Variable-length feedback codes under a strict delay constraint,

S. H. Kim, D. K. Sung, and T. Le-Ngoc, “Variable-length feedback codes under a strict delay constraint,”IEEE Communications Letters, vol. 19, no. 4, pp. 513–516, 2015

work page 2015

[8] [8]

Opti- mizing transmission lengths for limited feedback with nonbinary LDPC examples,

K. Vakilinia, S. V . Ranganathan, D. Divsalar, and R. D. Wesel, “Opti- mizing transmission lengths for limited feedback with nonbinary LDPC examples,”IEEE Transactions on Communications, vol. 64, no. 6, pp. 2245–2257, 2016

work page 2016

[9] [9]

Variable-length sparse feedback codes for point-to-point, multiple access, and random access channels,

R. C. Yavas, V . Kostina, and M. Effros, “Variable-length sparse feedback codes for point-to-point, multiple access, and random access channels,” IEEE Transactions on Information Theory, vol. 70, no. 4, pp. 2367– 2394, 2023

work page 2023

[10] [10]

Variable-length stop-feedback codes with finite optimal decoding times for BI-AWGN channels,

H. Yang, R. C. Yavas, V . Kostina, and R. D. Wesel, “Variable-length stop-feedback codes with finite optimal decoding times for BI-AWGN channels,” in2022 IEEE International Symposium on Information The- ory (ISIT), pp. 1527–1532, IEEE, 2022

work page 2022

[11] [11]

Incremental redundancy with ACK/NACK feedback at a few optimal decoding times,

H. Yang, R. C. Yavas, V . Kostina, and R. D. Wesel, “Incremental redundancy with ACK/NACK feedback at a few optimal decoding times,”arXiv preprint arXiv:2205.15399, 2022

work page arXiv 2022

[12] [12]

Hall,The bootstrap and Edgeworth expansion

P. Hall,The bootstrap and Edgeworth expansion. Springer Science & Business Media, 2013

work page 2013

[13] [13]

V . V . Petrov,Sums of independent random variables, vol. 82. Springer Science & Business Media, 2012

work page 2012

[14] [14]

Channel coding rate in the finite blocklength regime,

Y . Polyanskiy, H. V . Poor, and S. Verdú, “Channel coding rate in the finite blocklength regime,”IEEE Transactions on Information Theory, vol. 56, no. 5, pp. 2307–2359, 2010

work page 2010

[15] [15]

R. W. Butler,Saddlepoint approximations with applications, vol. 22. Cambridge University Press, 2007

work page 2007

[16] [16]

Juniper: An open- source nonlinear branch-and-bound solver in Julia,

O. Kröger, C. Coffrin, H. Hijazi, and H. Nagarajan, “Juniper: An open- source nonlinear branch-and-bound solver in Julia,” inInternational conference on the integration of constraint programming, artificial intelligence, and operations research, pp. 377–386, Springer, 2018

work page 2018

[17] [17]

Saddle point approximation for the distribu- tion of the sum of independent random variables,

R. Lugannani and S. Rice, “Saddle point approximation for the distribu- tion of the sum of independent random variables,”Advances in applied probability, vol. 12, no. 2, pp. 475–490, 1980

work page 1980

[18] [18]

MolavianJazi,A unified approach to Gaussian channels with finite blocklength

E. MolavianJazi,A unified approach to Gaussian channels with finite blocklength. University of Notre Dame, 2014

work page 2014

[19] [19]

A. C. Rencher and G. B. Schaalje,Linear models in statistics. John Wiley & Sons, 2008

work page 2008