Optimization of CV-QKD Under Practical Constraints
Pith reviewed 2026-05-08 18:53 UTC · model grok-4.3
The pith
Reinforcement learning optimizes CV-QKD performance under practical hardware constraints like limited filter taps and finite bit resolution.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Using reinforcement learning, we optimize for practical hardware constraints, including limited FIR filter taps at the transmitter and receiver, mean photon number and finite DAC/ADC resolution. Under these realistic conditions, the proposed approach achieves significant performance improvements.
What carries the argument
A reinforcement learning agent that selects CV-QKD system parameters to maximize the secret key rate while respecting the hardware limits on filters, photon number, and converter resolution.
If this is right
- Higher secret key rates become achievable in CV-QKD links that must use limited FIR filter taps at both ends.
- Finite DAC and ADC resolution no longer impose as large a penalty on the achievable rate when parameters are RL-tuned.
- The mean photon number can be chosen to balance rate and security more effectively under the other hardware limits.
- Overall system performance improves without requiring upgrades to filter complexity or converter bit depth.
Where Pith is reading between the lines
- The same reinforcement learning approach could be extended to optimize other quantum communication protocols that face similar hardware constraints.
- Integrating the trained agent with real-time channel monitoring might allow ongoing adaptation in deployed quantum networks.
- Laboratory tests with actual hardware would directly confirm whether the simulated gains translate to physical implementations.
Load-bearing premise
That the reinforcement learning procedure can be trained and evaluated under the stated constraints in a way that produces genuine, reproducible gains rather than artifacts of the simulation or training setup.
What would settle it
A physical CV-QKD experiment that measures the secret key rate achieved with the RL-optimized parameters against the rate from a conventional optimization under identical filter tap, photon number, and resolution limits.
read the original abstract
Using reinforcement learning, we optimize for practical hardware constraints, including limited FIR filter taps at the transmitter and receiver, mean photon number and finite DAC/ADC resolution. Under these realistic conditions, the proposed approach achieves significant performance improvements.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes using reinforcement learning to optimize continuous-variable quantum key distribution (CV-QKD) systems subject to practical hardware constraints, specifically limited FIR filter taps at transmitter and receiver, mean photon number, and finite DAC/ADC resolution. It claims that this yields significant performance improvements relative to unoptimized or conventionally optimized systems under realistic conditions.
Significance. If the performance gains are substantiated by reproducible simulations with appropriate baselines, statistical error bars, and clear reward-function definitions, the work would be significant for bridging theoretical CV-QKD rate calculations with hardware realities, potentially informing practical system design and increasing achievable secure key rates in constrained deployments.
major comments (2)
- Abstract: the central claim of 'significant performance improvements' is unsupported by any quantitative metrics, key-rate values, baseline comparisons, or error analysis, which is load-bearing for the paper's contribution and prevents verification of the result.
- No methods or results sections are available to inspect the RL formulation (state/action spaces, reward function, training procedure), simulation model for the CV-QKD channel and hardware constraints, or the specific FIR-tap and resolution values used; without these the reproducibility of any reported gains cannot be assessed.
Simulated Author's Rebuttal
We thank the referee for the constructive comments on our manuscript. We address each major comment below and have revised the manuscript to strengthen the presentation of our results and improve reproducibility.
read point-by-point responses
-
Referee: Abstract: the central claim of 'significant performance improvements' is unsupported by any quantitative metrics, key-rate values, baseline comparisons, or error analysis, which is load-bearing for the paper's contribution and prevents verification of the result.
Authors: We agree that the abstract requires quantitative support to substantiate the claim. We have revised the abstract to incorporate specific key-rate values and relative improvements from our simulations, along with references to the baseline methods and the statistical reliability of the results as detailed in the Results section. revision: yes
-
Referee: No methods or results sections are available to inspect the RL formulation (state/action spaces, reward function, training procedure), simulation model for the CV-QKD channel and hardware constraints, or the specific FIR-tap and resolution values used; without these the reproducibility of any reported gains cannot be assessed.
Authors: The manuscript contains dedicated sections describing the RL formulation and simulation results. However, we acknowledge that the level of detail may not have been sufficient for full reproducibility. We have expanded the Methods section with explicit definitions of the state and action spaces, the reward function, training procedure, the CV-QKD channel and hardware model, and a table of the specific parameter values (FIR taps, photon number, DAC/ADC resolution). The Results section has also been augmented with additional simulation details and error analysis. revision: partial
Circularity Check
No significant circularity detected
full rationale
The provided abstract and description present a standard application of reinforcement learning to optimize CV-QKD parameters under hardware constraints such as FIR filter taps, mean photon number, and DAC/ADC resolution. No equations, self-definitional loops, fitted inputs renamed as predictions, or load-bearing self-citations appear in the text. The central claim of performance improvements is framed as an empirical outcome of the RL procedure rather than a mathematical derivation that reduces to its own inputs by construction. Without any visible derivation chain or ansatz smuggling, the approach remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Optimization of CV-QKD Under Practical Constraints Svitlana Matsenko(1), Amirhossein Ghazisaeidi(2), Marcin Jarzyna(3), Konrad Banaszek(3,4), and Darko Zibar(1) (1) DTU Electro, Technical University of Denmark, DK-2800, Kgs. Lyngby, Denmark, svitma@dtu.dk (2) Nokia Bell Labs, 91300 Massy, France (3) Centre for Quantum Optical Technologies, CeNT, Universit...
work page 2026
-
[2]
Acknowledgements M.J. and K.B. acknowledge support by the Euro-pean Union’s Horizon Europe research and inno-vation programme under the project ‘Quantum Security Networks Partnership’ (QSNP, Grant Agreement No. 101114043) and the ‘Quantum Optical Technologies’ project (FENG.02.01-IP.05-0017/23) carried out within the Interna-tional Research Agendas progra...
work page 2021
-
[3]
Y. Zhang, Y. Bian, Z. Li, S. Yu, and H. Guo, “Continuous-variable quantum key distribution system: Past, present, and future”, Applied Physics Reviews, vol. 11, no. 1, Mar. 2024, ISSN: 1931-9401, DOI: 10.1063/5.0179566
-
[4]
H. Wang, Y. Li, et al., “High-rate continuous-variable quantum key distribution over 100 km fiber with compos-able security”, Optica, vol. 12, no. 10, pp. 1657–1667, Oct. 2025, ISSN: 2334-2536, DOI: 10.1364/OPTICA. 566359
-
[5]
T. A. Eriksson et al., “Wavelength division multiplexing of continuous variable quantum key distribution and 18.3 Tbit/s data channels”, Communications Physics, vol. 2, no. 9, Jan. 2019, DOI: 10.1038/s42005-018-0105-5
-
[6]
A. Leverrier and P. Grangier, “Unconditional security proof of long-distance continuous-variable quantum key distribution with discrete modulation”, Physical Review Letters, vol. 102, no. 18, May 2009, DOI: 10.1103/PhysRevLett.102.180504
-
[7]
S. Matsenko et al., “Mode mismatch mitigation in Gauss-ian-modulated CV-QKD”, in 2025 European Conference on Optical Communications (ECOC), 2025, pp. 1–4, DOI: 10.1109/ECOC66593.2025.11263288
-
[8]
Tx-Rx mode mismatch effects in gaussian-modulated CV-QKD
M. Kucharczyk, M. Jachura, M. Jarzyna, K. Banaszek, and A. Ghazisaeidi, “Tx-Rx mode mismatch effects in gaussian-modulated CV-QKD”, in ECOC 2024; 50th Eu-ropean Conference on Optical Communication, 2024, pp. 1366–1369
work page 2024
-
[9]
F. Laudenbach, C. Pacher, C.-H. F. Fung, et al., "Contin-uous-variable quantum key distribution with Gaussian modulation: the theory of practical implementations," Ad-vanced Quantum Technologies, vol. 1, 2018, Art. no. 1800011
work page 2018
-
[10]
S. Pirandola, U. L. Andersen, L. Banchi, et al., "Advances in quantum cryptography," Advances in Optics and Pho-tonics, vol. 12, no. 4, 2020, pp. 1012–1236. DOI: 10.1364/AOP.361502
-
[11]
F. Roumestan, A. Ghazisaeidi, et al., “Demonstration of probabilistic constellation shaping for continuous variable quantum key distribution,” 2021 Optical Fiber Communi-cations Conference and Exhibition (OFC),
work page 2021
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.