pith. sign in

arxiv: 2512.16224 · v2 · submitted 2025-12-18 · 📡 eess.SP

Simultaneous Secrecy and Covert Communications (SSACC) in Mobility-Aware RIS-Aided Networks

Pith reviewed 2026-05-16 21:59 UTC · model grok-4.3

classification 📡 eess.SP
keywords RISsecrecy capacitycovert communicationsdeep reinforcement learninggenerative diffusion modelspower allocationuser mobility
0
0 comments X

The pith

A generative diffusion model with reinforcement learning balances secrecy capacity and detection error probability in mobile RIS networks.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes a simultaneous secrecy and covert communications scheme in a reconfigurable intelligent surface aided network with a cooperative jammer. It derives closed-form expressions for average minimum detection error probability and average secrecy capacity under a worst-case eavesdropper scenario. To balance these under user mobility, it introduces a generative diffusion model combined with deep reinforcement learning for power allocation optimization. Simulations show this approach converges faster and performs better than standard deep deterministic policy gradient methods.

Core claim

The proposed GDM-DRL algorithm achieves faster convergence and superior performance compared to conventional DDPG methods in balancing security and capacity performance under user mobility.

What carries the argument

The generative diffusion model integrated with deep reinforcement learning for optimizing power allocation in the SSACC scheme.

Load-bearing premise

The eavesdropper can optimally adjust the detection threshold to minimize the detection error probability in the worst-case scenario.

What would settle it

A simulation run where the proposed GDM-DRL algorithm fails to show faster convergence or higher values of both AMDEP and ASC than DDPG would refute the performance claim.

Figures

Figures reproduced from arXiv: 2512.16224 by Haoran Liu, Hua Zhong, Pan Li, Wei Wang, Yanyu Cheng, Yujian Hu.

Figure 1
Figure 1. Figure 1: System model for SSACC system. The figure depicts [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 3
Figure 3. Figure 3: Average channel capacities and ASC versus Alice’s [PITH_FULL_IMAGE:figures/full_fig_p010_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: QoE versus 𝑃𝑚𝑎𝑥, where 𝜅 = 0.8 and 𝜅 = 0.2, respectively. is observed. Since the AMDEP threshold is typically above 80%, the analytical results for AMDEP are suitable for use in the optimization problem. In Fig. 5b, it can be seen that as 𝜅 increases, the ASC first grows, attaining a maximum, and then gradually decreases with further increases in 𝜅 and the derived results align closely with the simulation … view at source ↗
Figure 5
Figure 5. Figure 5: c illustrates the variation of Quality of Experience (QoE) in relation to 𝜅 (where 𝜅 = 𝑃𝐴/𝑃𝑚𝑎𝑥 = 1 − 𝑃ˆ 𝐽 /𝑃𝑚𝑎𝑥). The observed trend is primarily influenced by the ASC, exhibiting an initial increase followed by a subsequent decrease as 𝜅 varies. In the second part of QoE, however, a declining AMDEP combined with a rapidly increasing 𝑅𝑊 results in a diminished impact on QoE. This leads to an overall trend … view at source ↗
Figure 6
Figure 6. Figure 6: Test reward curves of GDM-aided and DDPG-aided [PITH_FULL_IMAGE:figures/full_fig_p012_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: QoE and AMDEP versus 𝜅 under different environ￾mental distance parameters when 𝑃max = 40 dBm. The discrete markers indicate the optimal power allocation solutions output by the GDMOPT algorithm. covertness and secrecy performance and designing a GDM￾enhanced DRL algorithm, we demonstrated that the proposed approach can effectively optimize power allocation to improve system security under diverse user loca… view at source ↗
read the original abstract

In this paper, we propose a simultaneous secrecy and covert communications (SSACC) scheme in a reconfigurable intelligent surface (RIS)-aided network with a cooperative jammer. The scheme enhances communication security by maximizing the secrecy capacity and the detection error probability (DEP). Under a worst-case scenario for covert communications, we consider that the eavesdropper can optimally adjust the detection threshold to minimize the DEP. Accordingly, we derive closedform expressions for both average minimum DEP (AMDEP) and average secrecy capacity (ASC). To balance AMDEP and ASC, we propose a new performance metric and design an algorithm based on generative diffusion models (GDM) and deep reinforcement learning (DRL). The algorithm maximizes data rates under user mobility while ensuring high AMDEP and ASC by optimizing power allocation. Simulation results demonstrate that the proposed algorithm achieves faster convergence and superior performance compared to conventional deep deterministic policy gradient (DDPG) methods, thereby validating its effectiveness in balancing security and capacity performance.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The manuscript proposes a simultaneous secrecy and covert communications (SSACC) scheme in a mobility-aware RIS-aided network with a cooperative jammer. It derives closed-form expressions for average minimum detection error probability (AMDEP) and average secrecy capacity (ASC) under a worst-case eavesdropper that optimally tunes its detection threshold to minimize DEP. A new balancing metric is defined, and a generative diffusion model-deep reinforcement learning (GDM-DRL) algorithm is introduced for power allocation to maximize data rates while maintaining high AMDEP and ASC. Simulations claim faster convergence and superior performance versus conventional DDPG.

Significance. If the closed-form derivations hold and the GDM-DRL policy generalizes beyond the simulated environments, the work would provide analytical expressions and an efficient optimizer for jointly achieving secrecy and covertness in dynamic RIS networks. The integration of generative diffusion models into the DRL framework for this balancing task represents a technical contribution that could influence future learning-based secure communication designs.

major comments (1)
  1. [Performance analysis and metric definition] The closed-form expressions for AMDEP and ASC (derived in the performance analysis section) rest on the assumption that the eavesdropper can optimally adjust its detection threshold to minimize DEP under the worst-case scenario. This assumption directly defines the new performance metric that serves as the reward function for the GDM-DRL algorithm. In mobility-aware settings with time-varying channels, it is unclear whether the eavesdropper can achieve this optimum in real time (e.g., due to imperfect CSI or detection latency), which risks mis-specifying both the analytical results and the reported superiority of GDM-DRL over DDPG.
minor comments (2)
  1. [Abstract and Numerical Results] The abstract and simulation section would benefit from explicit reporting of the number of Monte Carlo trials, channel model parameters (e.g., Rician factors, mobility speed ranges), and error bars or confidence intervals on the convergence and performance curves to allow readers to assess the statistical significance of the GDM-DRL gains.
  2. [Problem Formulation] Notation for the new balancing metric should be introduced with a clear equation number and its relationship to AMDEP and ASC should be stated explicitly in the optimization problem formulation.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. We address the major comment below.

read point-by-point responses
  1. Referee: The closed-form expressions for AMDEP and ASC (derived in the performance analysis section) rest on the assumption that the eavesdropper can optimally adjust its detection threshold to minimize DEP under the worst-case scenario. This assumption directly defines the new performance metric that serves as the reward function for the GDM-DRL algorithm. In mobility-aware settings with time-varying channels, it is unclear whether the eavesdropper can achieve this optimum in real time (e.g., due to imperfect CSI or detection latency), which risks mis-specifying both the analytical results and the reported superiority of GDM-DRL over DDPG.

    Authors: We appreciate the referee's observation. The worst-case eavesdropper assumption, where the eavesdropper optimally tunes its detection threshold to minimize the DEP, is a standard approach in the covert communications literature to derive analytical expressions for the minimum detection error probability. This provides a conservative bound on the covertness performance. The closed-form expressions for AMDEP and ASC are derived by averaging over the mobility-induced channel distributions, assuming the eavesdropper has access to the statistical channel information. While we acknowledge that in practical mobility scenarios with time-varying channels, achieving real-time optimal threshold adjustment may be hindered by imperfect CSI or detection latency, our analysis focuses on the theoretical performance limits under this model. Importantly, both the proposed GDM-DRL and the baseline DDPG algorithms are evaluated under the same metric and assumptions, ensuring a fair comparison of their relative performance in optimizing the power allocation. To further strengthen the manuscript, we will include additional discussion on the practical implications of this assumption and provide simulation results under scenarios with imperfect CSI and delayed threshold optimization. Therefore, we will make partial revisions to the performance analysis and simulation sections. revision: partial

Circularity Check

0 steps flagged

No significant circularity; derivations rest on explicit assumptions and standard models

full rationale

The paper states an explicit worst-case eavesdropper assumption and derives closed-form AMDEP/ASC expressions from standard statistical channel models. These expressions then define the reward metric for the GDM-DRL optimizer, which is trained in simulation rather than by fitting the target quantities to themselves. No self-definitional loops, fitted inputs renamed as predictions, or load-bearing self-citations appear in the derivation chain. The central performance claims therefore remain independent of the inputs by construction.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on standard wireless fading and mobility models for the closed-form derivations plus a tunable balancing parameter in the new performance metric that is optimized by the learning algorithm.

free parameters (1)
  • performance metric balancing parameter
    A weight or scaling factor in the new metric that trades off AMDEP against ASC and is tuned during DRL optimization.
axioms (1)
  • domain assumption Standard statistical models for RIS-reflected channels, jammer signals, and user mobility are valid for averaging over fading and movement.
    Invoked to obtain closed-form expressions for average minimum DEP and average secrecy capacity.

pith-pipeline@v0.9.0 · 5482 in / 1322 out tokens · 32424 ms · 2026-05-16T21:59:51.168498+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

  • IndisputableMonolith/Cost/FunctionalEquation washburn_uniqueness_aczel unclear
    ?
    unclear

    Relation between the paper passage and the cited Recognition theorem.

    Under a worst-case scenario for covert communications, we consider that the eavesdropper can optimally adjust the detection threshold to minimize the DEP. Accordingly, we derive closed-form expressions for both average minimum DEP (AMDEP) and average secrecy capacity (ASC).

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

48 extracted references · 48 canonical work pages · 1 internal anchor

  1. [1]

    From 5g to 6g: A survey on security, privacy, and standardization pathways,

    M. Y ang, Y . Qu, T. Ranbaduge, C. Thapa, N. Sultan, M. Ding, H. Suzuki, W. Ni, S. Abuadbba, D. Smith, P . T yler, J. Pieprzyk, T. Rakotoarivelo, X. Guan, and S. M’rabet, “From 5g to 6g: A survey on security, privacy, and standardization pathways,” 2024. [Online]. Available: https://arxiv.org/abs/2410.21986

  2. [2]

    M. I. Bhat and K. J. Giri, Impact of Computational Power on Cryptog- raphy. Singapore: Springer Singapore, 2021, pp. 45–88

  3. [3]

    Communication theory of secrecy systems,

    C. E. Shannon, “Communication theory of secrecy systems,” The Bell system technical journal , vol. 28, no. 4, pp. 656–715, 1949

  4. [4]

    The wire-tap channel,

    A. D. Wyner, “The wire-tap channel,” Bell system technical journal , vol. 54, no. 8, pp. 1355–1387, 1975

  5. [5]

    QoS-based binary signature design for secure CDMA systems,

    M. Li, G. Ti, X. Tian, and Q. Liu, “QoS-based binary signature design for secure CDMA systems,” IEEE Trans. Veh. Technol., vol. 66, no. 11, pp. 10 011–10 023, Nov. 2017

  6. [6]

    Physical layer security in cognitive radio inspired NOMA network,

    Z. Xiang, W. Y ang, G. Pan, Y . Cai, and Y . Song, “Physical layer security in cognitive radio inspired NOMA network,” IEEE J. Sel. Top. Signal Process., vol. 13, no. 3, pp. 700–714, Jun. 2019

  7. [7]

    NOMA- assisted secure short-packet communications in IoT,

    Z. Xiang, W. Y ang, Y . Cai, Z. Ding, Y . Song, and Y . Zou, “NOMA- assisted secure short-packet communications in IoT,” IEEE Wireless Commun., vol. 27, no. 4, pp. 8–15, Aug. 2020

  8. [8]

    On the secrecy capacity of fading channels,

    P . K. Gopala, L. Lai, and H. El Gamal, “On the secrecy capacity of fading channels,” IEEE Trans. Inf. Theory , vol. 54, no. 10, pp. 4687– 4698, Oct. 2008

  9. [9]

    Covert communications: A comprehensive survey,

    X. Chen, J. An, Z. Xiong, C. Xing, N. Zhao, F. R. Yu, and A. Nal- lanathan, “Covert communications: A comprehensive survey,” IEEE Commun. Surv. Tutorials , vol. 25, no. 2, pp. 1173–1198, Apr. 2023

  10. [10]

    A comprehensive survey of covert communication techniques, limitations and future challenges,

    I. Makhdoom, M. Abolhasan, and J. Lipman, “A comprehensive survey of covert communication techniques, limitations and future challenges,” Comput. Secur., vol. 120, p. 102784, 2022

  11. [11]

    Performance analysis and power allocation for covert mobile edge computing with RIS-aided NOMA,

    Y . Cheng, J. Lu, D. Niyato, B. Lyu, M. Xu, and S. Zhu, “Performance analysis and power allocation for covert mobile edge computing with RIS-aided NOMA,” IEEE Trans. Mob. Comput., vol. 23, no. 5, pp. 4212– 4227, Aug. 2024

  12. [12]

    Broadcast channels with confidential mes- sages,

    I. Csiszár and J. Körner, “Broadcast channels with confidential mes- sages,” IEEE Trans. Inf. Theory , vol. 24, no. 3, pp. 339–348, May 1978

  13. [13]

    Secrecy wireless information and power transfer in fading wiretap channel,

    H. Xing, L. Liu, and R. Zhang, “Secrecy wireless information and power transfer in fading wiretap channel,” IEEE Trans. Veh. Technol. , vol. 65, no. 1, pp. 180–190, Jan. 2016

  14. [14]

    On the placement of RF energy harvesting node in wireless networks with secrecy considerations,

    B. He and X. Zhou, “On the placement of RF energy harvesting node in wireless networks with secrecy considerations,” in Proc. IEEE Global Commun. Conf. Workshops (GC Wkshps) , Austin, TX, United States, Dec. 2014, pp. 1355–1360

  15. [15]

    An optimization framework of target secrecy rate and power allocation for SWIPT system,

    H. Yu, S. Guo, and Y . Y ang, “An optimization framework of target secrecy rate and power allocation for SWIPT system,” in Proc. IEEE Global Commun. Conf. (GLOBECOM) , Washington, DC, USA, Dec. 2016, pp. 1–6

  16. [16]

    Outage probability based power distribution between data and artificial noise for physical layer security,

    N. Romero-Zurita, M. Ghogho, and D. McLernon, “Outage probability based power distribution between data and artificial noise for physical layer security,” IEEE Signal Process Lett. , vol. 19, no. 2, pp. 71–74, Feb. 2012

  17. [17]

    Exploiting full-duplex receivers for achieving secret communications in multiuser MISO net- works,

    B. Akgun, O. O. Koyluoglu, and M. Krunz, “Exploiting full-duplex receivers for achieving secret communications in multiuser MISO net- works,” IEEE Trans. Commun. , vol. 65, no. 2, pp. 956–968, Feb. 2017

  18. [18]

    On the design of secure non-orthogonal multiple access systems,

    B. He, A. Liu, N. Y ang, and V . K. N. Lau, “On the design of secure non-orthogonal multiple access systems,” IEEE J. Sel. Areas Commun. , vol. 35, no. 10, pp. 2196–2206, Oct. 2017

  19. [19]

    Covert communication for Jammer-aided multi-antenna UA V networks,

    H. Du, D. Niyato, Y .-A. Xie, Y . Cheng, J. Kang, and D. I. Kim, “Covert communication for Jammer-aided multi-antenna UA V networks,” in Proc. IEEE Int. Conf. Commun. (ICC) , Seoul, South Korea, 2022, pp. 91–96

  20. [20]

    Multi-antenna covert communication via full-duplex jamming against a warden with uncertain locations,

    X. Chen, W. Sun, C. Xing, N. Zhao, Y . Chen, F. R. Yu, and A. Nal- lanathan, “Multi-antenna covert communication via full-duplex jamming against a warden with uncertain locations,” IEEE Trans. Wireless Com- mun., vol. 20, no. 8, pp. 5467–5480, Aug. 2021

  21. [21]

    Covert communications in D2D underlaying cellular networks with antenna array assisted artificial noise transmission,

    Y . Jiang, L. Wang, and H.-H. Chen, “Covert communications in D2D underlaying cellular networks with antenna array assisted artificial noise transmission,” IEEE Trans. Veh. Technol., vol. 69, no. 3, pp. 2980–2992, Mar. 2020

  22. [22]

    Covert wireless communication with artificial noise generation,

    R. Soltani, D. Goeckel, D. Towsley, B. A. Bash, and S. Guha, “Covert wireless communication with artificial noise generation,” IEEE Trans. Wireless Commun. , vol. 17, no. 11, pp. 7252–7267, Nov. 2018

  23. [23]

    Multi-hop routing in covert wireless networks,

    A. Sheikholeslami, M. Ghaderi, D. Towsley, B. A. Bash, S. Guha, and D. Goeckel, “Multi-hop routing in covert wireless networks,” IEEE Trans. Wireless Commun. , vol. 17, no. 6, pp. 3656–3669, Jun. 2018

  24. [24]

    Covert commu- nication using null space and 3D beamforming: Uncertainty of willie’s location information,

    M. Forouzesh, P . Azmi, N. Mokari, and D. Goeckel, “Covert commu- nication using null space and 3D beamforming: Uncertainty of willie’s location information,” IEEE Trans. Veh. Technol. , vol. 69, no. 8, pp. 8568–8576, Aug. 2020

  25. [25]

    STAR-RIS enhanced joint physical layer security and covert commu- nications for multi-antenna mmWave systems,

    H. Xiao, X. Hu, A. Li, W. Wang, Z. Su, K.-K. Wong, and K. Y ang, “STAR-RIS enhanced joint physical layer security and covert commu- nications for multi-antenna mmWave systems,” IEEE Trans. Wireless Commun., vol. 23, no. 8, pp. 8805–8819, Aug. 2024. 13

  26. [26]

    STAR-RIS assisted covert communication for NOMA systems,

    Z. Zhang, X. Li, H. Gao, K. Rabie, and A. Khaleel, “STAR-RIS assisted covert communication for NOMA systems,” in Proc. IEEE Veh. Technol. Conf. (VTC2023-Fall), Hong Kong, China, Oct. 2023, pp. 1–6

  27. [27]

    Secrecy rate maximization for movable antenna-aided STAR-RIS in integrated sensing and com- munication systems,

    Y . Wu, C. Li, H. Shi, H. Ding, and J. Song, “Secrecy rate maximization for movable antenna-aided STAR-RIS in integrated sensing and com- munication systems,” Entropy, vol. 27, no. 12, 2025

  28. [28]

    Intelligent reflecting surface enabled covert communications in wireless networks,

    X. Lu, E. Hossain, T. Shafique, S. Feng, H. Jiang, and D. Niyato, “Intelligent reflecting surface enabled covert communications in wireless networks,” IEEE Network, vol. 34, no. 5, pp. 148–155, Sep. 2020

  29. [29]

    Reconfigurable intelligent surfaces: Principles and opportunities,

    Y . Liu, X. Liu, X. Mu, T. Hou, J. Xu, M. Di Renzo, and N. Al-Dhahir, “Reconfigurable intelligent surfaces: Principles and opportunities,” IEEE Commun. Surv. Tutorials , vol. 23, no. 3, pp. 1546–1577, May 2021

  30. [30]

    Towards smart and reconfigurable environment: Intelligent reflecting surface aided wireless network,

    Q. Wu and R. Zhang, “Towards smart and reconfigurable environment: Intelligent reflecting surface aided wireless network,” IEEE Commun. Mag., vol. 58, no. 1, pp. 106–112, Jan. 2020

  31. [31]

    Downlink and uplink intelligent reflecting surface aided networks: NOMA and OMA,

    Y . Cheng, K. H. Li, Y . Liu, K. C. Teh, and H. Vincent Poor, “Downlink and uplink intelligent reflecting surface aided networks: NOMA and OMA,” IEEE Trans. Wireless Commun. , vol. 20, no. 6, pp. 3988–4000, Jun. 2021

  32. [32]

    Covert communication in intelligent reflecting surface-assisted NOMA systems: Design, analysis, and optimization,

    L. Lv, Q. Wu, Z. Li, Z. Ding, N. Al-Dhahir, and J. Chen, “Covert communication in intelligent reflecting surface-assisted NOMA systems: Design, analysis, and optimization,” IEEE Trans. Wireless Commun. , vol. 21, no. 3, pp. 1735–1750, Mar. 2022

  33. [33]

    Channel estimation for ris-empowered multi-user MISO wireless communications,

    L. Wei, C. Huang, G. C. Alexandropoulos, C. Yuen, Z. Zhang, and M. Debbah, “Channel estimation for ris-empowered multi-user MISO wireless communications,” IEEE Trans. Commun. , vol. 69, no. 6, pp. 4144–4157, Jun. 2021

  34. [34]

    Covert non-orthogonal multiple access vehicular communications with friendly jamming,

    Q. Li, P . Ren, D. Xu, and Y . Xie, “Covert non-orthogonal multiple access vehicular communications with friendly jamming,” in Proc. IEEE Global Commun. Conf. Workshops (GC Wkshps) , Taipei, China, Dec. 2020, pp. 1–6

  35. [35]

    On the road to 6G: Visions, requirements, key technologies, and testbeds,

    C.-X. Wang, X. Y ou, X. Gao, X. Zhu, Z. Li, C. Zhang, H. Wang, Y . Huang, Y . Chen, H. Haas, J. S. Thompson, S. Chatzinotas, K. Y ang, and H. V . Poor, “On the road to 6G: Visions, requirements, key technologies, and testbeds,” IEEE Commun. Surv. Tutorials , vol. 25, no. 2, pp. 905–974, Apr. 2023

  36. [36]

    On challenges of sixth-generation (6G) wireless networks: A compre- hensive survey of requirements, applications, and security issues,

    M. S. Akbar, Z. Hussain, M. Ikram, Q. Z. Sheng, and S. Mukhopadhyay, “On challenges of sixth-generation (6G) wireless networks: A compre- hensive survey of requirements, applications, and security issues,” IEEE Open J. Commun. Soc. , vol. 5, pp. 1200–1245, 2024

  37. [37]

    Achieving covert communication in uplink NOMA systems via energy harvesting jam- mer,

    L. Tao, W. Y ang, X. Lu, M. Wang, and Y . Song, “Achieving covert communication in uplink NOMA systems via energy harvesting jam- mer,” IEEE Commun. Lett. , vol. 25, no. 12, pp. 3785–3789, Dec. 2021

  38. [38]

    Covert communications with friendly jamming in internet of vehicles,

    H. Zhang, Y . Zou, D. Yu, J. Yu, and X. Cheng, “Covert communications with friendly jamming in internet of vehicles,” Veh. Commun., vol. 35, p. 100472, Jun. 2022

  39. [39]

    Intelligent reflecting surface enhanced wireless network via joint active and passive beamforming,

    Q. Wu and R. Zhang, “Intelligent reflecting surface enhanced wireless network via joint active and passive beamforming,” IEEE Trans. Wireless Commun., vol. 18, no. 11, pp. 5394–5409, Nov. 2019

  40. [40]

    On the impact of phase shifting designs on IRS-NOMA,

    Z. Ding, R. Schober, and H. V . Poor, “On the impact of phase shifting designs on IRS-NOMA,” IEEE Wireless Commun. Lett. , vol. 9, no. 10, pp. 1596–1600, Oct. 2020

  41. [41]

    Secure transmission with multiple antennas—part ii: The MIMOME wiretap channel,

    A. Khisti and G. W. Wornell, “Secure transmission with multiple antennas—part ii: The MIMOME wiretap channel,” IEEE Trans. Inf. Theory, vol. 56, no. 11, pp. 5515–5532, Nov. 2010

  42. [42]

    Federated transfer learning with client selection for intrusion detection in mobile edge computing,

    Y . Cheng, J. Lu, D. Niyato, B. Lyu, J. Kang, and S. Zhu, “Federated transfer learning with client selection for intrusion detection in mobile edge computing,” IEEE Commun. Lett. , vol. 26, no. 3, pp. 552–556, Mar. 2022

  43. [43]

    Enhancing deep reinforcement learning: A tutorial on generative diffusion models in network optimization,

    H. Du, R. Zhang, Y . Liu, J. Wang, Y . Lin, Z. Li, D. Niyato, J. Kang, Z. Xiong, S. Cui, B. Ai, H. Zhou, and D. I. Kim, “Enhancing deep reinforcement learning: A tutorial on generative diffusion models in network optimization,” IEEE Commun. Surv. Tutorials , vol. 26, no. 4, pp. 2611–2646, May 2024

  44. [44]

    Proximal Policy Optimization Algorithms

    J. Schulman, F. Wolski, P . Dhariwal, A. Radford, and O. Klimov, “Proximal policy optimization algorithms,” 2017. [Online]. Available: https://arxiv.org/abs/1707.06347

  45. [45]

    Multiagent federated reinforcement learning for secure incentive mechanism in intelligent cyber–physical systems,

    M. Xu, J. Peng, B. B. Gupta, J. Kang, Z. Xiong, Z. Li, and A. A. A. El- Latif, “Multiagent federated reinforcement learning for secure incentive mechanism in intelligent cyber–physical systems,” IEEE Internet Things J., vol. 9, no. 22, pp. 22 095–22 108, Nov. 2022

  46. [46]

    Diffusion policies as an expressive policy class for offline reinforcement learning,

    Z. Wang, J. J. Hunt, and M. Zhou, “Diffusion policies as an expressive policy class for offline reinforcement learning,” in Proc. Int. Conf. Learn. Represent. (ICLR) , Kigali, Rwanda, May 2023, pp. 1–29

  47. [47]

    Denoising diffusion probabilistic mod- els,

    J. Ho, A. Jain, and P . Abbeel, “Denoising diffusion probabilistic mod- els,” in Advances in Neural Information Processing Systems (NeurIPS) , vol. 33, Vancouver, BC, Canada, Dec. 2020, pp. 6840–6851

  48. [48]

    Sustainable diffusion-based incentive mechanism for generative AI-driven digital twins in industrial Cyber-Physical systems,

    J. Wen, J. Kang, D. Niyato, Y . Zhang, and S. Mao, “Sustainable diffusion-based incentive mechanism for generative AI-driven digital twins in industrial Cyber-Physical systems,” IEEE Trans. Ind. Cyber- Phys. Syst. , vol. 3, pp. 139–149, 2025