Hybrid Bit and Semantic Communications for UAV-Enabled Wireless Power Transfer Networks: A Decision-Assisted Deep Reinforcement Learning Approach

Chong Huang; Jingfu Li; Jingjing Cui; Jing Zhu; Mingzhe Chen; Pei Xiao; Rahim Tafazolli; Zheng Chu

arxiv: 2606.00668 · v1 · pith:HDUPBJU7new · submitted 2026-05-30 · 💻 cs.IT · eess.SP· math.IT

Hybrid Bit and Semantic Communications for UAV-Enabled Wireless Power Transfer Networks: A Decision-Assisted Deep Reinforcement Learning Approach

Jingfu Li , Jingjing Cui , Chong Huang , Jing Zhu , Zheng Chu , Mingzhe Chen , Pei Xiao , Rahim Tafazolli This is my paper

Pith reviewed 2026-06-28 18:18 UTC · model grok-4.3

classification 💻 cs.IT eess.SPmath.IT

keywords semantic communicationswireless power transferUAV networksdeep reinforcement learninghybrid communicationsenergy efficiencytrajectory optimization

0 comments

The pith

A hybrid bit-semantic framework with decision-assisted DSAC maximizes efficiency in UAV wireless power transfer networks.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper investigates the integration of semantic communications with wireless power transfer to support energy-limited devices in UAV networks. It proposes a multi-layer hybrid framework that switches between bit and semantic modes to adapt to varying SNR and task requirements. The central goal is maximizing a semantic communication efficiency metric by jointly optimizing UAV trajectory, EHBS selection, user association, mode selection, and harvesting time allocation. A distributional soft actor-critic algorithm enhanced by a decision assistant solves the resulting long-term dynamic optimization problem. Simulations indicate that this approach delivers superior performance compared to baselines in changing network conditions.

Core claim

In UAV-enabled wireless power transfer networks, a multi-layer hybrid bit and semantic communication framework maximizes semantic communication efficiency by jointly optimizing UAV trajectory, EHBS selection, user association, semantic mode selection, and energy harvesting time allocation, with the optimization solved by a decision-assisted distributional soft actor-critic algorithm that achieves superior long-term performance in dynamic environments.

What carries the argument

Decision-assisted distributional soft actor-critic (DSAC) algorithm applied to joint optimization over the multi-layer hybrid bit-semantic framework.

If this is right

The hybrid framework enables adaptation to different SNR levels and task requirements.
Joint optimization of trajectory, selection, association, mode, and time allocation improves energy efficiency.
The decision assistant enhances convergence of the DSAC algorithm.
Superior long-term optimization performance holds in dynamic network environments.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same joint-optimization structure could be tested in other mobile energy-harvesting settings such as ground-vehicle or low-altitude platform networks.
The efficiency metric may provide a quantitative basis for comparing semantic versus bit modes under power constraints.

Load-bearing premise

The semantic communication efficiency metric and the hybrid framework's ability to adapt to SNR and tasks are well-defined and sufficient to produce meaningful optimization gains.

What would settle it

A set of simulations in which the proposed decision-assisted DSAC fails to exceed the long-term efficiency of standard actor-critic or non-hybrid baselines across multiple dynamic UAV-WPT scenarios would falsify the superiority claim.

Figures

Figures reproduced from arXiv: 2606.00668 by Chong Huang, Jingfu Li, Jingjing Cui, Jing Zhu, Mingzhe Chen, Pei Xiao, Rahim Tafazolli, Zheng Chu.

**Figure 2.** Figure 2: The Hybrid Bit & Semantic Communication Framework. [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗

**Figure 3.** Figure 3: The compressed size and semantic generation quality [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗

**Figure 5.** Figure 5: The decision assistant in DRL. Algorithm 1 DA-DSAC-Based Optimization in UAVEnabled WPT Network: 1: Initialize parameters in UAV-Enabled WPT networks and semantic transmission framework. 2: Initialize DRL learning variables. 3: repeat: 4: for time slot 0, 1, ..., T do 5: Choose an action a(t) based on the current policy and state f(t). 6: Obtain the next state f(t+1) and reward r(t) from the environment. … view at source ↗

**Figure 6.** Figure 6: Semantic efficiency vs the weight factor of generatio [PITH_FULL_IMAGE:figures/full_fig_p010_6.png] view at source ↗

**Figure 8.** Figure 8: Semantic efficiency vs raw data size. becomes marginal. Moreover, the baseline results in this figure demonstrate the advantages of the proposed algorithm, and highlight the importance of optimizing UAV trajectory and wireless power transfer time factor. The impact of raw data size on semantic communication efficiency is shown in [PITH_FULL_IMAGE:figures/full_fig_p010_8.png] view at source ↗

**Figure 9.** Figure 9: Semantic efficiency vs maximum UAV speed. [PITH_FULL_IMAGE:figures/full_fig_p011_9.png] view at source ↗

**Figure 12.** Figure 12: Semantic efficiency vs maximum GU speed. [PITH_FULL_IMAGE:figures/full_fig_p011_12.png] view at source ↗

read the original abstract

Semantic communications which can significantly reduce spectrum consumption in wireless networks, have recently become a popular research area. When combined with wireless power transfer (WPT), semantic communications can help achieve high spectral efficiency for energy-limited devices in wireless communications. In energy-constrained and link budget-limited scenarios such as UAV networks, the integration of semantic communications and WPT enables highly energyefficient transmission mechanisms. In this paper, we investigate semantic communications in UAV-enabled WPT networks. To achieve adaptability to varying signal-to-noise ratio (SNR) and task requirements, we introduce a multi-layer hybrid bit and semantic communication framework. We adopt a semantic communication efficiency metric and aim to maximize it by jointly optimizing UAV trajectory, energy harvesting base station (EHBS) selection, user association, semantic mode selection, and energy harvesting time allocation. To address this complex longterm optimization problem, we introduce the distributional soft actor-critic (DSAC) algorithm and introduce a decision assistant to further enhance the convergence performance of DSAC. Simulation results validate the effectiveness of the proposed method and framework and demonstrate that our algorithm can achieve superior long-term optimization performance in dynamic network environments.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Applies a hybrid bit-semantic framework and decision-assisted DSAC to UAV WPT optimization, but validation details are absent from the abstract.

read the letter

The main takeaway is that this paper applies semantic communications and distributional soft actor-critic to a UAV wireless power transfer setting. It introduces a multi-layer hybrid framework that switches between bit and semantic modes and uses a decision assistant to stabilize the RL training.

The work is new in the specific combination: joint optimization of UAV trajectory, EHBS selection, user association, mode choice, and harvesting time to maximize a semantic efficiency metric under varying SNR and task conditions. The hybrid structure and the assistant tweak to DSAC are presented as practical responses to the dynamic, energy-limited environment.

The paper does a reasonable job framing why semantic comms plus WPT could help in UAV networks and why standard RL needs the extra decision step for convergence. The problem statement is concrete and the long-term optimization goal is clearly stated.

The soft spot is the evidence. Everything rests on simulations, yet the abstract supplies no setup details, no baselines, no parameter ranges, and no statistical checks. Without those, it is impossible to tell whether the reported gains come from the hybrid approach or from favorable metric definitions and environment choices. The semantic efficiency metric itself is not defined here, so its role in driving the joint decisions needs checking.

This is for people already working on semantic communications or UAV resource allocation. A specialist might borrow the framework or the assistant idea for their own simulations. Broader readers will not find new core methods.

The paper deserves a serious referee. The motivation and structure are coherent, so reviewers can focus on the missing simulation details and metric justification. I would send it out rather than desk reject.

Referee Report

0 major / 1 minor

Summary. The paper proposes a multi-layer hybrid bit and semantic communication framework for UAV-enabled wireless power transfer networks to achieve adaptability to varying SNR and task requirements. It adopts a semantic communication efficiency metric and formulates a long-term optimization problem over UAV trajectory, EHBS selection, user association, semantic mode selection, and energy harvesting time allocation. The distributional soft actor-critic (DSAC) algorithm augmented with a decision assistant is introduced to solve the problem, with simulation results claimed to validate effectiveness and superior performance in dynamic environments.

Significance. If the simulation claims hold with appropriate baselines and statistical support, the work would demonstrate a practical application of distributional RL to joint optimization of hybrid semantic/bit communications and WPT in UAV networks. This could be relevant for energy-constrained dynamic scenarios, with the decision-assistant enhancement to DSAC as a potential incremental contribution to convergence in such settings.

minor comments (1)

[Abstract] Abstract contains a minor typographical issue ('energyefficient' should be hyphenated as 'energy-efficient').

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the summary of our work on the hybrid bit and semantic communication framework for UAV-enabled WPT networks using decision-assisted DSAC. The summary accurately reflects the problem formulation, optimization variables, and algorithmic approach. No specific major comments were provided in the report, so we have no point-by-point responses at this time. We remain available to address any additional questions or concerns the referee may have regarding the simulations, baselines, or other aspects of the manuscript.

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

The paper proposes a hybrid bit/semantic communication framework for UAV-enabled WPT networks and applies the distributional soft actor-critic (DSAC) algorithm with a decision assistant to jointly optimize trajectory, EHBS selection, user association, mode selection, and harvesting time in order to maximize a semantic efficiency metric. No equations, fitted parameters, or self-referential definitions appear in the abstract. The optimization is presented as a standard RL formulation whose performance is assessed via external simulation benchmarks. No load-bearing step reduces by construction to its own inputs, and no self-citation chain is invoked to justify uniqueness or ansatz choices. The central claim therefore remains independent of the method itself.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review provides no explicit free parameters, axioms, or invented entities; full text needed for ledger.

pith-pipeline@v0.9.1-grok · 5787 in / 1021 out tokens · 45450 ms · 2026-06-28T18:18:40.304001+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

51 extracted references · 7 canonical work pages

[1]

Semantics-empowered communications: A tutorial-cum-su rvey,

Z. Lu, R. Li, K. Lu, X. Chen, E. Hossain, Z. Zhao, and H. Zhan g, “Semantics-empowered communications: A tutorial-cum-su rvey,” IEEE Commun. Surveys Tuts. , vol. 26, no. 1, pp. 41–79, 2024

2024
[2]

Deep learning en abled semantic communication systems,

H. Xie, Z. Qin, G. Y . Li, and B.-H. Juang, “Deep learning en abled semantic communication systems,” IEEE Trans. Signal Processing , vol. 69, pp. 2663–2675, Apr. 2021

2021
[3]

Deep joi nt source– channel coding for wireless image transmission,

E. Bourtsoulatze, D. B. Kurka, and D. G¨ und¨ uz, “Deep joi nt source– channel coding for wireless image transmission,” IEEE Trans. Cogn. Commun. Netw., vol. 5, no. 3, pp. 567–579, Sep. 2019

2019
[4]

Beyond transmitting bits: Cont ext, semantics, and task-oriented communications,

D. G¨ und¨ uz, Z. Qin, I. E. Aguerri, H. S. Dhillon, Z. Y ang, A. Y ener, K.-K. Wong, and C.-B. Chae, “Beyond transmitting bits: Cont ext, semantics, and task-oriented communications,” IEEE J. Select. Areas Commun., vol. 41, no. 1, pp. 5–41, Jan. 2023

2023
[5]

Wireless powered communica tion networks: an overview,

S. Bi, Y . Zeng, and R. Zhang, “Wireless powered communica tion networks: an overview,” IEEE Wireless Communications, vol. 23, no. 2, pp. 10–18, Apr. 2016

2016
[6]

Distributed movable antenna systems for reliab le, ﬂex- ible, and resilient communications: Architectures, techn iques, and challenges,

G. Pan, Y . Wu, Y . Si, Z. Hua, X. Gao, S. Wang, J. An, and D. Niyato, “Distributed movable antenna systems for reliab le, ﬂex- ible, and resilient communications: Architectures, techn iques, and challenges,” IEEE Wireless Communications , pp. 1–9, 2025, doi: 10.1109/MWC.2025.3599138

work page doi:10.1109/mwc.2025.3599138 2025
[7]

Wireless communication s with un- manned aerial vehicles: Opportunities and challenges,

Y . Zeng, R. Zhang, and T. J. Lim, “Wireless communication s with un- manned aerial vehicles: Opportunities and challenges,” IEEE Commun. Mag., vol. 54, no. 5, pp. 36–42, May. 2016

2016
[8]

UA V-enabled wireless power transfer: A tutorial overview,

L. Xie, X. Cao, J. Xu, and R. Zhang, “UA V-enabled wireless power transfer: A tutorial overview,” IEEE Trans. Green Commun. Netw. , vol. 5, no. 4, pp. 2042–2064, Dec. 2021

2042
[9]

Trajectory design for UA V-enabled multiuser wireless power transfer,

X. Y uan, Y . Hu, J. Xu, and A. Schmeink, “Trajectory design for UA V-enabled multiuser wireless power transfer,” IEEE Trans. Wireless Commun., vol. 20, no. 1, pp. 26–42, Feb. 2021

2021
[10]

Resource all ocation for text semantic communications,

L. Y an, Z. Qin, R. Zhang, Y . Li, and G. Y . Li, “Resource all ocation for text semantic communications,” IEEE Wireless Commun. Lett. , vol. 11, no. 7, pp. 1394–1398, Jul. 2022

2022
[11]

Semantic communication systems for speech transmission,

Z. Weng and Z. Qin, “Semantic communication systems for speech transmission,” IEEE J. Select. Areas Commun. , vol. 39, no. 8, pp. 2434– 2444, Aug. 2021

2021
[12]

Toward semanti c communications: Deep learning-based image semantic codin g,

D. Huang, F. Gao, X. Tao, Q. Du, and J. Lu, “Toward semanti c communications: Deep learning-based image semantic codin g,” IEEE J. Select. Areas Commun. , vol. 41, no. 1, pp. 55–71, Jan. 2023

2023
[13]

Resource optimization for semantic- aware networks with task ofﬂoading,

Z. Ji, Z. Qin, X. Tao, and Z. Han, “Resource optimization for semantic- aware networks with task ofﬂoading,” IEEE Trans. Wireless Commun. , vol. 23, no. 9, pp. 12 284–12 296, Sep. 2024

2024
[14]

Wit t: A wireless image transmission transformer for semantic comm unications,

K. Y ang, S. Wang, J. Dai, K. Tan, K. Niu, and P . Zhang, “Wit t: A wireless image transmission transformer for semantic comm unications,” IEEE International Conference on Acoustics, Speech and Sig nal Pro- cessing (ICASSP) , Rhodes Island, Greece, Jun. 2023

2023
[15]

Ad aptive bit rate control in semantic communication with incremental kn owledge- based HARQ,

Q. Zhou, R. Liu, J. Zhang, R. He, R. Wang, and R. Zhang, “Ad aptive bit rate control in semantic communication with incremental kn owledge- based HARQ,” IEEE Open J. Commun. Soc , vol. 3, pp. 1075–1089, Jul. 2022

2022
[16]

Energy-efﬁcient probabilistic semantic communication o ver space-air- ground integrated networks,

Z. Zhao, Z. Y ang, M. Chen, C. Zhu, W. Xu, Z. Zhang, and K. Hu ang, “Energy-efﬁcient probabilistic semantic communication o ver space-air- ground integrated networks,” IEEE Trans. Wireless Commun. , vol. 24, no. 10, pp. 8814–8829, Oct. 2025

2025
[17]

A robust image semantic communication system with multi-scale vision tra nsformer,

X. Peng, Z. Qin, X. Tao, J. Lu, and K. B. Letaief, “A robust image semantic communication system with multi-scale vision tra nsformer,” IEEE J. Select. Areas Commun. , vol. 43, no. 4, pp. 1278–1291, Apr. 2025

2025
[18]

What will the future of UA V cellular communications be? a ﬂight fr om 5G to 6G,

G. Geraci, A. Garcia-Rodriguez, M. M. Azari, A. Lozano, M. Mez- zavilla, S. Chatzinotas, Y . Chen, S. Rangan, and M. D. Renzo, “What will the future of UA V cellular communications be? a ﬂight fr om 5G to 6G,” IEEE Commun. Surveys Tuts. , vol. 24, no. 3, pp. 1304–1335, Thirdquarter, 2022

2022
[19]

Beam domain channel modeling and prediction for U A V communications,

H. Chang, C.-X. Wang, R. Feng, C. Huang, L. Hou, and E.-H. M. Aggoune, “Beam domain channel modeling and prediction for U A V communications,” IEEE Trans. Wireless Commun. , vol. 24, no. 2, pp. 969–983, Feb. 2025

2025
[20]

Transformer-based collaborative reinforcement learnin g for ﬂuid an- tenna system (FAS)-enabled 3D UA V positioning,

X. Xu, H. Xu, D. Wei, W. Saad, M. Bennis, and M. Chen, “Transformer-based collaborative reinforcement learnin g for ﬂuid an- tenna system (FAS)-enabled 3D UA V positioning,” IEEE Journal on Selected Areas in Communications , vol. 44, pp. 1128–1143, 2026, doi: 10.1109/JSAC.2025.3625523

work page doi:10.1109/jsac.2025.3625523 2026
[21]

UA V-relay-aided secure maritime networks coe xisting with satellite networks: Robust beamforming and trajector y optimiza- tion,

Y . Y ao, W. Xiao, P . Miao, G. Chen, H. Y ang, C.-B. Chae, and K.-K. Wong, “UA V-relay-aided secure maritime networks coe xisting with satellite networks: Robust beamforming and trajector y optimiza- tion,” IEEE Trans. Wireless Commun. , Early Access, 2025. DOI: 10.1109/TWC.2025.3596136

work page doi:10.1109/twc.2025.3596136 2025
[22]

UA V-RIS-aided space-air-ground integrate d network: In- terference alignment design and DoF analysis,

J. Li, G. Chen, T. Zhang, W. Feng, W. Jiang, T. Q. S. Quek, a nd R. Tafazolli, “UA V-RIS-aided space-air-ground integrate d network: In- terference alignment design and DoF analysis,” IEEE Trans. Wireless Commun., vol. 23, no. 9, pp. 11 678–11 692, Sep. 2024

2024
[23]

D eep learning empowered trajectory and passive beamforming des ign in UA V-RIS enabled secure cognitive non-terrestrial network s,

Y . Liu, C. Huang, G. Chen, R. Song, S. Song, and P . Xiao, “D eep learning empowered trajectory and passive beamforming des ign in UA V-RIS enabled secure cognitive non-terrestrial network s,” IEEE Wireless Commun. Lett. , vol. 13, no. 1, pp. 188–192, Jan. 2024

2024
[24]

Collab orative reinforcement learning based unmanned aerial vehicle (UA V ) trajectory design for 3D UA V tracking,

Y . Zhu, M. Chen, S. Wang, Y . Hu, Y . Liu, and C. Yin, “Collab orative reinforcement learning based unmanned aerial vehicle (UA V ) trajectory design for 3D UA V tracking,” IEEE Trans. Mobile Comput. , vol. 23, no. 12, pp. 10 787–10 802, Dec. 2024

2024
[25]

UA V-assisted communica tions in SAGIN-ISAC: Mobile user tracking and robust beamforming,

W. Mao, Y . Lu, G. Pan, and B. Ai, “UA V-assisted communica tions in SAGIN-ISAC: Mobile user tracking and robust beamforming,” IEEE Journal on Selected Areas in Communications , vol. 43, no. 1, pp. 186– 200, Jan. 2025

2025
[26]

Flexible reconﬁgurable intelligent surface-ai ded covert communications in UA V networks,

C. Huang, G. Chen, Z. Xu, J. Zhu, T. Pan, R. Tafazolli, and W. Huang, “Flexible reconﬁgurable intelligent surface-ai ded covert communications in UA V networks,” IEEE Journal on Selected Areas in Communications , vol. 44, pp. 1577–1588, 2026, doi: 10.1109/JSAC.2025.3639197

work page doi:10.1109/jsac.2025.3639197 2026
[27]

Dynamic interference management by using the enhanced clu stering and deep reinforcement learning in VLC-enabled UA V communication,

L. Pang, K. Song, P . Miao, C. Huang, B. Ji, Z. An, and G. Che n, “Dynamic interference management by using the enhanced clu stering and deep reinforcement learning in VLC-enabled UA V communication,” IEEE Transactions on Consumer Electronics , vol. 71, no. 3, pp. 7584– 7596, Aug. 2025

2025
[28]

Joint optimization of delay and power efﬁciency of neighbor discovery in UA V networks,

H. Fan, Z. Song, X. Y ang, T. Li, S. Wang, C. Y . Leow, G. Pan, and D. Niyato, “Joint optimization of delay and power efﬁciency of neighbor discovery in UA V networks,” IEEE Transactions on Mobile Computing , pp. 1–16, 2025, doi: 10.1109/TMC.2025.3649859

work page doi:10.1109/tmc.2025.3649859 2025
[29]

RIS-assisted UA V communications for IoT with wireless power transfer using deep reinforcement learning,

K. K. Nguyen, A. Masaracchia, V . Sharma, H. V . Poor, and T . Q. Duong, “RIS-assisted UA V communications for IoT with wireless power transfer using deep reinforcement learning,” IEEE J. Sel. Top. Signal Process., vol. 16, no. 5, pp. 1086–1096, Aug. 2022

2022
[30]

Energy minim ization in RIS-assisted UA V-enabled wireless power transfer syste ms,

H. Ren, Z. Zhang, Z. Peng, L. Li, and C. Pan, “Energy minim ization in RIS-assisted UA V-enabled wireless power transfer syste ms,” IEEE Internet Things J. , vol. 10, no. 7, pp. 5794–5809, Apr. 2023

2023
[31]

A two- stage strategy for UA V-enabled wireless power transfer in u nknown environments,

J. Shi, P . Cong, L. Zhao, X. Wang, S. Wan, and M. Guizani, “ A two- stage strategy for UA V-enabled wireless power transfer in u nknown environments,” IEEE Trans. Mobile Comput. , vol. 23, no. 2, pp. 1785– 1802, Jan. 2024

2024
[32]

Aggregate throughput maxi mization for UA V-enabled relay networks with wireless power transfe r: Joint trajectory and power optimization,

H. Y u, M. Ju, and H.-C. Y ang, “Aggregate throughput maxi mization for UA V-enabled relay networks with wireless power transfe r: Joint trajectory and power optimization,” IEEE Trans. V eh. Technol., vol. 73, no. 6, pp. 8253–8265, Jun. 2024

2024
[33]

Economics of semantic communication system in wireless po wered Internet of Things,

Z. Q. Liew, Y . Cheng, W. Y . B. Lim, D. Niyato, C. Miao, and S . Sun, “Economics of semantic communication system in wireless po wered Internet of Things,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Singapore, Singapore, May. 2022, pp. 8637-8641

2022
[34]

Semantic com- munications for simultaneous wireless information and pow er transfer,

N. Khalfet, C. Psomas, S. Chatzinotas, and I. Krikidis, “Semantic com- munications for simultaneous wireless information and pow er transfer,” IEEE Trans. Commun. , vol. 73, no. 1, pp. 173–188, Jan. 2025

2025
[35]

Semantics-aware updates fro m remote energy harvesting devices to interconnected LEO satellite s,

E. Delfani and N. Pappas, “Semantics-aware updates fro m remote energy harvesting devices to interconnected LEO satellite s,” IEEE Commun. Lett. , vol. 29, no. 8, pp. 1928–1932, Aug. 2025

1928
[36]

Wireless power transfer me ets semantic communication for resource-constrained IoT netw orks: A joint transmission mode selection and resource management appro ach,

N. H. Sang, N. D. Hai, N. D. D. Anh, N. C. Luong, V .-D. Nguye n, S. Gong, D. Niyato, and D. I. Kim, “Wireless power transfer me ets semantic communication for resource-constrained IoT netw orks: A joint transmission mode selection and resource management appro ach,” IEEE Internet Things J. , vol. 12, no. 1, pp. 556–569, Jan. 2025

2025
[37]

3D trajectory optimization in rici an fading for UA V-enabled data harvesting,

C. Y ou and R. Zhang, “3D trajectory optimization in rici an fading for UA V-enabled data harvesting,” IEEE Trans. Wireless Commun., vol. 18, no. 6, pp. 3192–3207, Jun. 2019

2019
[38]

Secure D2D communication in large-scale cognitive cellul ar networks: A wireless power transfer model,

Y . Liu, L. Wang, S. A. Raza Zaidi, M. Elkashlan, and T. Q. D uong, “Secure D2D communication in large-scale cognitive cellul ar networks: A wireless power transfer model,” IEEE Trans. Commun., vol. 64, no. 1, pp. 329–342, Jan. 2016

2016
[39]

Robust resource allocation for MIMO wireless powered comm unica- tion networks based on a non-linear EH model,

E. Boshkovska, D. W. K. Ng, N. Zlatanov, A. Koelpin, and R . Schober, “Robust resource allocation for MIMO wireless powered comm unica- tion networks based on a non-linear EH model,” IEEE Transactions on Communications, vol. 65, no. 5, pp. 1984–1999, May. 2017

1984
[40]

Auto-encoding variationa l bayes,

D. P . Kingma and M. Welling, “Auto-encoding variationa l bayes,” International Conference on Learning Representations (IC LR), Banff, Canada, Apr. 2014

2014
[41]

Deep reinforcement learning-based resource allocation f or hybrid bit and generative semantic communications in space-air-grou nd integrated networks,

C. Huang, X. Chen, G. Chen, P . Xiao, G. Y . Li, and W. Huang, “Deep reinforcement learning-based resource allocation f or hybrid bit and generative semantic communications in space-air-grou nd integrated networks,” IEEE J. Select. Areas Commun. , Early Access, 2025. DOI: 10.1109/JSAC.2025.3623157

work page doi:10.1109/jsac.2025.3623157 2025
[42]

BLIP-2: Bootstrap ping language- image pre-training with frozen image encoders and large lan guage models,

J. Li, D. Li, S. Savarese, and S. Hoi, “BLIP-2: Bootstrap ping language- image pre-training with frozen image encoders and large lan guage models,” International Conference on Machine Learning (ICML) , vol. 202, Hawaii, USA, Jul. 2023, pp. 19730-19742

2023
[43]

Learning transferable visual models from natural languag e supervi- sion,

A. Radford, J. W. Kim, C. Hallacy, A. Ramesh, G. Goh, S. Ag arwal, G. Sastry, A. Askell, P . Mishkin, J. Clark, G. Krueger, and I. Sutskever, “Learning transferable visual models from natural languag e supervi- sion,” International Conference on Machine Learning (ICML) , vol. 139, Jul. 2021, pp. 8748-8763

2021
[44]

High- resolution image synthesis with latent diffusion models,

R. Rombach, A. Blattmann, D. Lorenz, P . Esser, and B. Omm er, “High- resolution image synthesis with latent diffusion models,” IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVP R), New Orleans, USA, Jun. 2022, pp. 10684-10695

2022
[45]

Neural discrete representation learning,

A. van den Oord, O. Vinyals, and k. kavukcuoglu, “Neural discrete representation learning,” Advances in Neural Information Processing Systems (NeurIPS) , vol. 30, Long Beach, USA, Dec. 2017

2017
[46]

Classiﬁer-free diffusion guida nce,

J. Ho and T. Salimans, “Classiﬁer-free diffusion guida nce,” Neural Information Processing Systems (NeurIPS) , Dec. 2021

2021
[47]

Di stributional soft actor-critic: Off-policy reinforcement learning for addressing value estimation errors,

J. Duan, Y . Guan, S. E. Li, Y . Ren, Q. Sun, and B. Cheng, “Di stributional soft actor-critic: Off-policy reinforcement learning for addressing value estimation errors,” IEEE Trans. Neural Netw. Learn. , vol. 33, no. 11, pp. 6584–6598, Nov. 2022

2022
[48]

Op- timizing model splitting and device task assignment for dec eptive signal-assisted private multi-hop split learning,

D. Wei, X. Xu, Y . Liu, H. Vincent Poor, and M. Chen, “Op- timizing model splitting and device task assignment for dec eptive signal-assisted private multi-hop split learning,” IEEE Journal on Se- lected Areas in Communications , vol. 44, pp. 1512–1528, 2026, doi: 10.1109/JSAC.2025.3637738

work page doi:10.1109/jsac.2025.3637738 2026
[49]

Joint ofﬂoading and resource allocation for hybrid cloud and edge computing in SAGINs: A decision assisted hybrid action space deep rein forcement learning approach,

C. Huang, G. Chen, P . Xiao, Y . Xiao, Z. Han, and J. A. Chamb ers, “Joint ofﬂoading and resource allocation for hybrid cloud and edge computing in SAGINs: A decision assisted hybrid action space deep rein forcement learning approach,” IEEE J. Select. Areas Commun. , vol. 42, no. 5, pp. 1029–1043, May. 2024

2024
[50]

Delay-constrained buff er-aided relay selection in the Internet of Things with decision-assisted reinforcement learning,

C. Huang, G. Chen, and Y . Gong, “Delay-constrained buff er-aided relay selection in the Internet of Things with decision-assisted reinforcement learning,” IEEE Internet Things J. , vol. 8, no. 12, pp. 10 198–10 208, Jun. 2021

2021
[51]

The surprising effectiveness of PPO in cooperative multi- agent games,

C. Y u, A. V elu, E. Vinitsky, J. Gao, Y . Wang, A. Bayen, and Y . WU, “The surprising effectiveness of PPO in cooperative multi- agent games,” Conference on Neural Information Processing Systems (Neur IPS), New Orleans, LA, Nov. 2022, pp. 24611-24624

2022

[1] [1]

Semantics-empowered communications: A tutorial-cum-su rvey,

Z. Lu, R. Li, K. Lu, X. Chen, E. Hossain, Z. Zhao, and H. Zhan g, “Semantics-empowered communications: A tutorial-cum-su rvey,” IEEE Commun. Surveys Tuts. , vol. 26, no. 1, pp. 41–79, 2024

2024

[2] [2]

Deep learning en abled semantic communication systems,

H. Xie, Z. Qin, G. Y . Li, and B.-H. Juang, “Deep learning en abled semantic communication systems,” IEEE Trans. Signal Processing , vol. 69, pp. 2663–2675, Apr. 2021

2021

[3] [3]

Deep joi nt source– channel coding for wireless image transmission,

E. Bourtsoulatze, D. B. Kurka, and D. G¨ und¨ uz, “Deep joi nt source– channel coding for wireless image transmission,” IEEE Trans. Cogn. Commun. Netw., vol. 5, no. 3, pp. 567–579, Sep. 2019

2019

[4] [4]

Beyond transmitting bits: Cont ext, semantics, and task-oriented communications,

D. G¨ und¨ uz, Z. Qin, I. E. Aguerri, H. S. Dhillon, Z. Y ang, A. Y ener, K.-K. Wong, and C.-B. Chae, “Beyond transmitting bits: Cont ext, semantics, and task-oriented communications,” IEEE J. Select. Areas Commun., vol. 41, no. 1, pp. 5–41, Jan. 2023

2023

[5] [5]

Wireless powered communica tion networks: an overview,

S. Bi, Y . Zeng, and R. Zhang, “Wireless powered communica tion networks: an overview,” IEEE Wireless Communications, vol. 23, no. 2, pp. 10–18, Apr. 2016

2016

[6] [6]

Distributed movable antenna systems for reliab le, ﬂex- ible, and resilient communications: Architectures, techn iques, and challenges,

G. Pan, Y . Wu, Y . Si, Z. Hua, X. Gao, S. Wang, J. An, and D. Niyato, “Distributed movable antenna systems for reliab le, ﬂex- ible, and resilient communications: Architectures, techn iques, and challenges,” IEEE Wireless Communications , pp. 1–9, 2025, doi: 10.1109/MWC.2025.3599138

work page doi:10.1109/mwc.2025.3599138 2025

[7] [7]

Wireless communication s with un- manned aerial vehicles: Opportunities and challenges,

Y . Zeng, R. Zhang, and T. J. Lim, “Wireless communication s with un- manned aerial vehicles: Opportunities and challenges,” IEEE Commun. Mag., vol. 54, no. 5, pp. 36–42, May. 2016

2016

[8] [8]

UA V-enabled wireless power transfer: A tutorial overview,

L. Xie, X. Cao, J. Xu, and R. Zhang, “UA V-enabled wireless power transfer: A tutorial overview,” IEEE Trans. Green Commun. Netw. , vol. 5, no. 4, pp. 2042–2064, Dec. 2021

2042

[9] [9]

Trajectory design for UA V-enabled multiuser wireless power transfer,

X. Y uan, Y . Hu, J. Xu, and A. Schmeink, “Trajectory design for UA V-enabled multiuser wireless power transfer,” IEEE Trans. Wireless Commun., vol. 20, no. 1, pp. 26–42, Feb. 2021

2021

[10] [10]

Resource all ocation for text semantic communications,

L. Y an, Z. Qin, R. Zhang, Y . Li, and G. Y . Li, “Resource all ocation for text semantic communications,” IEEE Wireless Commun. Lett. , vol. 11, no. 7, pp. 1394–1398, Jul. 2022

2022

[11] [11]

Semantic communication systems for speech transmission,

Z. Weng and Z. Qin, “Semantic communication systems for speech transmission,” IEEE J. Select. Areas Commun. , vol. 39, no. 8, pp. 2434– 2444, Aug. 2021

2021

[12] [12]

Toward semanti c communications: Deep learning-based image semantic codin g,

D. Huang, F. Gao, X. Tao, Q. Du, and J. Lu, “Toward semanti c communications: Deep learning-based image semantic codin g,” IEEE J. Select. Areas Commun. , vol. 41, no. 1, pp. 55–71, Jan. 2023

2023

[13] [13]

Resource optimization for semantic- aware networks with task ofﬂoading,

Z. Ji, Z. Qin, X. Tao, and Z. Han, “Resource optimization for semantic- aware networks with task ofﬂoading,” IEEE Trans. Wireless Commun. , vol. 23, no. 9, pp. 12 284–12 296, Sep. 2024

2024

[14] [14]

Wit t: A wireless image transmission transformer for semantic comm unications,

K. Y ang, S. Wang, J. Dai, K. Tan, K. Niu, and P . Zhang, “Wit t: A wireless image transmission transformer for semantic comm unications,” IEEE International Conference on Acoustics, Speech and Sig nal Pro- cessing (ICASSP) , Rhodes Island, Greece, Jun. 2023

2023

[15] [15]

Ad aptive bit rate control in semantic communication with incremental kn owledge- based HARQ,

Q. Zhou, R. Liu, J. Zhang, R. He, R. Wang, and R. Zhang, “Ad aptive bit rate control in semantic communication with incremental kn owledge- based HARQ,” IEEE Open J. Commun. Soc , vol. 3, pp. 1075–1089, Jul. 2022

2022

[16] [16]

Energy-efﬁcient probabilistic semantic communication o ver space-air- ground integrated networks,

Z. Zhao, Z. Y ang, M. Chen, C. Zhu, W. Xu, Z. Zhang, and K. Hu ang, “Energy-efﬁcient probabilistic semantic communication o ver space-air- ground integrated networks,” IEEE Trans. Wireless Commun. , vol. 24, no. 10, pp. 8814–8829, Oct. 2025

2025

[17] [17]

A robust image semantic communication system with multi-scale vision tra nsformer,

X. Peng, Z. Qin, X. Tao, J. Lu, and K. B. Letaief, “A robust image semantic communication system with multi-scale vision tra nsformer,” IEEE J. Select. Areas Commun. , vol. 43, no. 4, pp. 1278–1291, Apr. 2025

2025

[18] [18]

What will the future of UA V cellular communications be? a ﬂight fr om 5G to 6G,

G. Geraci, A. Garcia-Rodriguez, M. M. Azari, A. Lozano, M. Mez- zavilla, S. Chatzinotas, Y . Chen, S. Rangan, and M. D. Renzo, “What will the future of UA V cellular communications be? a ﬂight fr om 5G to 6G,” IEEE Commun. Surveys Tuts. , vol. 24, no. 3, pp. 1304–1335, Thirdquarter, 2022

2022

[19] [19]

Beam domain channel modeling and prediction for U A V communications,

H. Chang, C.-X. Wang, R. Feng, C. Huang, L. Hou, and E.-H. M. Aggoune, “Beam domain channel modeling and prediction for U A V communications,” IEEE Trans. Wireless Commun. , vol. 24, no. 2, pp. 969–983, Feb. 2025

2025

[20] [20]

Transformer-based collaborative reinforcement learnin g for ﬂuid an- tenna system (FAS)-enabled 3D UA V positioning,

X. Xu, H. Xu, D. Wei, W. Saad, M. Bennis, and M. Chen, “Transformer-based collaborative reinforcement learnin g for ﬂuid an- tenna system (FAS)-enabled 3D UA V positioning,” IEEE Journal on Selected Areas in Communications , vol. 44, pp. 1128–1143, 2026, doi: 10.1109/JSAC.2025.3625523

work page doi:10.1109/jsac.2025.3625523 2026

[21] [21]

UA V-relay-aided secure maritime networks coe xisting with satellite networks: Robust beamforming and trajector y optimiza- tion,

Y . Y ao, W. Xiao, P . Miao, G. Chen, H. Y ang, C.-B. Chae, and K.-K. Wong, “UA V-relay-aided secure maritime networks coe xisting with satellite networks: Robust beamforming and trajector y optimiza- tion,” IEEE Trans. Wireless Commun. , Early Access, 2025. DOI: 10.1109/TWC.2025.3596136

work page doi:10.1109/twc.2025.3596136 2025

[22] [22]

UA V-RIS-aided space-air-ground integrate d network: In- terference alignment design and DoF analysis,

J. Li, G. Chen, T. Zhang, W. Feng, W. Jiang, T. Q. S. Quek, a nd R. Tafazolli, “UA V-RIS-aided space-air-ground integrate d network: In- terference alignment design and DoF analysis,” IEEE Trans. Wireless Commun., vol. 23, no. 9, pp. 11 678–11 692, Sep. 2024

2024

[23] [23]

D eep learning empowered trajectory and passive beamforming des ign in UA V-RIS enabled secure cognitive non-terrestrial network s,

Y . Liu, C. Huang, G. Chen, R. Song, S. Song, and P . Xiao, “D eep learning empowered trajectory and passive beamforming des ign in UA V-RIS enabled secure cognitive non-terrestrial network s,” IEEE Wireless Commun. Lett. , vol. 13, no. 1, pp. 188–192, Jan. 2024

2024

[24] [24]

Collab orative reinforcement learning based unmanned aerial vehicle (UA V ) trajectory design for 3D UA V tracking,

Y . Zhu, M. Chen, S. Wang, Y . Hu, Y . Liu, and C. Yin, “Collab orative reinforcement learning based unmanned aerial vehicle (UA V ) trajectory design for 3D UA V tracking,” IEEE Trans. Mobile Comput. , vol. 23, no. 12, pp. 10 787–10 802, Dec. 2024

2024

[25] [25]

UA V-assisted communica tions in SAGIN-ISAC: Mobile user tracking and robust beamforming,

W. Mao, Y . Lu, G. Pan, and B. Ai, “UA V-assisted communica tions in SAGIN-ISAC: Mobile user tracking and robust beamforming,” IEEE Journal on Selected Areas in Communications , vol. 43, no. 1, pp. 186– 200, Jan. 2025

2025

[26] [26]

Flexible reconﬁgurable intelligent surface-ai ded covert communications in UA V networks,

C. Huang, G. Chen, Z. Xu, J. Zhu, T. Pan, R. Tafazolli, and W. Huang, “Flexible reconﬁgurable intelligent surface-ai ded covert communications in UA V networks,” IEEE Journal on Selected Areas in Communications , vol. 44, pp. 1577–1588, 2026, doi: 10.1109/JSAC.2025.3639197

work page doi:10.1109/jsac.2025.3639197 2026

[27] [27]

Dynamic interference management by using the enhanced clu stering and deep reinforcement learning in VLC-enabled UA V communication,

L. Pang, K. Song, P . Miao, C. Huang, B. Ji, Z. An, and G. Che n, “Dynamic interference management by using the enhanced clu stering and deep reinforcement learning in VLC-enabled UA V communication,” IEEE Transactions on Consumer Electronics , vol. 71, no. 3, pp. 7584– 7596, Aug. 2025

2025

[28] [28]

Joint optimization of delay and power efﬁciency of neighbor discovery in UA V networks,

H. Fan, Z. Song, X. Y ang, T. Li, S. Wang, C. Y . Leow, G. Pan, and D. Niyato, “Joint optimization of delay and power efﬁciency of neighbor discovery in UA V networks,” IEEE Transactions on Mobile Computing , pp. 1–16, 2025, doi: 10.1109/TMC.2025.3649859

work page doi:10.1109/tmc.2025.3649859 2025

[29] [29]

RIS-assisted UA V communications for IoT with wireless power transfer using deep reinforcement learning,

K. K. Nguyen, A. Masaracchia, V . Sharma, H. V . Poor, and T . Q. Duong, “RIS-assisted UA V communications for IoT with wireless power transfer using deep reinforcement learning,” IEEE J. Sel. Top. Signal Process., vol. 16, no. 5, pp. 1086–1096, Aug. 2022

2022

[30] [30]

Energy minim ization in RIS-assisted UA V-enabled wireless power transfer syste ms,

H. Ren, Z. Zhang, Z. Peng, L. Li, and C. Pan, “Energy minim ization in RIS-assisted UA V-enabled wireless power transfer syste ms,” IEEE Internet Things J. , vol. 10, no. 7, pp. 5794–5809, Apr. 2023

2023

[31] [31]

A two- stage strategy for UA V-enabled wireless power transfer in u nknown environments,

J. Shi, P . Cong, L. Zhao, X. Wang, S. Wan, and M. Guizani, “ A two- stage strategy for UA V-enabled wireless power transfer in u nknown environments,” IEEE Trans. Mobile Comput. , vol. 23, no. 2, pp. 1785– 1802, Jan. 2024

2024

[32] [32]

Aggregate throughput maxi mization for UA V-enabled relay networks with wireless power transfe r: Joint trajectory and power optimization,

H. Y u, M. Ju, and H.-C. Y ang, “Aggregate throughput maxi mization for UA V-enabled relay networks with wireless power transfe r: Joint trajectory and power optimization,” IEEE Trans. V eh. Technol., vol. 73, no. 6, pp. 8253–8265, Jun. 2024

2024

[33] [33]

Economics of semantic communication system in wireless po wered Internet of Things,

Z. Q. Liew, Y . Cheng, W. Y . B. Lim, D. Niyato, C. Miao, and S . Sun, “Economics of semantic communication system in wireless po wered Internet of Things,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Singapore, Singapore, May. 2022, pp. 8637-8641

2022

[34] [34]

Semantic com- munications for simultaneous wireless information and pow er transfer,

N. Khalfet, C. Psomas, S. Chatzinotas, and I. Krikidis, “Semantic com- munications for simultaneous wireless information and pow er transfer,” IEEE Trans. Commun. , vol. 73, no. 1, pp. 173–188, Jan. 2025

2025

[35] [35]

Semantics-aware updates fro m remote energy harvesting devices to interconnected LEO satellite s,

E. Delfani and N. Pappas, “Semantics-aware updates fro m remote energy harvesting devices to interconnected LEO satellite s,” IEEE Commun. Lett. , vol. 29, no. 8, pp. 1928–1932, Aug. 2025

1928

[36] [36]

Wireless power transfer me ets semantic communication for resource-constrained IoT netw orks: A joint transmission mode selection and resource management appro ach,

N. H. Sang, N. D. Hai, N. D. D. Anh, N. C. Luong, V .-D. Nguye n, S. Gong, D. Niyato, and D. I. Kim, “Wireless power transfer me ets semantic communication for resource-constrained IoT netw orks: A joint transmission mode selection and resource management appro ach,” IEEE Internet Things J. , vol. 12, no. 1, pp. 556–569, Jan. 2025

2025

[37] [37]

3D trajectory optimization in rici an fading for UA V-enabled data harvesting,

C. Y ou and R. Zhang, “3D trajectory optimization in rici an fading for UA V-enabled data harvesting,” IEEE Trans. Wireless Commun., vol. 18, no. 6, pp. 3192–3207, Jun. 2019

2019

[38] [38]

Secure D2D communication in large-scale cognitive cellul ar networks: A wireless power transfer model,

Y . Liu, L. Wang, S. A. Raza Zaidi, M. Elkashlan, and T. Q. D uong, “Secure D2D communication in large-scale cognitive cellul ar networks: A wireless power transfer model,” IEEE Trans. Commun., vol. 64, no. 1, pp. 329–342, Jan. 2016

2016

[39] [39]

Robust resource allocation for MIMO wireless powered comm unica- tion networks based on a non-linear EH model,

E. Boshkovska, D. W. K. Ng, N. Zlatanov, A. Koelpin, and R . Schober, “Robust resource allocation for MIMO wireless powered comm unica- tion networks based on a non-linear EH model,” IEEE Transactions on Communications, vol. 65, no. 5, pp. 1984–1999, May. 2017

1984

[40] [40]

Auto-encoding variationa l bayes,

D. P . Kingma and M. Welling, “Auto-encoding variationa l bayes,” International Conference on Learning Representations (IC LR), Banff, Canada, Apr. 2014

2014

[41] [41]

Deep reinforcement learning-based resource allocation f or hybrid bit and generative semantic communications in space-air-grou nd integrated networks,

C. Huang, X. Chen, G. Chen, P . Xiao, G. Y . Li, and W. Huang, “Deep reinforcement learning-based resource allocation f or hybrid bit and generative semantic communications in space-air-grou nd integrated networks,” IEEE J. Select. Areas Commun. , Early Access, 2025. DOI: 10.1109/JSAC.2025.3623157

work page doi:10.1109/jsac.2025.3623157 2025

[42] [42]

BLIP-2: Bootstrap ping language- image pre-training with frozen image encoders and large lan guage models,

J. Li, D. Li, S. Savarese, and S. Hoi, “BLIP-2: Bootstrap ping language- image pre-training with frozen image encoders and large lan guage models,” International Conference on Machine Learning (ICML) , vol. 202, Hawaii, USA, Jul. 2023, pp. 19730-19742

2023

[43] [43]

Learning transferable visual models from natural languag e supervi- sion,

A. Radford, J. W. Kim, C. Hallacy, A. Ramesh, G. Goh, S. Ag arwal, G. Sastry, A. Askell, P . Mishkin, J. Clark, G. Krueger, and I. Sutskever, “Learning transferable visual models from natural languag e supervi- sion,” International Conference on Machine Learning (ICML) , vol. 139, Jul. 2021, pp. 8748-8763

2021

[44] [44]

High- resolution image synthesis with latent diffusion models,

R. Rombach, A. Blattmann, D. Lorenz, P . Esser, and B. Omm er, “High- resolution image synthesis with latent diffusion models,” IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVP R), New Orleans, USA, Jun. 2022, pp. 10684-10695

2022

[45] [45]

Neural discrete representation learning,

A. van den Oord, O. Vinyals, and k. kavukcuoglu, “Neural discrete representation learning,” Advances in Neural Information Processing Systems (NeurIPS) , vol. 30, Long Beach, USA, Dec. 2017

2017

[46] [46]

Classiﬁer-free diffusion guida nce,

J. Ho and T. Salimans, “Classiﬁer-free diffusion guida nce,” Neural Information Processing Systems (NeurIPS) , Dec. 2021

2021

[47] [47]

Di stributional soft actor-critic: Off-policy reinforcement learning for addressing value estimation errors,

J. Duan, Y . Guan, S. E. Li, Y . Ren, Q. Sun, and B. Cheng, “Di stributional soft actor-critic: Off-policy reinforcement learning for addressing value estimation errors,” IEEE Trans. Neural Netw. Learn. , vol. 33, no. 11, pp. 6584–6598, Nov. 2022

2022

[48] [48]

Op- timizing model splitting and device task assignment for dec eptive signal-assisted private multi-hop split learning,

D. Wei, X. Xu, Y . Liu, H. Vincent Poor, and M. Chen, “Op- timizing model splitting and device task assignment for dec eptive signal-assisted private multi-hop split learning,” IEEE Journal on Se- lected Areas in Communications , vol. 44, pp. 1512–1528, 2026, doi: 10.1109/JSAC.2025.3637738

work page doi:10.1109/jsac.2025.3637738 2026

[49] [49]

Joint ofﬂoading and resource allocation for hybrid cloud and edge computing in SAGINs: A decision assisted hybrid action space deep rein forcement learning approach,

C. Huang, G. Chen, P . Xiao, Y . Xiao, Z. Han, and J. A. Chamb ers, “Joint ofﬂoading and resource allocation for hybrid cloud and edge computing in SAGINs: A decision assisted hybrid action space deep rein forcement learning approach,” IEEE J. Select. Areas Commun. , vol. 42, no. 5, pp. 1029–1043, May. 2024

2024

[50] [50]

Delay-constrained buff er-aided relay selection in the Internet of Things with decision-assisted reinforcement learning,

C. Huang, G. Chen, and Y . Gong, “Delay-constrained buff er-aided relay selection in the Internet of Things with decision-assisted reinforcement learning,” IEEE Internet Things J. , vol. 8, no. 12, pp. 10 198–10 208, Jun. 2021

2021

[51] [51]

The surprising effectiveness of PPO in cooperative multi- agent games,

C. Y u, A. V elu, E. Vinitsky, J. Gao, Y . Wang, A. Bayen, and Y . WU, “The surprising effectiveness of PPO in cooperative multi- agent games,” Conference on Neural Information Processing Systems (Neur IPS), New Orleans, LA, Nov. 2022, pp. 24611-24624

2022