QAROO: AI-Driven Online Task Offloading for Energy-Efficient and Sustainable MEC Networks

Ahmed Farouk; Canglu Zhu; Haorui Shi; Miaojiang Chen; Yao Yang; Yongtao Yao

arxiv: 2604.25740 · v1 · submitted 2026-04-28 · 💻 cs.AI

QAROO: AI-Driven Online Task Offloading for Energy-Efficient and Sustainable MEC Networks

Yongtao Yao , Yao Yang , Haorui Shi , Canglu Zhu , Miaojiang Chen , Ahmed Farouk This is my paper

Pith reviewed 2026-05-07 16:02 UTC · model grok-4.3

classification 💻 cs.AI

keywords online task offloadingmobile edge computingquantum neural networksattention mechanismsreinforcement learningenergy efficiencyIoT dynamic environmentswireless powered networks

0 comments

The pith

QAROO integrates quantum neural networks with attention and uncertainty-guided quantization to enable faster online task offloading in dynamic wireless powered MEC networks.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes QAROO as a reinforcement learning framework for binary task offloading decisions in mobile edge computing systems powered by wireless energy transfer. It targets the limitations of heuristic algorithms by adding recurrent modeling for time sequences, attention layers inside quantum networks for better feature handling, and a quantization step guided by uncertainty estimates to speed up exploration. If the claimed gains hold, this would deliver higher normalized computation speed and lower processing times while co-optimizing energy use across large-scale IoT deployments in changing channel conditions.

Core claim

The central claim is that the QAROO framework, which applies a binary offloading strategy enhanced by quantum neural networks, attention mechanisms, recurrent neural networks for temporal modeling, and uncertainty-guided quantization for improved exploration, outperforms comparative schemes in normalized computation speed and processing time for sustainable task offloading in dynamic channel environments.

What carries the argument

QAROO, the quantum attention-based reinforcement learning framework that embeds attention inside quantum networks, adds recurrent layers for temporal dependencies, and applies uncertainty-guided quantization to guide offloading decisions under binary strategies.

If this is right

The method supplies a stable online solution for task offloading across large-scale IoT environments.
It co-optimizes computing and energy resources under wireless power transfer constraints.
It accelerates convergence and exploration relative to conventional heuristic approaches.
It strengthens temporal modeling and feature representation through its integrated recurrent and attention components.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same combination of quantum attention and uncertainty quantization could be tested on other wireless resource allocation problems such as power control or caching.
If the energy gains generalize, the approach could support longer operational lifetimes for battery-limited IoT devices in harvesting scenarios.
Real-world deployment would require checking sensitivity to imperfect channel state information or varying energy arrival rates.
The framework suggests a path for hybrid quantum-classical agents in broader dynamic optimization settings beyond edge computing.

Load-bearing premise

That combining quantum neural networks, attention mechanisms, recurrent modeling, and uncertainty-guided quantization will reliably improve adaptability and convergence in dynamic channel environments without requiring detailed specification of the network model or task arrival process.

What would settle it

A head-to-head simulation in a time-varying MEC channel where QAROO produces equal or lower normalized computation speed and longer average processing times than standard deep reinforcement learning or heuristic baselines.

Figures

Figures reproduced from arXiv: 2604.25740 by Ahmed Farouk, Canglu Zhu, Haorui Shi, Miaojiang Chen, Yao Yang, Yongtao Yao.

**Figure 1.** Figure 1: An example of the considered wireless powered MEC network and view at source ↗

**Figure 2.** Figure 2: Overall architecture of the proposed QAROO framework. view at source ↗

**Figure 3.** Figure 3: Performance comparison with 10 devices (a) Comparison of Normalization Rates of four algorithms on 10 devices (b) Comparison of loss functions of four algorithms on 10 devices V. EXPERIMENT AND RESULTS A. Experiments settings This section evaluates the performance of the proposed QAROO algorithm through simulation experiments. The experiments are conducted under the same wireless powered mobile edge comp… view at source ↗

**Figure 6.** Figure 6: Normalization rate of QAROO for different device quantities view at source ↗

read the original abstract

With the rapid advancement of artificial intelligence (AI) and intelligent science, intelligent edge computing has been widely adopted. However, the limitations of traditional methods, such as poor adaptability and the slow convergence of heuristic algorithms, are becoming increasingly evident. To enable sustainable and resource-efficient edge applications, this paper proposes an online task offloading framework for wireless powered mobile edge computing (MEC) networks, called Quantum Attention-based Reinforcement learning for Online Offloading (QAROO). The system employs a binary offloading strategy with the aim of co-optimizing computing and energy resources in dynamic channel environments. In response to the issues of poor adaptability in traditional approaches and the slow convergence of heuristic algorithms, the framework integrates quantum neural networks and attention mechanisms, introducing three key improvements: using recurrent neural networks to enhance temporal modeling capability, proposing an uncertainty-guided quantization method to improve exploration efficiency, and incorporating attention mechanisms into quantum networks to strengthen feature representation. Experiments demonstrate that the proposed method outperforms comparative schemes in terms of normalized computation speed and processing time, offering an efficient and stable solution for online task offloading in large-scale Internet of Things (IoT) dynamic environments.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

QAROO is a routine combination of quantum nets, attention, and RL for MEC offloading whose performance claims cannot be checked without the missing simulation parameters and results.

read the letter

The paper proposes QAROO as an online offloading scheme for wireless-powered MEC that uses quantum neural networks plus attention and recurrent modeling to handle dynamic IoT environments. It adds an uncertainty-guided quantization step and claims faster normalized computation speed and shorter processing time than prior schemes. That is the main pitch: a practical tweak to existing RL-based offloading for energy and latency trade-offs in sustainable edge systems.

Referee Report

3 major / 2 minor

Summary. The manuscript proposes QAROO, a quantum attention-based reinforcement learning framework for online binary task offloading in wireless-powered MEC networks. It integrates quantum neural networks with attention mechanisms, recurrent neural networks for temporal modeling, and an uncertainty-guided quantization method to address poor adaptability and slow convergence in dynamic channel environments. The central claim is that experiments demonstrate outperformance over comparative schemes in normalized computation speed and processing time, providing an efficient solution for large-scale IoT offloading.

Significance. If the experimental results hold under reproducible conditions, the work could offer a meaningful advance in AI-driven resource management for sustainable edge computing by improving adaptability in dynamic wireless-powered settings. The combination of quantum-inspired components with attention and uncertainty handling represents a potentially useful direction for handling large-scale IoT dynamics, though its impact depends on verifiable gains over established RL baselines.

major comments (3)

[Abstract] Abstract: The claim that 'experiments demonstrate that the proposed method outperforms comparative schemes in terms of normalized computation speed and processing time' supplies no experimental setup, baselines, quantitative results, error bars, or statistical tests. This renders the central empirical claim unverifiable and prevents assessment of whether gains are isolated from simulation artifacts.
[Methodology and Experiments] Methodology and Experiments sections: The paper does not define the wireless channel model, task arrival process, energy harvesting statistics, server capacities, or concrete simulation details for the quantum network (qubit count, circuit ansatz, or how uncertainty-guided quantization is applied during training). Without these, the asserted improvements in adaptability and convergence cannot be tested or reproduced.
[Abstract and Results] Abstract and §4 (or equivalent results section): The integration of RNN temporal modeling, attention in quantum networks, and uncertainty-guided quantization is presented as addressing specific limitations, but no ablation studies or component-wise comparisons are referenced to show which elements drive the reported speed and time gains.

minor comments (2)

[Abstract] Abstract: Consider specifying the number and types of comparative schemes (e.g., heuristic, standard DRL) and reporting at least one key quantitative improvement (e.g., percentage reduction in processing time) to make the summary more informative.
[Throughout] Notation and figures: Ensure all acronyms (MEC, QNN, IoT) are defined on first use and that any performance plots include clear labels for axes, legends, and confidence intervals.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their insightful and constructive comments, which have helped us improve the manuscript significantly. We address each major comment point by point below, indicating the revisions made to enhance reproducibility and completeness.

read point-by-point responses

Referee: [Abstract] Abstract: The claim that 'experiments demonstrate that the proposed method outperforms comparative schemes in terms of normalized computation speed and processing time' supplies no experimental setup, baselines, quantitative results, error bars, or statistical tests. This renders the central empirical claim unverifiable and prevents assessment of whether gains are isolated from simulation artifacts.

Authors: We agree that the abstract's empirical claim would be strengthened by additional context. In the revised manuscript, we have expanded the abstract to include a brief description of the experimental setup, the baselines compared (e.g., standard DRL methods and heuristics), and references to the quantitative results with error bars and statistical tests detailed in Section 4. This addresses the verifiability concern while maintaining the abstract's conciseness. revision: yes
Referee: [Methodology and Experiments] Methodology and Experiments sections: The paper does not define the wireless channel model, task arrival process, energy harvesting statistics, server capacities, or concrete simulation details for the quantum network (qubit count, circuit ansatz, or how uncertainty-guided quantization is applied during training). Without these, the asserted improvements in adaptability and convergence cannot be tested or reproduced.

Authors: We acknowledge this important point regarding reproducibility. We have substantially revised the Methodology section to explicitly define all mentioned elements: the wireless channel as a block-fading model with Rayleigh distribution and specific parameters; task arrivals following a Poisson process; energy harvesting modeled with given probabilities and rates; server computation capacities in FLOPS; and quantum specifics including the number of qubits (4), the variational circuit ansatz used, and the precise application of uncertainty-guided quantization via variance-based thresholding in the training process. These details were incorporated to allow full reproduction of the results. revision: yes
Referee: [Abstract and Results] Abstract and §4 (or equivalent results section): The integration of RNN temporal modeling, attention in quantum networks, and uncertainty-guided quantization is presented as addressing specific limitations, but no ablation studies or component-wise comparisons are referenced to show which elements drive the reported speed and time gains.

Authors: We agree that component-wise analysis is necessary to validate the contributions. Accordingly, we have added ablation studies in the revised Results section. These studies evaluate the performance of QAROO with and without each proposed component (RNN for temporal modeling, attention in quantum networks, and uncertainty-guided quantization), demonstrating through comparative metrics on normalized computation speed and processing time that each element provides measurable benefits in dynamic environments. revision: yes

Circularity Check

0 steps flagged

No circularity: no derivations or equations presented

full rationale

The paper describes a proposed QAROO framework combining quantum neural networks, attention mechanisms, RNN temporal modeling, and uncertainty-guided quantization for binary offloading in wireless-powered MEC. The abstract and summary contain no equations, no derivation steps, no fitted parameters, and no self-citations used to justify core claims. Experimental outperformance is asserted without any visible mathematical chain that could reduce to its own inputs by construction. Per the rules, this is a self-contained descriptive proposal against external benchmarks with no detectable circular steps.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no equations, parameters, or model details, so no free parameters, axioms, or invented entities can be identified.

pith-pipeline@v0.9.0 · 5516 in / 1080 out tokens · 37439 ms · 2026-05-07T16:02:00.162132+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

25 extracted references

[1]

Energy-efficient resource allocation for mobile-edge computation offloading,

C. You, K. Huang, H. Chae, and B.-H. Kim, “Energy-efficient resource allocation for mobile-edge computation offloading,”IEEE Transactions on Wireless Communications, vol. 16, no. 3, pp. 1397–1411, 2017

2017
[2]

A survey on mobile edge computing: The communication perspective,

Y . Mao, C. You, J. Zhang, K. Huang, and K. B. Letaief, “A survey on mobile edge computing: The communication perspective,”IEEE Communications Surveys & Tutorials, vol. 19, no. 4, pp. 2322–2358, 2017

2017
[3]

Computation rate maximization for wireless powered mobile-edge computing with binary computation offloading,

S. Bi and Y . J. Zhang, “Computation rate maximization for wireless powered mobile-edge computing with binary computation offloading,” IEEE Transactions on Wireless Communications, vol. 17, no. 6, pp. 4177–4190, 2018

2018
[4]

Energy-efficient dynamic computation offloading and cooperative task scheduling in mobile cloud computing,

S. Guo, J. Liu, Y . Yang, B. Xiao, and Z. Li, “Energy-efficient dynamic computation offloading and cooperative task scheduling in mobile cloud computing,”IEEE Transactions on Mobile Computing, vol. 18, no. 2, pp. 319–333, 2019

2019
[5]

Training deep quantum neural networks,

K. Beer, D. Bondarenko, T. Farrelly, T. J. Osborne, R. Salzmann, D. Scheiermann, and R. Wolf, “Training deep quantum neural networks,” NATURE COMMUNICATIONS, vol. 11, no. 1, FEB 10 2020

2020
[6]

Efficient representation of quantum many- body states with deep neural networks,

X. Gao and L.-M. Duan, “Efficient representation of quantum many- body states with deep neural networks,”NATURE COMMUNICATIONS, vol. 8, SEP 22 2017

2017
[7]

Quantum convolu- tional neural network based on variational quantum circuits,

L.-H. Gong, J.-J. Pei, T.-F. Zhang, and N.-R. Zhou, “Quantum convolu- tional neural network based on variational quantum circuits,”OPTICS COMMUNICATIONS, vol. 550, JAN 1 2024

2024
[8]

Quantum convolutional neural networks,

I. Cong, S. Choi, and M. D. Lukin, “Quantum convolutional neural networks,”NATURE PHYSICS, vol. 15, no. 12, pp. 1273+, DEC 2019

2019
[9]

The power of quantum neural networks,

A. Abbas, D. Sutter, C. Zoufal, A. Lucchi, A. Figalli, and S. Woerner, “The power of quantum neural networks,”NATURE COMPUTATIONAL SCIENCE, vol. 1, no. 6, pp. 403–409, JUN 2021

2021
[10]

Quantum entanglement in neural network states,

D.-L. Deng, X. Li, and S. Das Sarma, “Quantum entanglement in neural network states,”PHYSICAL REVIEW X, vol. 7, no. 2, MAY 11 2017

2017
[11]

A review on the attention mechanism of deep learning,

Z. Niu, G. Zhong, and H. Yu, “A review on the attention mechanism of deep learning,”NEUROCOMPUTING, vol. 452, pp. 48–62, SEP 10 2021

2021
[12]

Attention mechanism in neural networks: where it comes and where it goes,

D. Soydaner, “Attention mechanism in neural networks: where it comes and where it goes,”NEURAL COMPUTING & APPLICATIONS, vol. 34, no. 16, SI, pp. 13 371–13 385, AUG 2022

2022
[13]

Attention, please! a survey of neural attention models in deep learning,

A. d. S. Correia and E. L. Colombini, “Attention, please! a survey of neural attention models in deep learning,”ARTIFICIAL INTELLIGENCE REVIEW, vol. 55, no. 8, pp. 6037–6124, DEC 2022

2022
[14]

Attention in natural language processing,

A. Galassi, M. Lippi, and P. Torroni, “Attention in natural language processing,”IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, vol. 32, no. 10, pp. 4291–4308, OCT 2021

2021
[15]

Qksan: A quantum kernel self-attention network,

R.-X. Zhao, J. Shi, and X. Li, “Qksan: A quantum kernel self-attention network,”IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MA- CHINE INTELLIGENCE, vol. 46, no. 12, pp. 10 184–10 195, DEC 2024. IEEE TRANSACTIONS ON ... 10

2024
[16]

Qsan: A near-term achievable quantum self-attention network,

J. Shi, R.-X. Zhao, W. Wang, S. Zhang, and X. Li, “Qsan: A near-term achievable quantum self-attention network,”IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, vol. 36, no. 8, pp. 13 995–14 008, AUG 2025

2025
[17]

Solving the traveling salesman problem with quan- tum self-attention networks,

H. Li and Y . Ruan, “Solving the traveling salesman problem with quan- tum self-attention networks,” in2024 CROSS STRAIT RADIO SCIENCE AND WIRELESS TECHNOLOGY CONFERENCE, CSRSWTC 2024, ser. Cross Strait Quad Regional Radio Science and Wireless Technology Conference. Institute of Electrical and Electronics Engineers Inc, 2024, pp. 16–18, 2024 Cross Strait ...

2024
[18]

Quantum adaptive excitation network with variational quantum circuits for channel atten- tion,

Y .-C. Hsu, K.-C. Chen, T.-Y . Li, and N.-Y . Chen, “Quantum adaptive excitation network with variational quantum circuits for channel atten- tion,” in2025 IEEE International Conference on Quantum Computing and Engineering (QCE), vol. 02, 2025, pp. 344–349

2025
[19]

Finding structure in time,

J. L. Elman, “Finding structure in time,”Cognitive Science, vol. 14, no. 2, pp. 179–211, 1990

1990
[20]

Long short-term memory,

S. Hochreiter and J. Schmidhuber, “Long short-term memory,”Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997

1997
[21]

Deep recurrent q-learning for partially observable mdps

M. J. Hausknecht and P. Stone, “Deep recurrent q-learning for partially observable mdps.” inAAAI fall symposia, vol. 45, 2015, p. 141

2015
[22]

Fundamentals of recurrent neural network (rnn) and long short-term memory (lstm) network,

A. Sherstinsky, “Fundamentals of recurrent neural network (rnn) and long short-term memory (lstm) network,”Physica D: Nonlinear Phenomena, vol. 404, p. 132306, 2020. [Online]. Available: https: //www.sciencedirect.com/science/article/pii/S0167278919305974

2020
[23]

Deep reinforcement learning for online computation offloading in wireless powered mobile-edge computing networks,

L. Huang, S. Bi, and Y .-J. A. Zhang, “Deep reinforcement learning for online computation offloading in wireless powered mobile-edge computing networks,”IEEE Transactions on Mobile Computing, vol. 19, no. 11, pp. 2581–2593, 2020

2020
[24]

Quantum reinforcement learning,

D. Dong, C. Chen, H. Li, and T.-J. Tarn, “Quantum reinforcement learning,”IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), vol. 38, no. 5, pp. 1207–1220, 2008

2008
[25]

Qtrl: Toward practical quantum reinforcement learning via quantum- train,

C.-Y . Liu, C.-H. A. Lin, C.-H. H. Yang, K.-C. Chen, and M.-H. Hsieh, “Qtrl: Toward practical quantum reinforcement learning via quantum- train,” in2024 IEEE International Conference on Quantum Computing and Engineering (QCE), vol. 02, 2024, pp. 317–322

2024

[1] [1]

Energy-efficient resource allocation for mobile-edge computation offloading,

C. You, K. Huang, H. Chae, and B.-H. Kim, “Energy-efficient resource allocation for mobile-edge computation offloading,”IEEE Transactions on Wireless Communications, vol. 16, no. 3, pp. 1397–1411, 2017

2017

[2] [2]

A survey on mobile edge computing: The communication perspective,

Y . Mao, C. You, J. Zhang, K. Huang, and K. B. Letaief, “A survey on mobile edge computing: The communication perspective,”IEEE Communications Surveys & Tutorials, vol. 19, no. 4, pp. 2322–2358, 2017

2017

[3] [3]

Computation rate maximization for wireless powered mobile-edge computing with binary computation offloading,

S. Bi and Y . J. Zhang, “Computation rate maximization for wireless powered mobile-edge computing with binary computation offloading,” IEEE Transactions on Wireless Communications, vol. 17, no. 6, pp. 4177–4190, 2018

2018

[4] [4]

Energy-efficient dynamic computation offloading and cooperative task scheduling in mobile cloud computing,

S. Guo, J. Liu, Y . Yang, B. Xiao, and Z. Li, “Energy-efficient dynamic computation offloading and cooperative task scheduling in mobile cloud computing,”IEEE Transactions on Mobile Computing, vol. 18, no. 2, pp. 319–333, 2019

2019

[5] [5]

Training deep quantum neural networks,

K. Beer, D. Bondarenko, T. Farrelly, T. J. Osborne, R. Salzmann, D. Scheiermann, and R. Wolf, “Training deep quantum neural networks,” NATURE COMMUNICATIONS, vol. 11, no. 1, FEB 10 2020

2020

[6] [6]

Efficient representation of quantum many- body states with deep neural networks,

X. Gao and L.-M. Duan, “Efficient representation of quantum many- body states with deep neural networks,”NATURE COMMUNICATIONS, vol. 8, SEP 22 2017

2017

[7] [7]

Quantum convolu- tional neural network based on variational quantum circuits,

L.-H. Gong, J.-J. Pei, T.-F. Zhang, and N.-R. Zhou, “Quantum convolu- tional neural network based on variational quantum circuits,”OPTICS COMMUNICATIONS, vol. 550, JAN 1 2024

2024

[8] [8]

Quantum convolutional neural networks,

I. Cong, S. Choi, and M. D. Lukin, “Quantum convolutional neural networks,”NATURE PHYSICS, vol. 15, no. 12, pp. 1273+, DEC 2019

2019

[9] [9]

The power of quantum neural networks,

A. Abbas, D. Sutter, C. Zoufal, A. Lucchi, A. Figalli, and S. Woerner, “The power of quantum neural networks,”NATURE COMPUTATIONAL SCIENCE, vol. 1, no. 6, pp. 403–409, JUN 2021

2021

[10] [10]

Quantum entanglement in neural network states,

D.-L. Deng, X. Li, and S. Das Sarma, “Quantum entanglement in neural network states,”PHYSICAL REVIEW X, vol. 7, no. 2, MAY 11 2017

2017

[11] [11]

A review on the attention mechanism of deep learning,

Z. Niu, G. Zhong, and H. Yu, “A review on the attention mechanism of deep learning,”NEUROCOMPUTING, vol. 452, pp. 48–62, SEP 10 2021

2021

[12] [12]

Attention mechanism in neural networks: where it comes and where it goes,

D. Soydaner, “Attention mechanism in neural networks: where it comes and where it goes,”NEURAL COMPUTING & APPLICATIONS, vol. 34, no. 16, SI, pp. 13 371–13 385, AUG 2022

2022

[13] [13]

Attention, please! a survey of neural attention models in deep learning,

A. d. S. Correia and E. L. Colombini, “Attention, please! a survey of neural attention models in deep learning,”ARTIFICIAL INTELLIGENCE REVIEW, vol. 55, no. 8, pp. 6037–6124, DEC 2022

2022

[14] [14]

Attention in natural language processing,

A. Galassi, M. Lippi, and P. Torroni, “Attention in natural language processing,”IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, vol. 32, no. 10, pp. 4291–4308, OCT 2021

2021

[15] [15]

Qksan: A quantum kernel self-attention network,

R.-X. Zhao, J. Shi, and X. Li, “Qksan: A quantum kernel self-attention network,”IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MA- CHINE INTELLIGENCE, vol. 46, no. 12, pp. 10 184–10 195, DEC 2024. IEEE TRANSACTIONS ON ... 10

2024

[16] [16]

Qsan: A near-term achievable quantum self-attention network,

J. Shi, R.-X. Zhao, W. Wang, S. Zhang, and X. Li, “Qsan: A near-term achievable quantum self-attention network,”IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, vol. 36, no. 8, pp. 13 995–14 008, AUG 2025

2025

[17] [17]

Solving the traveling salesman problem with quan- tum self-attention networks,

H. Li and Y . Ruan, “Solving the traveling salesman problem with quan- tum self-attention networks,” in2024 CROSS STRAIT RADIO SCIENCE AND WIRELESS TECHNOLOGY CONFERENCE, CSRSWTC 2024, ser. Cross Strait Quad Regional Radio Science and Wireless Technology Conference. Institute of Electrical and Electronics Engineers Inc, 2024, pp. 16–18, 2024 Cross Strait ...

2024

[18] [18]

Quantum adaptive excitation network with variational quantum circuits for channel atten- tion,

Y .-C. Hsu, K.-C. Chen, T.-Y . Li, and N.-Y . Chen, “Quantum adaptive excitation network with variational quantum circuits for channel atten- tion,” in2025 IEEE International Conference on Quantum Computing and Engineering (QCE), vol. 02, 2025, pp. 344–349

2025

[19] [19]

Finding structure in time,

J. L. Elman, “Finding structure in time,”Cognitive Science, vol. 14, no. 2, pp. 179–211, 1990

1990

[20] [20]

Long short-term memory,

S. Hochreiter and J. Schmidhuber, “Long short-term memory,”Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997

1997

[21] [21]

Deep recurrent q-learning for partially observable mdps

M. J. Hausknecht and P. Stone, “Deep recurrent q-learning for partially observable mdps.” inAAAI fall symposia, vol. 45, 2015, p. 141

2015

[22] [22]

Fundamentals of recurrent neural network (rnn) and long short-term memory (lstm) network,

A. Sherstinsky, “Fundamentals of recurrent neural network (rnn) and long short-term memory (lstm) network,”Physica D: Nonlinear Phenomena, vol. 404, p. 132306, 2020. [Online]. Available: https: //www.sciencedirect.com/science/article/pii/S0167278919305974

2020

[23] [23]

Deep reinforcement learning for online computation offloading in wireless powered mobile-edge computing networks,

L. Huang, S. Bi, and Y .-J. A. Zhang, “Deep reinforcement learning for online computation offloading in wireless powered mobile-edge computing networks,”IEEE Transactions on Mobile Computing, vol. 19, no. 11, pp. 2581–2593, 2020

2020

[24] [24]

Quantum reinforcement learning,

D. Dong, C. Chen, H. Li, and T.-J. Tarn, “Quantum reinforcement learning,”IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), vol. 38, no. 5, pp. 1207–1220, 2008

2008

[25] [25]

Qtrl: Toward practical quantum reinforcement learning via quantum- train,

C.-Y . Liu, C.-H. A. Lin, C.-H. H. Yang, K.-C. Chen, and M.-H. Hsieh, “Qtrl: Toward practical quantum reinforcement learning via quantum- train,” in2024 IEEE International Conference on Quantum Computing and Engineering (QCE), vol. 02, 2024, pp. 317–322

2024