
arxiv: 1901.10646 · v1 · pith:NVOCJ4WDnew · submitted 2019-01-30 · 💻 cs.NI

An Actor-Critic Reinforcement Learning Method for Computation Offloading with Delay Constraints in Mobile Edge Computing

classification 💻 cs.NI
keywords computing, edge, delay, mobile, system, actor-critic, average, constrained

In this paper, we consider a mobile edge computing system in which a cloud server and an edge server collaboratively provide computing services. Mobile edge computing can both reduce service delay and ease the load on the core network. We model the problem of maximizing the average system revenue subject to average delay constraints for services of different priorities as a constrained semi-Markov decision process (SMDP). We propose an actor-critic algorithm with eligibility traces to solve the constrained SMDP. We use neural networks to train the policy parameters and the parameters of the state value function, continuously improving system performance.
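
The abstract names the ingredients (an actor, a critic, eligibility traces, and a delay constraint) but gives no update rules, so the following is only a minimal sketch of how such a method is commonly put together, not the authors' algorithm. Several simplifications are assumed: tabular parameters stand in for the neural networks, a discounted objective with factor `exp(-beta * tau)` over the random sojourn time `tau` stands in for the average-revenue criterion, a single delay constraint is handled with a Lagrange multiplier `mu`, and `toy_env` (its revenues, delays, and transitions) is entirely hypothetical.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array of action preferences."""
    z = np.exp(x - x.max())
    return z / z.sum()


def toy_env(s, a, rng, n_states):
    """Hypothetical offloading environment; s is the edge-server occupancy.

    Action 0 serves the request at the edge, action 1 forwards it to the cloud.
    Returns (revenue, delay, sojourn_time, next_state). All numbers are made up.
    """
    if a == 0:
        revenue, delay = 2.0, 0.5 * (s + 1)   # edge delay grows with occupancy
        s_next = min(s + 1, n_states - 1)
    else:
        revenue, delay = 1.0, 1.5             # cloud: lower revenue, fixed delay
        s_next = max(s - 1, 0)
    tau = rng.exponential(1.0)                # random time until next decision
    return revenue, delay, tau, s_next


def run_actor_critic(n_steps=5000, n_states=4, n_actions=2, beta=0.1, lam=0.8,
                     alpha_v=0.05, alpha_pi=0.02, alpha_mu=0.01,
                     delay_budget=1.0, seed=0):
    """Tabular actor-critic with eligibility traces and a Lagrangian constraint."""
    rng = np.random.default_rng(seed)
    theta = np.zeros((n_states, n_actions))   # actor: softmax preferences
    v = np.zeros(n_states)                    # critic: state-value estimates
    e_theta = np.zeros_like(theta)            # actor eligibility trace
    e_v = np.zeros_like(v)                    # critic eligibility trace
    mu = 0.0                                  # multiplier for the delay constraint
    s = 0
    for _ in range(n_steps):
        pi = softmax(theta[s])
        a = rng.choice(n_actions, p=pi)
        revenue, delay, tau, s_next = toy_env(s, a, rng, n_states)

        # Lagrangian reward: revenue penalised by delay above the budget.
        r = revenue - mu * (delay - delay_budget)
        discount = np.exp(-beta * tau)        # SMDP-style discount over sojourn time
        delta = r + discount * v[s_next] - v[s]   # temporal-difference error

        # Decay both traces, then add the current gradients.
        e_v *= discount * lam
        e_v[s] += 1.0
        grad_log_pi = -pi                     # d log pi(a|s) / d theta[s, :]
        grad_log_pi[a] += 1.0
        e_theta *= discount * lam
        e_theta[s] += grad_log_pi

        v += alpha_v * delta * e_v            # critic update
        theta += alpha_pi * delta * e_theta   # actor update
        # Dual ascent on mu, projected onto mu >= 0.
        mu = max(0.0, mu + alpha_mu * (delay - delay_budget))
        s = s_next
    return theta, v, mu
```

Calling `theta, v, mu = run_actor_critic()` returns the learned preferences, value estimates, and multiplier; applying `softmax` to each row of `theta` gives the per-state offloading policy. In this sketch the multiplier rises whenever the observed delay exceeds `delay_budget`, which shifts the actor toward lower-delay actions.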
