Deep Reinforcement Learning for Performance-Aware Adaptive Resource Allocation in Mobile Edge Computing

Binbin Huang, Zhongjin Li, Yunqiu Xu, Linxuan Pan, Shangguang Wang, Victor Chang, Haiyang Hu

Research output: Contribution to journalArticlepeer-review

62 Downloads (Pure)


Mobile edge computing (MEC) enables to provide relatively rich computing resources in close proximity to mobile users, which enables resource-limited mobile devices to offload workloads to nearby edge servers, and thereby greatly reducing the processing delay of various mobile applications and the energy consumption of mobile devices. Despite its advantages, when a large number of mobile users simultaneously offloads their computation tasks to an edge server, due to the limited computation and communication resources of edge server, inefficiency resource allocation will not make full use of the limited resource and cause waste of resource, resulting in low system performance (the weighted sum of the number of processed tasks, the number of punished tasks, and the number of dropped tasks). Therefore, it is a challenging problem to effectively allocate the computing and communication resources to multiple mobile users. To cope with this problem, we propose a performance-aware resource allocation (PARA) scheme, the goal of which is to maximize the long-term system performance. More specifically, we first build the multiuser resource allocation architecture for computing workloads and transmitting result data to mobile devices. Then, we formulate the multiuser resource allocation problem as a Markova Decision Process (MDP). To achieve this problem, a performance-aware resource allocation (PARA) scheme based on a deep deterministic policy gradient (DDPG) is adopted to derive optimal resource allocation policy. Finally, extensive simulation experiments demonstrate the effectiveness of the PARA scheme.
Original languageEnglish
JournalWireless Communications and Mobile Computing
Publication statusPublished - 2 Jul 2020


Dive into the research topics of 'Deep Reinforcement Learning for Performance-Aware Adaptive Resource Allocation in Mobile Edge Computing'. Together they form a unique fingerprint.

Cite this