{"title":"基于深度强化学习的动态计算卸载","authors":"Baichuan Cheng, Zhilong Zhang, Danpu Liu","doi":"10.4108/eai.29-6-2019.2282108","DOIUrl":null,"url":null,"abstract":"Mobile edge computing (MEC) provides computation capability at the edge of wireless network. To reduce the execution delay, computation-intensive multimedia tasks can be offloaded from user equipments (UEs) to the MEC server. How to allocate the computational and wireless resources is one of the key issues to guarantee the quality of services, and is very challenging when tasks are generated dynamically. In this paper, we address the above problem. To minimize the sum execution delay of multiple users, we jointly optimize the offloading decision and the allocation of both computational and wireless resources. We propose a deep policy gradient (DPG) algorithm based on the deep reinforcement learning. Simulation results show that our proposed DPG method can achieve lower latency than the baselines under different numbers of users, computation capacities and wireless bandwidths.","PeriodicalId":150308,"journal":{"name":"Proceedings of the 12th EAI International Conference on Mobile Multimedia Communications, Mobimedia 2019, 29th - 30th Jun 2019, Weihai, China","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Dynamic Computation Offloading Based on Deep Reinforcement Learning\",\"authors\":\"Baichuan Cheng, Zhilong Zhang, Danpu Liu\",\"doi\":\"10.4108/eai.29-6-2019.2282108\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Mobile edge computing (MEC) provides computation capability at the edge of wireless network. To reduce the execution delay, computation-intensive multimedia tasks can be offloaded from user equipments (UEs) to the MEC server. How to allocate the computational and wireless resources is one of the key issues to guarantee the quality of services, and is very challenging when tasks are generated dynamically. In this paper, we address the above problem. To minimize the sum execution delay of multiple users, we jointly optimize the offloading decision and the allocation of both computational and wireless resources. We propose a deep policy gradient (DPG) algorithm based on the deep reinforcement learning. Simulation results show that our proposed DPG method can achieve lower latency than the baselines under different numbers of users, computation capacities and wireless bandwidths.\",\"PeriodicalId\":150308,\"journal\":{\"name\":\"Proceedings of the 12th EAI International Conference on Mobile Multimedia Communications, Mobimedia 2019, 29th - 30th Jun 2019, Weihai, China\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 12th EAI International Conference on Mobile Multimedia Communications, Mobimedia 2019, 29th - 30th Jun 2019, Weihai, China\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4108/eai.29-6-2019.2282108\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 12th EAI International Conference on Mobile Multimedia Communications, Mobimedia 2019, 29th - 30th Jun 2019, Weihai, China","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4108/eai.29-6-2019.2282108","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Dynamic Computation Offloading Based on Deep Reinforcement Learning
Mobile edge computing (MEC) provides computation capability at the edge of wireless network. To reduce the execution delay, computation-intensive multimedia tasks can be offloaded from user equipments (UEs) to the MEC server. How to allocate the computational and wireless resources is one of the key issues to guarantee the quality of services, and is very challenging when tasks are generated dynamically. In this paper, we address the above problem. To minimize the sum execution delay of multiple users, we jointly optimize the offloading decision and the allocation of both computational and wireless resources. We propose a deep policy gradient (DPG) algorithm based on the deep reinforcement learning. Simulation results show that our proposed DPG method can achieve lower latency than the baselines under different numbers of users, computation capacities and wireless bandwidths.