{"title":"基于深度强化学习的协同计算卸载方案","authors":"Jinho Park, K. Chung","doi":"10.1109/icoin56518.2023.10048957","DOIUrl":null,"url":null,"abstract":"Deep Reinforcement Learning (DRL)-based computation offloading scheme has been proposed to improve the Quality of Experience (QoE). However, the existing DRL does not consider the temporal states because of the fully connected layer. Also, DRL learns the policy regardless of the importance of experience. To solve these problems, we propose a collaborative computation offloading scheme with DRL. First, we define the objective function about task service time and load balance. Second, we utilize the Least Absolute Shrinkage and Selection Operator (LASSO) regression in the backbone network for considering temporal states. Finally, we prioritize the experience according to the Temporal Difference (TD) error and learning loss. The simulation results show that the proposed scheme achieves high QoE due to low task service time and high load balance.","PeriodicalId":285763,"journal":{"name":"2023 International Conference on Information Networking (ICOIN)","volume":"125 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Collaborative Computation Offloading Scheme Based on Deep Reinforcement Learning\",\"authors\":\"Jinho Park, K. Chung\",\"doi\":\"10.1109/icoin56518.2023.10048957\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep Reinforcement Learning (DRL)-based computation offloading scheme has been proposed to improve the Quality of Experience (QoE). However, the existing DRL does not consider the temporal states because of the fully connected layer. Also, DRL learns the policy regardless of the importance of experience. To solve these problems, we propose a collaborative computation offloading scheme with DRL. First, we define the objective function about task service time and load balance. Second, we utilize the Least Absolute Shrinkage and Selection Operator (LASSO) regression in the backbone network for considering temporal states. Finally, we prioritize the experience according to the Temporal Difference (TD) error and learning loss. The simulation results show that the proposed scheme achieves high QoE due to low task service time and high load balance.\",\"PeriodicalId\":285763,\"journal\":{\"name\":\"2023 International Conference on Information Networking (ICOIN)\",\"volume\":\"125 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Information Networking (ICOIN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/icoin56518.2023.10048957\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Information Networking (ICOIN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icoin56518.2023.10048957","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Collaborative Computation Offloading Scheme Based on Deep Reinforcement Learning
Deep Reinforcement Learning (DRL)-based computation offloading scheme has been proposed to improve the Quality of Experience (QoE). However, the existing DRL does not consider the temporal states because of the fully connected layer. Also, DRL learns the policy regardless of the importance of experience. To solve these problems, we propose a collaborative computation offloading scheme with DRL. First, we define the objective function about task service time and load balance. Second, we utilize the Least Absolute Shrinkage and Selection Operator (LASSO) regression in the backbone network for considering temporal states. Finally, we prioritize the experience according to the Temporal Difference (TD) error and learning loss. The simulation results show that the proposed scheme achieves high QoE due to low task service time and high load balance.