Jing Yang;Jialin Lu;Xu Zhou;Shaobo Li;Chuanyue Xiong;Jianjun Hu
{"title":"HA-A2C: Hard Attention and Advantage Actor-Critic for Addressing Latency Optimization in Edge Computing","authors":"Jing Yang;Jialin Lu;Xu Zhou;Shaobo Li;Chuanyue Xiong;Jianjun Hu","doi":"10.1109/TGCN.2024.3409390","DOIUrl":null,"url":null,"abstract":"Due to the rapid development of the IoT and data-driven applications, low-latency task scheduling methods that quickly respond to user tasks has become a significant challenge for edge servers. However, the existing task scheduling strategies do not overcome the impact of factors such as task characteristics, resource availability, and network conditions on delays. Meanwhile, the cross-regional maldistribution of edge servers is obvious, and the edge servers are either idle or overloaded. To address these issues, we propose a low-latency edge scheduling strategy based on the Hard Attention Mechanism and Advantage Actor-Critic (HA-A2C). The core element of this method is the adoption of a hard attention mechanism, which reduces computing complexity and increases efficiency. Effective attention allocation during the resource allocation process further reduces job completion time. Additionally, the deep reinforcement learning method is employed to enhance task dynamic scheduling capabilities, thereby reducing scheduling delays. The HA-A2C approach reduces task latency by approximately 40% compared to the DQN method. Consequently, the intelligent allocation of task resources achieved by integrating the hard attention technique significantly reduces task scheduling time in edge environments.","PeriodicalId":13052,"journal":{"name":"IEEE Transactions on Green Communications and Networking","volume":"9 1","pages":"207-217"},"PeriodicalIF":5.3000,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Green Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10547471/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Due to the rapid development of the IoT and data-driven applications, low-latency task scheduling methods that quickly respond to user tasks has become a significant challenge for edge servers. However, the existing task scheduling strategies do not overcome the impact of factors such as task characteristics, resource availability, and network conditions on delays. Meanwhile, the cross-regional maldistribution of edge servers is obvious, and the edge servers are either idle or overloaded. To address these issues, we propose a low-latency edge scheduling strategy based on the Hard Attention Mechanism and Advantage Actor-Critic (HA-A2C). The core element of this method is the adoption of a hard attention mechanism, which reduces computing complexity and increases efficiency. Effective attention allocation during the resource allocation process further reduces job completion time. Additionally, the deep reinforcement learning method is employed to enhance task dynamic scheduling capabilities, thereby reducing scheduling delays. The HA-A2C approach reduces task latency by approximately 40% compared to the DQN method. Consequently, the intelligent allocation of task resources achieved by integrating the hard attention technique significantly reduces task scheduling time in edge environments.