{"title":"Twin Delayed DDPG based Dynamic Power Allocation for Internet of Robotic Things","authors":"Homayun Kabir, Mau-Luen Tham, Yoong Choon Chang","doi":"10.23919/softcom55329.2022.9911277","DOIUrl":null,"url":null,"abstract":"The internet of robotic things (IoRT) is an emerging technology that combines user equipment (UE) by allowing communications among each other and data transmission with existing communications and network protocols. However, current IoRT network topologies and resources are insufficient to handle this massive data flow and meet the quality of service (QoS) requirements due to the rapid increment of connected UEs. Hence, the most crucial challenge is radio resource management by controlling the emitting power of the antenna called power allocation (PA), considering the interfering multiple access channel (IMAC). In this paper, we propose a data-driven and model-free twin delayed deep deterministic policy gradient (TD3) algorithm which controls the continuous power level of the PA. TD3 is a modified algorithm of deep deterministic policy gradient (DDPG) that consists of six networks: two actors (one for model and the other for target) and four critics (two for models and two for targets) networks. Results show that the proposed TD3 algorithm outperforms the model-based methods such as fractional programming (FP) and weighted MMSE (WMMSE) as well as model-free algorithms, for example, deep Q network (DQN) and DDPG on sum-rate performance with good generalization power.","PeriodicalId":261625,"journal":{"name":"2022 International Conference on Software, Telecommunications and Computer Networks (SoftCOM)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Software, Telecommunications and Computer Networks (SoftCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/softcom55329.2022.9911277","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The internet of robotic things (IoRT) is an emerging technology that combines user equipment (UE) by allowing communications among each other and data transmission with existing communications and network protocols. However, current IoRT network topologies and resources are insufficient to handle this massive data flow and meet the quality of service (QoS) requirements due to the rapid increment of connected UEs. Hence, the most crucial challenge is radio resource management by controlling the emitting power of the antenna called power allocation (PA), considering the interfering multiple access channel (IMAC). In this paper, we propose a data-driven and model-free twin delayed deep deterministic policy gradient (TD3) algorithm which controls the continuous power level of the PA. TD3 is a modified algorithm of deep deterministic policy gradient (DDPG) that consists of six networks: two actors (one for model and the other for target) and four critics (two for models and two for targets) networks. Results show that the proposed TD3 algorithm outperforms the model-based methods such as fractional programming (FP) and weighted MMSE (WMMSE) as well as model-free algorithms, for example, deep Q network (DQN) and DDPG on sum-rate performance with good generalization power.