Soft Actor Critic Framework for Resource Allocation in Backscatter-NOMA Networks

Abdullah Alajmi, M. Fayaz, Waleed Ahsan, A. Nallanathan
{"title":"Soft Actor Critic Framework for Resource Allocation in Backscatter-NOMA Networks","authors":"Abdullah Alajmi, M. Fayaz, Waleed Ahsan, A. Nallanathan","doi":"10.1109/LATINCOM56090.2022.10000455","DOIUrl":null,"url":null,"abstract":"With the use of power domain non-orthogonal multiple access (NOMA) and backscatter communication (BAC), future sixth-generation ultra massive machine type communications networks are expected to connect large-scale Internet of things (IoT) devices. However, due to NOMA co-channel interference, the power allocation to large-scale IoT devices becomes critical. The existing convex optimization-based solutions are highly complex hence, it is difficult to find the optimal solution to the resource allocation problem in a highly dynamic environment. Therefore, this work develops an efficient model-free BACNOMA system to assist the base station for complex resource scheduling tasks in a dynamic BAC-NOMA IoT network. More specifically, we jointly optimize the transmit power of downlink IoT users and the reflection coefficient of uplink backscatter devices using a reinforcement learning algorithm, namely, softactor critic. Numerical results show that the proposed algorithm obtained a higher reward and converges to an optimal solution with respect to a large number of iterations. The proposed algorithm increases the sum rate by 57.6% as compared to the conventional optimization (benchmark) approach. Moreover, we show that the proposed algorithm outperforms the conventional BAC-NOMA scheme and BAC with orthogonal multiple access in terms of average sum rate with the increasing number of backscatter devices.","PeriodicalId":221354,"journal":{"name":"2022 IEEE Latin-American Conference on Communications (LATINCOM)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Latin-American Conference on Communications (LATINCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/LATINCOM56090.2022.10000455","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

With the use of power domain non-orthogonal multiple access (NOMA) and backscatter communication (BAC), future sixth-generation ultra massive machine type communications networks are expected to connect large-scale Internet of things (IoT) devices. However, due to NOMA co-channel interference, the power allocation to large-scale IoT devices becomes critical. The existing convex optimization-based solutions are highly complex hence, it is difficult to find the optimal solution to the resource allocation problem in a highly dynamic environment. Therefore, this work develops an efficient model-free BACNOMA system to assist the base station for complex resource scheduling tasks in a dynamic BAC-NOMA IoT network. More specifically, we jointly optimize the transmit power of downlink IoT users and the reflection coefficient of uplink backscatter devices using a reinforcement learning algorithm, namely, softactor critic. Numerical results show that the proposed algorithm obtained a higher reward and converges to an optimal solution with respect to a large number of iterations. The proposed algorithm increases the sum rate by 57.6% as compared to the conventional optimization (benchmark) approach. Moreover, we show that the proposed algorithm outperforms the conventional BAC-NOMA scheme and BAC with orthogonal multiple access in terms of average sum rate with the increasing number of backscatter devices.
后向散射- noma网络资源分配的软行为者评价框架
随着功率域非正交多址(NOMA)和反向散射通信(BAC)技术的应用,未来第六代超大型机器型通信网络有望连接大规模物联网(IoT)设备。然而,由于NOMA共信道干扰,大规模物联网设备的功率分配变得至关重要。现有的基于凸优化的求解方法非常复杂,难以在高度动态的环境中找到资源分配问题的最优解。因此,本工作开发了一种高效的无模型BACNOMA系统,以协助基站在动态BAC-NOMA物联网网络中完成复杂的资源调度任务。更具体地说,我们使用强化学习算法,即软因子批评,共同优化下行物联网用户的发射功率和上行后向散射设备的反射系数。数值结果表明,该算法获得了较高的回报,且迭代次数多时收敛于最优解。与传统的优化(基准)方法相比,该算法的和速率提高了57.6%。此外,随着后向散射器件数量的增加,该算法在平均和速率方面优于传统的BAC- noma方案和正交多址BAC方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信