Research on Delay DRL in Energy-Constrained CR-NOMA Networks based on Multi-Threads Markov Reward Process

Qiuping Jiang, Chenyu Zhang, Wei Zheng, X. Wen
{"title":"Research on Delay DRL in Energy-Constrained CR-NOMA Networks based on Multi-Threads Markov Reward Process","authors":"Qiuping Jiang, Chenyu Zhang, Wei Zheng, X. Wen","doi":"10.1109/ICCC56324.2022.10065916","DOIUrl":null,"url":null,"abstract":"Applying deep reinforcement learning in wireless networks has been a hot topic in the field of non-orthogonal multiple access. Most present works focus on the design of algorithms and ignore one of the practical problems when deploying them in actual networks: the computing delay, which may lead to performance deterioration. In this paper, we focus on DDPG applied in energy-constrained CR-NOMA networks with delays and propose a multi-threads scheme to assist the main agent to select action in time. We first discuss the workflow of the proposed scheme, and then restore the impaired Markovianity due to the introduction of subthreads by enhancing the state space in MRP. Test results show that the proposed scheme can significantly improve the performance of DDPG in CR-NOMA networks with delays.","PeriodicalId":263098,"journal":{"name":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCC56324.2022.10065916","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Applying deep reinforcement learning in wireless networks has been a hot topic in the field of non-orthogonal multiple access. Most present works focus on the design of algorithms and ignore one of the practical problems when deploying them in actual networks: the computing delay, which may lead to performance deterioration. In this paper, we focus on DDPG applied in energy-constrained CR-NOMA networks with delays and propose a multi-threads scheme to assist the main agent to select action in time. We first discuss the workflow of the proposed scheme, and then restore the impaired Markovianity due to the introduction of subthreads by enhancing the state space in MRP. Test results show that the proposed scheme can significantly improve the performance of DDPG in CR-NOMA networks with delays.
基于多线程马尔可夫奖励过程的能量约束CR-NOMA网络延迟DRL研究
在无线网络中应用深度强化学习已成为非正交多址领域的研究热点。目前大多数的工作都集中在算法的设计上,而忽略了在实际网络中部署算法时的一个实际问题:计算延迟,这可能会导致性能下降。本文重点研究了DDPG在能量受限、具有时延的CR-NOMA网络中的应用,提出了一种多线程方案来帮助主agent及时选择动作。我们首先讨论了该方案的工作流程,然后通过增强MRP中的状态空间来恢复由于引入子线程而受损的马尔可夫性。测试结果表明,该方案能显著提高具有时延的CR-NOMA网络中DDPG的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信