基于频谱感知的异构无线资源按需接入分布式决策

Shuying Zhang, Kunze Yang, Zuyao Ni, Ming Su
{"title":"基于频谱感知的异构无线资源按需接入分布式决策","authors":"Shuying Zhang, Kunze Yang, Zuyao Ni, Ming Su","doi":"10.1109/ICCCWorkshops57813.2023.10233809","DOIUrl":null,"url":null,"abstract":"In this paper, we investigate the problem of multi-user on-demand access to heterogeneous wireless resources based on spectrum sensing (SS) considering the impact of traffic buffering scenarios, the variation of channel occupancy, and channel quality levels. Because both variations can be modeled as Markov chains, a two-layer Markov chain modeling approach and a two-layer Multi-Agent deep reinforcement learning (MARL) training model are used to achieve a reasonable matching of users’ load degrees with the heterogeneous wireless resource blocks. Specifically, the local information of spectrum sensing is used as the input state of the outer layer Agent, while the user’s current traffic queue information and the action decisions collected by the local accumulation window, and the corresponding acknowledge (ACK) feedback are used as the input state of the inner layer. A network-wide utility function associated with the user traffic changes is used to centrally train the two layers of user Agents offline. According to the simulation results, the proposed algorithm significantly simplifies the online computation and online interaction process because the complex training process has been completed offline.","PeriodicalId":201450,"journal":{"name":"2023 IEEE/CIC International Conference on Communications in China (ICCC Workshops)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Heterogeneous Wireless Resources On-demand Access Distributed Decision Based on Spectrum Sensing\",\"authors\":\"Shuying Zhang, Kunze Yang, Zuyao Ni, Ming Su\",\"doi\":\"10.1109/ICCCWorkshops57813.2023.10233809\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we investigate the problem of multi-user on-demand access to heterogeneous wireless resources based on spectrum sensing (SS) considering the impact of traffic buffering scenarios, the variation of channel occupancy, and channel quality levels. Because both variations can be modeled as Markov chains, a two-layer Markov chain modeling approach and a two-layer Multi-Agent deep reinforcement learning (MARL) training model are used to achieve a reasonable matching of users’ load degrees with the heterogeneous wireless resource blocks. Specifically, the local information of spectrum sensing is used as the input state of the outer layer Agent, while the user’s current traffic queue information and the action decisions collected by the local accumulation window, and the corresponding acknowledge (ACK) feedback are used as the input state of the inner layer. A network-wide utility function associated with the user traffic changes is used to centrally train the two layers of user Agents offline. According to the simulation results, the proposed algorithm significantly simplifies the online computation and online interaction process because the complex training process has been completed offline.\",\"PeriodicalId\":201450,\"journal\":{\"name\":\"2023 IEEE/CIC International Conference on Communications in China (ICCC Workshops)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE/CIC International Conference on Communications in China (ICCC Workshops)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCWorkshops57813.2023.10233809\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE/CIC International Conference on Communications in China (ICCC Workshops)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCWorkshops57813.2023.10233809","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文研究了基于频谱感知(SS)的多用户按需访问异构无线资源的问题,考虑了业务缓冲场景、信道占用变化和信道质量水平的影响。由于这两种变化都可以用马尔可夫链建模,因此采用两层马尔可夫链建模方法和两层Multi-Agent深度强化学习(MARL)训练模型来实现用户负载程度与异构无线资源块的合理匹配。具体而言,将频谱感知的本地信息作为外层Agent的输入状态,而将用户当前的流量队列信息和本地积累窗口收集到的动作决策,以及相应的ACK反馈作为内层Agent的输入状态。与用户流量变化相关联的网络范围实用程序函数用于离线集中训练两层用户代理。仿真结果表明,由于复杂的训练过程已经离线完成,该算法显著简化了在线计算和在线交互过程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Heterogeneous Wireless Resources On-demand Access Distributed Decision Based on Spectrum Sensing
In this paper, we investigate the problem of multi-user on-demand access to heterogeneous wireless resources based on spectrum sensing (SS) considering the impact of traffic buffering scenarios, the variation of channel occupancy, and channel quality levels. Because both variations can be modeled as Markov chains, a two-layer Markov chain modeling approach and a two-layer Multi-Agent deep reinforcement learning (MARL) training model are used to achieve a reasonable matching of users’ load degrees with the heterogeneous wireless resource blocks. Specifically, the local information of spectrum sensing is used as the input state of the outer layer Agent, while the user’s current traffic queue information and the action decisions collected by the local accumulation window, and the corresponding acknowledge (ACK) feedback are used as the input state of the inner layer. A network-wide utility function associated with the user traffic changes is used to centrally train the two layers of user Agents offline. According to the simulation results, the proposed algorithm significantly simplifies the online computation and online interaction process because the complex training process has been completed offline.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信