基于频谱感知的异构无线资源按需接入分布式决策

2023 IEEE/CIC International Conference on Communications in China (ICCC Workshops) Pub Date : 2023-08-10 DOI:10.1109/ICCCWorkshops57813.2023.10233809

Shuying Zhang, Kunze Yang, Zuyao Ni, Ming Su

{"title":"基于频谱感知的异构无线资源按需接入分布式决策","authors":"Shuying Zhang, Kunze Yang, Zuyao Ni, Ming Su","doi":"10.1109/ICCCWorkshops57813.2023.10233809","DOIUrl":null,"url":null,"abstract":"In this paper, we investigate the problem of multi-user on-demand access to heterogeneous wireless resources based on spectrum sensing (SS) considering the impact of traffic buffering scenarios, the variation of channel occupancy, and channel quality levels. Because both variations can be modeled as Markov chains, a two-layer Markov chain modeling approach and a two-layer Multi-Agent deep reinforcement learning (MARL) training model are used to achieve a reasonable matching of users’ load degrees with the heterogeneous wireless resource blocks. Specifically, the local information of spectrum sensing is used as the input state of the outer layer Agent, while the user’s current traffic queue information and the action decisions collected by the local accumulation window, and the corresponding acknowledge (ACK) feedback are used as the input state of the inner layer. A network-wide utility function associated with the user traffic changes is used to centrally train the two layers of user Agents offline. According to the simulation results, the proposed algorithm significantly simplifies the online computation and online interaction process because the complex training process has been completed offline.","PeriodicalId":201450,"journal":{"name":"2023 IEEE/CIC International Conference on Communications in China (ICCC Workshops)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Heterogeneous Wireless Resources On-demand Access Distributed Decision Based on Spectrum Sensing\",\"authors\":\"Shuying Zhang, Kunze Yang, Zuyao Ni, Ming Su\",\"doi\":\"10.1109/ICCCWorkshops57813.2023.10233809\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we investigate the problem of multi-user on-demand access to heterogeneous wireless resources based on spectrum sensing (SS) considering the impact of traffic buffering scenarios, the variation of channel occupancy, and channel quality levels. Because both variations can be modeled as Markov chains, a two-layer Markov chain modeling approach and a two-layer Multi-Agent deep reinforcement learning (MARL) training model are used to achieve a reasonable matching of users’ load degrees with the heterogeneous wireless resource blocks. Specifically, the local information of spectrum sensing is used as the input state of the outer layer Agent, while the user’s current traffic queue information and the action decisions collected by the local accumulation window, and the corresponding acknowledge (ACK) feedback are used as the input state of the inner layer. A network-wide utility function associated with the user traffic changes is used to centrally train the two layers of user Agents offline. According to the simulation results, the proposed algorithm significantly simplifies the online computation and online interaction process because the complex training process has been completed offline.\",\"PeriodicalId\":201450,\"journal\":{\"name\":\"2023 IEEE/CIC International Conference on Communications in China (ICCC Workshops)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE/CIC International Conference on Communications in China (ICCC Workshops)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCWorkshops57813.2023.10233809\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE/CIC International Conference on Communications in China (ICCC Workshops)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCWorkshops57813.2023.10233809","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文研究了基于频谱感知(SS)的多用户按需访问异构无线资源的问题，考虑了业务缓冲场景、信道占用变化和信道质量水平的影响。由于这两种变化都可以用马尔可夫链建模，因此采用两层马尔可夫链建模方法和两层Multi-Agent深度强化学习(MARL)训练模型来实现用户负载程度与异构无线资源块的合理匹配。具体而言，将频谱感知的本地信息作为外层Agent的输入状态，而将用户当前的流量队列信息和本地积累窗口收集到的动作决策，以及相应的ACK反馈作为内层Agent的输入状态。与用户流量变化相关联的网络范围实用程序函数用于离线集中训练两层用户代理。仿真结果表明，由于复杂的训练过程已经离线完成，该算法显著简化了在线计算和在线交互过程。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Heterogeneous Wireless Resources On-demand Access Distributed Decision Based on Spectrum Sensing

In this paper, we investigate the problem of multi-user on-demand access to heterogeneous wireless resources based on spectrum sensing (SS) considering the impact of traffic buffering scenarios, the variation of channel occupancy, and channel quality levels. Because both variations can be modeled as Markov chains, a two-layer Markov chain modeling approach and a two-layer Multi-Agent deep reinforcement learning (MARL) training model are used to achieve a reasonable matching of users’ load degrees with the heterogeneous wireless resource blocks. Specifically, the local information of spectrum sensing is used as the input state of the outer layer Agent, while the user’s current traffic queue information and the action decisions collected by the local accumulation window, and the corresponding acknowledge (ACK) feedback are used as the input state of the inner layer. A network-wide utility function associated with the user traffic changes is used to centrally train the two layers of user Agents offline. According to the simulation results, the proposed algorithm significantly simplifies the online computation and online interaction process because the complex training process has been completed offline.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2023 IEEE/CIC International Conference on Communications in China (ICCC Workshops)

自引率

0.00%

发文量