基于深度强化学习的覆盖蜂窝网络的 D2D 频谱分配

IF 2.1 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Wireless Networks Pub Date : 2024-05-30 DOI:10.1007/s11276-024-03766-6

Yao-Jen Liang, Yu-Chan Tseng, Chi-Wen Hsieh

{"title":"基于深度强化学习的覆盖蜂窝网络的 D2D 频谱分配","authors":"Yao-Jen Liang, Yu-Chan Tseng, Chi-Wen Hsieh","doi":"10.1007/s11276-024-03766-6","DOIUrl":null,"url":null,"abstract":"<p>We develop a deep reinforcement learning-based (DRL) spectrum access scheme for device-to-device communications in an underlay cellular network. Based on the DRL scheme, the base station aims to maximize the overall system throughput of both the D2D and cellular communications by learning an optimal spectrum allocation strategy. While D2D pairs dynamically access the time slots (TSs) of a shared spectrum belonging to a dedicated cellular user (CU). In particular, to ensure that the quality of service (QoS) requirement of cell-edge CUs, this paper addresses the various positions of CUs and D2D pairs by dividing the cellular area into shareable and un-shareable areas. Then, a double deep Q-network is adopted for the BS to decide whether and which D2D pair can access each TS within a shared spectrum. The proposed DDQN spectrum allocation not only enjoys low computational complexity since just current state information is utilized as input, but also approaches the throughput of exhaustive search method since received signal-to-noise ratios are utilized as inputs. Numerical results show that the proposed deep learning-based spectrum access scheme outperforms the state-of-art algorithms in terms of throughput.</p>","PeriodicalId":23750,"journal":{"name":"Wireless Networks","volume":"62 1","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A deep reinforcement learning-based D2D spectrum allocation underlaying a cellular network\",\"authors\":\"Yao-Jen Liang, Yu-Chan Tseng, Chi-Wen Hsieh\",\"doi\":\"10.1007/s11276-024-03766-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>We develop a deep reinforcement learning-based (DRL) spectrum access scheme for device-to-device communications in an underlay cellular network. Based on the DRL scheme, the base station aims to maximize the overall system throughput of both the D2D and cellular communications by learning an optimal spectrum allocation strategy. While D2D pairs dynamically access the time slots (TSs) of a shared spectrum belonging to a dedicated cellular user (CU). In particular, to ensure that the quality of service (QoS) requirement of cell-edge CUs, this paper addresses the various positions of CUs and D2D pairs by dividing the cellular area into shareable and un-shareable areas. Then, a double deep Q-network is adopted for the BS to decide whether and which D2D pair can access each TS within a shared spectrum. The proposed DDQN spectrum allocation not only enjoys low computational complexity since just current state information is utilized as input, but also approaches the throughput of exhaustive search method since received signal-to-noise ratios are utilized as inputs. Numerical results show that the proposed deep learning-based spectrum access scheme outperforms the state-of-art algorithms in terms of throughput.</p>\",\"PeriodicalId\":23750,\"journal\":{\"name\":\"Wireless Networks\",\"volume\":\"62 1\",\"pages\":\"\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2024-05-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Wireless Networks\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s11276-024-03766-6\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wireless Networks","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11276-024-03766-6","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

我们开发了一种基于深度强化学习（DRL）的频谱接入方案，用于蜂窝底层网络中的设备对设备通信。基于 DRL 方案，基站旨在通过学习最优频谱分配策略，最大化 D2D 和蜂窝通信的整体系统吞吐量。D2D 对动态访问属于专用蜂窝用户（CU）的共享频谱时隙（TS）。特别是，为了确保小区边缘 CU 的服务质量（QoS）要求，本文通过将蜂窝区域划分为可共享区域和不可共享区域来解决 CU 和 D2D 对的不同位置问题。然后，BS 采用双深 Q 网络来决定是否以及哪个 D2D 对可以访问共享频谱内的每个 TS。由于只使用当前状态信息作为输入，因此所提出的 DDQN 频谱分配不仅计算复杂度低，而且由于使用接收信噪比作为输入，因此其吞吐量接近穷举搜索方法。数值结果表明，所提出的基于深度学习的频谱接入方案在吞吐量方面优于最先进的算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

A deep reinforcement learning-based D2D spectrum allocation underlaying a cellular network

查看原文本刊更多论文

A deep reinforcement learning-based D2D spectrum allocation underlaying a cellular network

We develop a deep reinforcement learning-based (DRL) spectrum access scheme for device-to-device communications in an underlay cellular network. Based on the DRL scheme, the base station aims to maximize the overall system throughput of both the D2D and cellular communications by learning an optimal spectrum allocation strategy. While D2D pairs dynamically access the time slots (TSs) of a shared spectrum belonging to a dedicated cellular user (CU). In particular, to ensure that the quality of service (QoS) requirement of cell-edge CUs, this paper addresses the various positions of CUs and D2D pairs by dividing the cellular area into shareable and un-shareable areas. Then, a double deep Q-network is adopted for the BS to decide whether and which D2D pair can access each TS within a shared spectrum. The proposed DDQN spectrum allocation not only enjoys low computational complexity since just current state information is utilized as input, but also approaches the throughput of exhaustive search method since received signal-to-noise ratios are utilized as inputs. Numerical results show that the proposed deep learning-based spectrum access scheme outperforms the state-of-art algorithms in terms of throughput.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Wireless Networks 工程技术-电信学

CiteScore

7.70

自引率

3.30%

发文量

314

审稿时长

5.5 months

期刊介绍： The wireless communication revolution is bringing fundamental changes to data networking, telecommunication, and is making integrated networks a reality. By freeing the user from the cord, personal communications networks, wireless LAN''s, mobile radio networks and cellular systems, harbor the promise of fully distributed mobile computing and communications, any time, anywhere. Focusing on the networking and user aspects of the field, Wireless Networks provides a global forum for archival value contributions documenting these fast growing areas of interest. The journal publishes refereed articles dealing with research, experience and management issues of wireless networks. Its aim is to allow the reader to benefit from experience, problems and solutions described.