{"title":"Deep Reinforcement Learning Based Resource Allocation for Intelligent Reflecting Surface Assisted Dynamic Spectrum Sharing","authors":"Jianxin Guo, Zhe Wang, Jun Li, J. Zhang","doi":"10.1109/WCSP55476.2022.10039119","DOIUrl":null,"url":null,"abstract":"For cognitive radio (CR) systems, it is challenging to achieve high data rate for secondary user (SU) while avoiding the cross-link interference to primary user (PU). In this paper, we propose an intelligent reflecting surface (IRS) aided dynamic spectrum sharing scheme, where an IRS assisted SU link dynamically shares the spectrum opportunities with a PU link based on the eavesdropped automatic repeat request (ARQ) feedback from the PU. We model the dynamic spectrum sharing problem as a Markov decision process (MDP), where we jointly optimize the SU transmit power and IRS reflect beamforming to maximize the cumulative expected achievable rate of the SU link subject to the outage constraint of the PU link. We propose a deep reinforcement learning (DRL) algorithm to achieve the optimal spectrum sharing policy. Experimental results show that the proposed proximal policy optimization (PPO) algorithm outperforms the traditional policy gradient (PG) and deep deterministic policy gradient (DDPG) algorithms by 100% and 46% in terms of spectrum utilization efficiency, respectively.","PeriodicalId":199421,"journal":{"name":"2022 14th International Conference on Wireless Communications and Signal Processing (WCSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Wireless Communications and Signal Processing (WCSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCSP55476.2022.10039119","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
For cognitive radio (CR) systems, it is challenging to achieve high data rate for secondary user (SU) while avoiding the cross-link interference to primary user (PU). In this paper, we propose an intelligent reflecting surface (IRS) aided dynamic spectrum sharing scheme, where an IRS assisted SU link dynamically shares the spectrum opportunities with a PU link based on the eavesdropped automatic repeat request (ARQ) feedback from the PU. We model the dynamic spectrum sharing problem as a Markov decision process (MDP), where we jointly optimize the SU transmit power and IRS reflect beamforming to maximize the cumulative expected achievable rate of the SU link subject to the outage constraint of the PU link. We propose a deep reinforcement learning (DRL) algorithm to achieve the optimal spectrum sharing policy. Experimental results show that the proposed proximal policy optimization (PPO) algorithm outperforms the traditional policy gradient (PG) and deep deterministic policy gradient (DDPG) algorithms by 100% and 46% in terms of spectrum utilization efficiency, respectively.