{"title":"基于知识转移的深度强化学习的智能全方位辅助综合传感与通信","authors":"Xiaowen Ye;Yuyi Mao;Xianghao Yu;Liqun Fu","doi":"10.1109/TWC.2025.3542780","DOIUrl":null,"url":null,"abstract":"This paper investigates an intelligent omni-surface (IOS)-assisted integrated sensing and communication (ISAC) system, where a base station provides both target sensing and communication services with an IOS. The sensing signal-to-noise ratio (SNR) is maximized while satisfying the communication requirement by optimizing IOS configurations. Conventional approaches typically need real-time and accurate channel state information (CSI) and have high computational complexity, making them difficult to implement in realistic systems. To circumvent this problem, this paper puts forth a new framework based on deep reinforcement learning (DRL) with knowledge transfer. In particular, an online learning scheme called Deep reinforcement learning IOS-ISAC (DeepOSC), is first proposed to optimize the reflecting and refracting coefficients of the IOS. Thereafter, to enable powerful reasoning and fast decision-making, we incorporate an echo state network (ESN) with separate output into DeepOSC. To further accelerate convergence, two transfer learning approaches, namely staged policy reuse (SPR) and staged policy distillation (SPD), are developed to guide the learning process of a newly deployed agent by leveraging policies of pre-trained agents. Numerical results show that compared to various benchmarks, DeepOSC attains significant sensing and communication performance gains and is more robust against outdated CSI coefficients. In addition, in comparison to conventional neural networks, ESN shortens the run-time of DeepOSC by more than ten times and is more efficient for temporal inference. Besides, we demonstrate the capabilities of SPR and SPD in accelerating the convergence of DeepOSC.","PeriodicalId":13431,"journal":{"name":"IEEE Transactions on Wireless Communications","volume":"24 5","pages":"4344-4360"},"PeriodicalIF":10.7000,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Intelligent Omni-Surface-Aided Integrated Sensing and Communications Based on Deep Reinforcement Learning With Knowledge Transfer\",\"authors\":\"Xiaowen Ye;Yuyi Mao;Xianghao Yu;Liqun Fu\",\"doi\":\"10.1109/TWC.2025.3542780\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper investigates an intelligent omni-surface (IOS)-assisted integrated sensing and communication (ISAC) system, where a base station provides both target sensing and communication services with an IOS. The sensing signal-to-noise ratio (SNR) is maximized while satisfying the communication requirement by optimizing IOS configurations. Conventional approaches typically need real-time and accurate channel state information (CSI) and have high computational complexity, making them difficult to implement in realistic systems. To circumvent this problem, this paper puts forth a new framework based on deep reinforcement learning (DRL) with knowledge transfer. In particular, an online learning scheme called Deep reinforcement learning IOS-ISAC (DeepOSC), is first proposed to optimize the reflecting and refracting coefficients of the IOS. Thereafter, to enable powerful reasoning and fast decision-making, we incorporate an echo state network (ESN) with separate output into DeepOSC. To further accelerate convergence, two transfer learning approaches, namely staged policy reuse (SPR) and staged policy distillation (SPD), are developed to guide the learning process of a newly deployed agent by leveraging policies of pre-trained agents. Numerical results show that compared to various benchmarks, DeepOSC attains significant sensing and communication performance gains and is more robust against outdated CSI coefficients. In addition, in comparison to conventional neural networks, ESN shortens the run-time of DeepOSC by more than ten times and is more efficient for temporal inference. Besides, we demonstrate the capabilities of SPR and SPD in accelerating the convergence of DeepOSC.\",\"PeriodicalId\":13431,\"journal\":{\"name\":\"IEEE Transactions on Wireless Communications\",\"volume\":\"24 5\",\"pages\":\"4344-4360\"},\"PeriodicalIF\":10.7000,\"publicationDate\":\"2025-02-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Wireless Communications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10907774/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Wireless Communications","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10907774/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Intelligent Omni-Surface-Aided Integrated Sensing and Communications Based on Deep Reinforcement Learning With Knowledge Transfer
This paper investigates an intelligent omni-surface (IOS)-assisted integrated sensing and communication (ISAC) system, where a base station provides both target sensing and communication services with an IOS. The sensing signal-to-noise ratio (SNR) is maximized while satisfying the communication requirement by optimizing IOS configurations. Conventional approaches typically need real-time and accurate channel state information (CSI) and have high computational complexity, making them difficult to implement in realistic systems. To circumvent this problem, this paper puts forth a new framework based on deep reinforcement learning (DRL) with knowledge transfer. In particular, an online learning scheme called Deep reinforcement learning IOS-ISAC (DeepOSC), is first proposed to optimize the reflecting and refracting coefficients of the IOS. Thereafter, to enable powerful reasoning and fast decision-making, we incorporate an echo state network (ESN) with separate output into DeepOSC. To further accelerate convergence, two transfer learning approaches, namely staged policy reuse (SPR) and staged policy distillation (SPD), are developed to guide the learning process of a newly deployed agent by leveraging policies of pre-trained agents. Numerical results show that compared to various benchmarks, DeepOSC attains significant sensing and communication performance gains and is more robust against outdated CSI coefficients. In addition, in comparison to conventional neural networks, ESN shortens the run-time of DeepOSC by more than ten times and is more efficient for temporal inference. Besides, we demonstrate the capabilities of SPR and SPD in accelerating the convergence of DeepOSC.
期刊介绍:
The IEEE Transactions on Wireless Communications is a prestigious publication that showcases cutting-edge advancements in wireless communications. It welcomes both theoretical and practical contributions in various areas. The scope of the Transactions encompasses a wide range of topics, including modulation and coding, detection and estimation, propagation and channel characterization, and diversity techniques. The journal also emphasizes the physical and link layer communication aspects of network architectures and protocols.
The journal is open to papers on specific topics or non-traditional topics related to specific application areas. This includes simulation tools and methodologies, orthogonal frequency division multiplexing, MIMO systems, and wireless over optical technologies.
Overall, the IEEE Transactions on Wireless Communications serves as a platform for high-quality manuscripts that push the boundaries of wireless communications and contribute to advancements in the field.