通过深度强化学习规避通信干扰的认知多载波雷达

IF 7.4 1区计算机科学 Q1 TELECOMMUNICATIONS

IEEE Transactions on Cognitive Communications and Networking Pub Date : 2023-08-21 DOI:10.1109/TCCN.2023.3306854

Zhao Shan;Pengfei Liu;Lei Wang;Yimin Liu

{"title":"通过深度强化学习规避通信干扰的认知多载波雷达","authors":"Zhao Shan;Pengfei Liu;Lei Wang;Yimin Liu","doi":"10.1109/TCCN.2023.3306854","DOIUrl":null,"url":null,"abstract":"Spectrum sharing between the radar and communication systems has become increasingly prevalent in recent years, therefore reducing the communication interference is a critical issue for radar. Deep reinforcement learning (DRL) based frequency allocation is a popular approach to solving the problem, especially in the highly dynamic spectrum. However, most DRL based methods suffer from low training efficiency due to the limited channel state information (CSI). To address the challenge, we propose a cognitive multi-carrier radar (CMCR), which acquires more CSI in one transmission and thus can learn the spectrum evolution faster. The frequency allocation problem for the CMCR is formulated as a partially observable Markov decision process which is hard to solve due to the combinatorial action space. To this end, we use the Iteratively Selecting approach along with the Proximal Policy Optimization (ISPPO) to solve it. To further enhance the performance of the CMCR in a short-term task, we pre-train the policy with model agnostic meta learning (MAML). Simulation results show that the CMCR learns fast and achieves an excellent detection ability in a congested spectrum on the basis of the ISPPO method. Besides, we also illustrate the efficiency of the MAML pre-training.","PeriodicalId":13069,"journal":{"name":"IEEE Transactions on Cognitive Communications and Networking","volume":"9 6","pages":"1561-1578"},"PeriodicalIF":7.4000,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Cognitive Multi-Carrier Radar for Communication Interference Avoidance via Deep Reinforcement Learning\",\"authors\":\"Zhao Shan;Pengfei Liu;Lei Wang;Yimin Liu\",\"doi\":\"10.1109/TCCN.2023.3306854\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Spectrum sharing between the radar and communication systems has become increasingly prevalent in recent years, therefore reducing the communication interference is a critical issue for radar. Deep reinforcement learning (DRL) based frequency allocation is a popular approach to solving the problem, especially in the highly dynamic spectrum. However, most DRL based methods suffer from low training efficiency due to the limited channel state information (CSI). To address the challenge, we propose a cognitive multi-carrier radar (CMCR), which acquires more CSI in one transmission and thus can learn the spectrum evolution faster. The frequency allocation problem for the CMCR is formulated as a partially observable Markov decision process which is hard to solve due to the combinatorial action space. To this end, we use the Iteratively Selecting approach along with the Proximal Policy Optimization (ISPPO) to solve it. To further enhance the performance of the CMCR in a short-term task, we pre-train the policy with model agnostic meta learning (MAML). Simulation results show that the CMCR learns fast and achieves an excellent detection ability in a congested spectrum on the basis of the ISPPO method. Besides, we also illustrate the efficiency of the MAML pre-training.\",\"PeriodicalId\":13069,\"journal\":{\"name\":\"IEEE Transactions on Cognitive Communications and Networking\",\"volume\":\"9 6\",\"pages\":\"1561-1578\"},\"PeriodicalIF\":7.4000,\"publicationDate\":\"2023-08-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Cognitive Communications and Networking\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10225362/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"TELECOMMUNICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cognitive Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10225362/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}

引用次数: 0

摘要

近年来，雷达与通信系统之间的频谱共享越来越普遍，因此减少通信干扰是雷达面临的一个关键问题。基于深度强化学习（DRL）的频率分配是解决这一问题的常用方法，尤其是在高动态频谱中。然而，由于信道状态信息（CSI）有限，大多数基于 DRL 的方法都存在训练效率低的问题。为了应对这一挑战，我们提出了认知多载波雷达（CMCR），它能在一次传输中获取更多的 CSI，从而更快地学习频谱演化。CMCR 的频率分配问题被表述为一个部分可观测的马尔可夫决策过程，由于存在组合行动空间，这个问题很难解决。为此，我们采用迭代选择法和近端策略优化法（ISPPO）来解决这一问题。为了进一步提高 CMCR 在短期任务中的性能，我们使用模型无关元学习（MAML）对策略进行了预训练。仿真结果表明，在 ISPPO 方法的基础上，CMCR 能够快速学习并在拥塞频谱中实现出色的检测能力。此外，我们还说明了 MAML 预训练的效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Cognitive Multi-Carrier Radar for Communication Interference Avoidance via Deep Reinforcement Learning

Spectrum sharing between the radar and communication systems has become increasingly prevalent in recent years, therefore reducing the communication interference is a critical issue for radar. Deep reinforcement learning (DRL) based frequency allocation is a popular approach to solving the problem, especially in the highly dynamic spectrum. However, most DRL based methods suffer from low training efficiency due to the limited channel state information (CSI). To address the challenge, we propose a cognitive multi-carrier radar (CMCR), which acquires more CSI in one transmission and thus can learn the spectrum evolution faster. The frequency allocation problem for the CMCR is formulated as a partially observable Markov decision process which is hard to solve due to the combinatorial action space. To this end, we use the Iteratively Selecting approach along with the Proximal Policy Optimization (ISPPO) to solve it. To further enhance the performance of the CMCR in a short-term task, we pre-train the policy with model agnostic meta learning (MAML). Simulation results show that the CMCR learns fast and achieves an excellent detection ability in a congested spectrum on the basis of the ISPPO method. Besides, we also illustrate the efficiency of the MAML pre-training.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Transactions on Cognitive Communications and Networking Computer Science-Artificial Intelligence

CiteScore

15.50

自引率

7.00%

发文量

108

期刊介绍： The IEEE Transactions on Cognitive Communications and Networking (TCCN) aims to publish high-quality manuscripts that push the boundaries of cognitive communications and networking research. Cognitive, in this context, refers to the application of perception, learning, reasoning, memory, and adaptive approaches in communication system design. The transactions welcome submissions that explore various aspects of cognitive communications and networks, focusing on innovative and holistic approaches to complex system design. Key topics covered include architecture, protocols, cross-layer design, and cognition cycle design for cognitive networks. Additionally, research on machine learning, artificial intelligence, end-to-end and distributed intelligence, software-defined networking, cognitive radios, spectrum sharing, and security and privacy issues in cognitive networks are of interest. The publication also encourages papers addressing novel services and applications enabled by these cognitive concepts.