从有限观测中感知干扰策略:模仿学习的视角

IF 4.6 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Youlin Fan;Bo Jiu;Wenqiang Pu;Ziniu Li;Kang Li;Hongwei Liu
{"title":"从有限观测中感知干扰策略:模仿学习的视角","authors":"Youlin Fan;Bo Jiu;Wenqiang Pu;Ziniu Li;Kang Li;Hongwei Liu","doi":"10.1109/TSP.2024.3443121","DOIUrl":null,"url":null,"abstract":"This paper studies the problem of sensing mainlobe jamming strategy through interaction samples between a frequency agile radar and a transmit/receive time-sharing jammer. We model this interaction as an episodic Markov decision process, where the jammer's strategy is treated as the state transition probability that needs to be learned. To effectively learn the strategy, we employ two sensing criteria from the imitation learning perspective: Behavioral Cloning (BC) and Generative Adversarial Imitation Learning (GAIL). These criteria enable us to imitate the jammer's strategy based on collected interaction samples. Our theoretical analysis indicates that GAIL provides more accurate strategy sensing performance, while BC offers faster learning. Experimental results corroborate these findings. Additionally, empirical evidence shows that our trained anti-jamming strategies, informed by either BC or GAIL, significantly outperform existing intelligent anti-jamming strategy learning methods in terms of sample efficiency.","PeriodicalId":13330,"journal":{"name":"IEEE Transactions on Signal Processing","volume":"72 ","pages":"4098-4114"},"PeriodicalIF":4.6000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sensing Jamming Strategy From Limited Observations: An Imitation Learning Perspective\",\"authors\":\"Youlin Fan;Bo Jiu;Wenqiang Pu;Ziniu Li;Kang Li;Hongwei Liu\",\"doi\":\"10.1109/TSP.2024.3443121\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper studies the problem of sensing mainlobe jamming strategy through interaction samples between a frequency agile radar and a transmit/receive time-sharing jammer. We model this interaction as an episodic Markov decision process, where the jammer's strategy is treated as the state transition probability that needs to be learned. To effectively learn the strategy, we employ two sensing criteria from the imitation learning perspective: Behavioral Cloning (BC) and Generative Adversarial Imitation Learning (GAIL). These criteria enable us to imitate the jammer's strategy based on collected interaction samples. Our theoretical analysis indicates that GAIL provides more accurate strategy sensing performance, while BC offers faster learning. Experimental results corroborate these findings. Additionally, empirical evidence shows that our trained anti-jamming strategies, informed by either BC or GAIL, significantly outperform existing intelligent anti-jamming strategy learning methods in terms of sample efficiency.\",\"PeriodicalId\":13330,\"journal\":{\"name\":\"IEEE Transactions on Signal Processing\",\"volume\":\"72 \",\"pages\":\"4098-4114\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-08-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10634527/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10634527/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

本文研究了通过频率敏捷雷达与发射/接收分时干扰器之间的交互样本来感知主波干扰策略的问题。我们将这种交互建模为一个偶发马尔可夫决策过程,其中干扰者的策略被视为需要学习的状态转换概率。为了有效地学习策略,我们从模仿学习的角度出发,采用了两种感知标准:行为克隆(BC)和生成对抗模仿学习(GAIL)。这些标准使我们能够根据收集到的交互样本模仿干扰者的策略。我们的理论分析表明,GAIL 能提供更准确的策略感知性能,而 BC 则能提供更快的学习速度。实验结果证实了这些结论。此外,经验证据表明,在 BC 或 GAIL 的指导下,我们训练的反干扰策略在样本效率方面明显优于现有的智能反干扰策略学习方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Sensing Jamming Strategy From Limited Observations: An Imitation Learning Perspective
This paper studies the problem of sensing mainlobe jamming strategy through interaction samples between a frequency agile radar and a transmit/receive time-sharing jammer. We model this interaction as an episodic Markov decision process, where the jammer's strategy is treated as the state transition probability that needs to be learned. To effectively learn the strategy, we employ two sensing criteria from the imitation learning perspective: Behavioral Cloning (BC) and Generative Adversarial Imitation Learning (GAIL). These criteria enable us to imitate the jammer's strategy based on collected interaction samples. Our theoretical analysis indicates that GAIL provides more accurate strategy sensing performance, while BC offers faster learning. Experimental results corroborate these findings. Additionally, empirical evidence shows that our trained anti-jamming strategies, informed by either BC or GAIL, significantly outperform existing intelligent anti-jamming strategy learning methods in terms of sample efficiency.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Transactions on Signal Processing
IEEE Transactions on Signal Processing 工程技术-工程:电子与电气
CiteScore
11.20
自引率
9.30%
发文量
310
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Signal Processing covers novel theory, algorithms, performance analyses and applications of techniques for the processing, understanding, learning, retrieval, mining, and extraction of information from signals. The term “signal” includes, among others, audio, video, speech, image, communication, geophysical, sonar, radar, medical and musical signals. Examples of topics of interest include, but are not limited to, information processing and the theory and application of filtering, coding, transmitting, estimating, detecting, analyzing, recognizing, synthesizing, recording, and reproducing signals.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信