Adaptive Jamming Decision-Making Against FHSS Communications via Inexpert Demonstrations Assisted Meta Reinforcement Learning

IF 3.7 3区 计算机科学 Q2 TELECOMMUNICATIONS
Ning Rao;Hua Xu;Zisen Qi;Dan Wang;Xiang Peng;Lei Jiang
{"title":"Adaptive Jamming Decision-Making Against FHSS Communications via Inexpert Demonstrations Assisted Meta Reinforcement Learning","authors":"Ning Rao;Hua Xu;Zisen Qi;Dan Wang;Xiang Peng;Lei Jiang","doi":"10.1109/LCOMM.2024.3502423","DOIUrl":null,"url":null,"abstract":"Reinforcement learning (RL)’s powerful optimization capabilities have been extensively applied in the field of wireless communication jamming decision-making. However, the generalization of jamming policies has rarely been explored, and most existing studies rely on task-customized reward functions, which are often intractable to design. To address these issues, we propose a meta RL method for frequency-hopping spread spectrum (FHSS) jamming decision-making, aided by inexpert demonstrations. Firstly, the policy network is meta-trained with multiple diverse tasks to obtain initial network parameters with good generalization. Subsequently, we combine RL and behavioral cloning (BC) to extract useful information from demonstrations, along with learning rate adaptation to achieve efficient policy exploration without the task-customized jamming reward. Simulations confirm that our proposed method not only adapts to unseen jamming tasks with just a few fine-tuning steps under general binary rewards condition, but also achieves higher accumulated jamming rewards and results in lower normalized throughput for users, outperforming state-of-the-art methods.","PeriodicalId":13197,"journal":{"name":"IEEE Communications Letters","volume":"29 1","pages":"105-109"},"PeriodicalIF":3.7000,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Communications Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10771787/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Reinforcement learning (RL)’s powerful optimization capabilities have been extensively applied in the field of wireless communication jamming decision-making. However, the generalization of jamming policies has rarely been explored, and most existing studies rely on task-customized reward functions, which are often intractable to design. To address these issues, we propose a meta RL method for frequency-hopping spread spectrum (FHSS) jamming decision-making, aided by inexpert demonstrations. Firstly, the policy network is meta-trained with multiple diverse tasks to obtain initial network parameters with good generalization. Subsequently, we combine RL and behavioral cloning (BC) to extract useful information from demonstrations, along with learning rate adaptation to achieve efficient policy exploration without the task-customized jamming reward. Simulations confirm that our proposed method not only adapts to unseen jamming tasks with just a few fine-tuning steps under general binary rewards condition, but also achieves higher accumulated jamming rewards and results in lower normalized throughput for users, outperforming state-of-the-art methods.
基于非专家演示辅助元强化学习的FHSS通信自适应干扰决策
强化学习(RL)强大的优化能力在无线通信干扰决策领域得到了广泛的应用。然而,干扰策略的泛化研究很少,现有的研究大多依赖于任务定制的奖励函数,这往往难以设计。为了解决这些问题,我们提出了一种用于跳频扩频(FHSS)干扰决策的元RL方法,并辅以非专家演示。首先,对策略网络进行多元任务元训练,得到具有良好泛化能力的初始网络参数;随后,我们将强化学习和行为克隆(BC)相结合,从演示中提取有用的信息,并结合学习率自适应,在没有任务定制干扰奖励的情况下实现有效的策略探索。仿真结果表明,在一般二元奖励条件下,我们提出的方法不仅可以通过几个微调步骤适应未知的干扰任务,而且可以获得更高的累积干扰奖励和更低的用户标准化吞吐量,优于目前最先进的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Communications Letters
IEEE Communications Letters 工程技术-电信学
CiteScore
8.10
自引率
7.30%
发文量
590
审稿时长
2.8 months
期刊介绍: The IEEE Communications Letters publishes short papers in a rapid publication cycle on advances in the state-of-the-art of communication over different media and channels including wire, underground, waveguide, optical fiber, and storage channels. Both theoretical contributions (including new techniques, concepts, and analyses) and practical contributions (including system experiments and prototypes, and new applications) are encouraged. This journal focuses on the physical layer and the link layer of communication systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信