基于优先辅助软评价的中断采样中继器干扰方法

IF 3.6 2区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Signal Processing Pub Date : 2025-08-28 DOI:10.1016/j.sigpro.2025.110259

Jiaqi Tan , Tianpeng Liu , Weidong Jiang , Dewang Wang , Zhen Liu , Zhongguo Wu

{"title":"基于优先辅助软评价的中断采样中继器干扰方法","authors":"Jiaqi Tan , Tianpeng Liu , Weidong Jiang , Dewang Wang , Zhen Liu , Zhongguo Wu","doi":"10.1016/j.sigpro.2025.110259","DOIUrl":null,"url":null,"abstract":"<div><div>Interrupted sampling repeater jamming (ISRJ) is an electronic countermeasure widely used in airborne self-defense jamming equipment, with its performance heavily dependent on the jamming parameters. In non-cooperative and dynamic countermeasure scenarios, existing jamming decision-making methods that rely heavily on expert knowledge bases and precise mathematical models are prone to failure, making it challenging to achieve intelligent and adaptive jamming. Furthermore, due to the multi-dimensional and continuous nature of parameter spaces, these methods tend to converge slowly. To address these issues, we propose a priori-assisted soft actor–critic (SAC) based ISRJ method. In our method, we first model the ISRJ decisions throughout the entire penetration process as a Markov decision process (MDP) and design distinct reward functions for both cooperative and non-cooperative scenarios. We then utilize the SAC algorithm to learn the optimal strategy in an unknown, dynamic environment while efficiently managing large state–action spaces. To accelerate convergence while effectively avoiding potential local optima, we propose the priori-assisted SAC algorithm to solve the above MDP. Compared to the classical SAC, the proposed algorithm integrates expert prior information, which assists the agent in exploring more effectively, thereby improving both the efficiency and quality of policy learning. Extensive simulation results confirm the superiority of the proposed method.</div></div>","PeriodicalId":49523,"journal":{"name":"Signal Processing","volume":"239 ","pages":"Article 110259"},"PeriodicalIF":3.6000,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Priori-assisted soft actor–critic based interrupted sampling repeater jamming method\",\"authors\":\"Jiaqi Tan , Tianpeng Liu , Weidong Jiang , Dewang Wang , Zhen Liu , Zhongguo Wu\",\"doi\":\"10.1016/j.sigpro.2025.110259\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Interrupted sampling repeater jamming (ISRJ) is an electronic countermeasure widely used in airborne self-defense jamming equipment, with its performance heavily dependent on the jamming parameters. In non-cooperative and dynamic countermeasure scenarios, existing jamming decision-making methods that rely heavily on expert knowledge bases and precise mathematical models are prone to failure, making it challenging to achieve intelligent and adaptive jamming. Furthermore, due to the multi-dimensional and continuous nature of parameter spaces, these methods tend to converge slowly. To address these issues, we propose a priori-assisted soft actor–critic (SAC) based ISRJ method. In our method, we first model the ISRJ decisions throughout the entire penetration process as a Markov decision process (MDP) and design distinct reward functions for both cooperative and non-cooperative scenarios. We then utilize the SAC algorithm to learn the optimal strategy in an unknown, dynamic environment while efficiently managing large state–action spaces. To accelerate convergence while effectively avoiding potential local optima, we propose the priori-assisted SAC algorithm to solve the above MDP. Compared to the classical SAC, the proposed algorithm integrates expert prior information, which assists the agent in exploring more effectively, thereby improving both the efficiency and quality of policy learning. Extensive simulation results confirm the superiority of the proposed method.</div></div>\",\"PeriodicalId\":49523,\"journal\":{\"name\":\"Signal Processing\",\"volume\":\"239 \",\"pages\":\"Article 110259\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2025-08-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0165168425003731\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165168425003731","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

中断采样中继器干扰（ISRJ）是一种广泛应用于机载自卫干扰设备的电子对抗手段，其性能在很大程度上取决于干扰参数。在非合作动态对抗场景下，现有的干扰决策方法严重依赖专家知识库和精确的数学模型，容易失效，给实现智能自适应干扰带来了挑战。此外，由于参数空间的多维性和连续性，这些方法往往收敛缓慢。为了解决这些问题，我们提出了一种基于优先辅助软行为者评价（SAC）的ISRJ方法。在我们的方法中，我们首先将整个渗透过程中的ISRJ决策建模为马尔可夫决策过程（MDP），并为合作和非合作场景设计了不同的奖励函数。然后，我们利用SAC算法在未知的动态环境中学习最优策略，同时有效地管理大型状态-动作空间。为了加快收敛速度，同时有效避免潜在的局部最优，我们提出了优先级辅助SAC算法来解决上述MDP。与经典的SAC算法相比，该算法集成了专家先验信息，有助于智能体更有效地进行探索，从而提高了策略学习的效率和质量。大量的仿真结果证实了该方法的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Priori-assisted soft actor–critic based interrupted sampling repeater jamming method

Interrupted sampling repeater jamming (ISRJ) is an electronic countermeasure widely used in airborne self-defense jamming equipment, with its performance heavily dependent on the jamming parameters. In non-cooperative and dynamic countermeasure scenarios, existing jamming decision-making methods that rely heavily on expert knowledge bases and precise mathematical models are prone to failure, making it challenging to achieve intelligent and adaptive jamming. Furthermore, due to the multi-dimensional and continuous nature of parameter spaces, these methods tend to converge slowly. To address these issues, we propose a priori-assisted soft actor–critic (SAC) based ISRJ method. In our method, we first model the ISRJ decisions throughout the entire penetration process as a Markov decision process (MDP) and design distinct reward functions for both cooperative and non-cooperative scenarios. We then utilize the SAC algorithm to learn the optimal strategy in an unknown, dynamic environment while efficiently managing large state–action spaces. To accelerate convergence while effectively avoiding potential local optima, we propose the priori-assisted SAC algorithm to solve the above MDP. Compared to the classical SAC, the proposed algorithm integrates expert prior information, which assists the agent in exploring more effectively, thereby improving both the efficiency and quality of policy learning. Extensive simulation results confirm the superiority of the proposed method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Signal Processing 工程技术-工程：电子与电气

CiteScore

9.20

自引率

9.10%

发文量

309

审稿时长

41 days

期刊介绍： Signal Processing incorporates all aspects of the theory and practice of signal processing. It features original research work, tutorial and review articles, and accounts of practical developments. It is intended for a rapid dissemination of knowledge and experience to engineers and scientists working in the research, development or practical application of signal processing. Subject areas covered by the journal include: Signal Theory; Stochastic Processes; Detection and Estimation; Spectral Analysis; Filtering; Signal Processing Systems; Software Developments; Image Processing; Pattern Recognition; Optical Signal Processing; Digital Signal Processing; Multi-dimensional Signal Processing; Communication Signal Processing; Biomedical Signal Processing; Geophysical and Astrophysical Signal Processing; Earth Resources Signal Processing; Acoustic and Vibration Signal Processing; Data Processing; Remote Sensing; Signal Processing Technology; Radar Signal Processing; Sonar Signal Processing; Industrial Applications; New Applications.