Jiaqi Tan , Tianpeng Liu , Weidong Jiang , Dewang Wang , Zhen Liu , Zhongguo Wu
{"title":"基于优先辅助软评价的中断采样中继器干扰方法","authors":"Jiaqi Tan , Tianpeng Liu , Weidong Jiang , Dewang Wang , Zhen Liu , Zhongguo Wu","doi":"10.1016/j.sigpro.2025.110259","DOIUrl":null,"url":null,"abstract":"<div><div>Interrupted sampling repeater jamming (ISRJ) is an electronic countermeasure widely used in airborne self-defense jamming equipment, with its performance heavily dependent on the jamming parameters. In non-cooperative and dynamic countermeasure scenarios, existing jamming decision-making methods that rely heavily on expert knowledge bases and precise mathematical models are prone to failure, making it challenging to achieve intelligent and adaptive jamming. Furthermore, due to the multi-dimensional and continuous nature of parameter spaces, these methods tend to converge slowly. To address these issues, we propose a priori-assisted soft actor–critic (SAC) based ISRJ method. In our method, we first model the ISRJ decisions throughout the entire penetration process as a Markov decision process (MDP) and design distinct reward functions for both cooperative and non-cooperative scenarios. We then utilize the SAC algorithm to learn the optimal strategy in an unknown, dynamic environment while efficiently managing large state–action spaces. To accelerate convergence while effectively avoiding potential local optima, we propose the priori-assisted SAC algorithm to solve the above MDP. Compared to the classical SAC, the proposed algorithm integrates expert prior information, which assists the agent in exploring more effectively, thereby improving both the efficiency and quality of policy learning. Extensive simulation results confirm the superiority of the proposed method.</div></div>","PeriodicalId":49523,"journal":{"name":"Signal Processing","volume":"239 ","pages":"Article 110259"},"PeriodicalIF":3.6000,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Priori-assisted soft actor–critic based interrupted sampling repeater jamming method\",\"authors\":\"Jiaqi Tan , Tianpeng Liu , Weidong Jiang , Dewang Wang , Zhen Liu , Zhongguo Wu\",\"doi\":\"10.1016/j.sigpro.2025.110259\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Interrupted sampling repeater jamming (ISRJ) is an electronic countermeasure widely used in airborne self-defense jamming equipment, with its performance heavily dependent on the jamming parameters. In non-cooperative and dynamic countermeasure scenarios, existing jamming decision-making methods that rely heavily on expert knowledge bases and precise mathematical models are prone to failure, making it challenging to achieve intelligent and adaptive jamming. Furthermore, due to the multi-dimensional and continuous nature of parameter spaces, these methods tend to converge slowly. To address these issues, we propose a priori-assisted soft actor–critic (SAC) based ISRJ method. In our method, we first model the ISRJ decisions throughout the entire penetration process as a Markov decision process (MDP) and design distinct reward functions for both cooperative and non-cooperative scenarios. We then utilize the SAC algorithm to learn the optimal strategy in an unknown, dynamic environment while efficiently managing large state–action spaces. To accelerate convergence while effectively avoiding potential local optima, we propose the priori-assisted SAC algorithm to solve the above MDP. Compared to the classical SAC, the proposed algorithm integrates expert prior information, which assists the agent in exploring more effectively, thereby improving both the efficiency and quality of policy learning. Extensive simulation results confirm the superiority of the proposed method.</div></div>\",\"PeriodicalId\":49523,\"journal\":{\"name\":\"Signal Processing\",\"volume\":\"239 \",\"pages\":\"Article 110259\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2025-08-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0165168425003731\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165168425003731","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Priori-assisted soft actor–critic based interrupted sampling repeater jamming method
Interrupted sampling repeater jamming (ISRJ) is an electronic countermeasure widely used in airborne self-defense jamming equipment, with its performance heavily dependent on the jamming parameters. In non-cooperative and dynamic countermeasure scenarios, existing jamming decision-making methods that rely heavily on expert knowledge bases and precise mathematical models are prone to failure, making it challenging to achieve intelligent and adaptive jamming. Furthermore, due to the multi-dimensional and continuous nature of parameter spaces, these methods tend to converge slowly. To address these issues, we propose a priori-assisted soft actor–critic (SAC) based ISRJ method. In our method, we first model the ISRJ decisions throughout the entire penetration process as a Markov decision process (MDP) and design distinct reward functions for both cooperative and non-cooperative scenarios. We then utilize the SAC algorithm to learn the optimal strategy in an unknown, dynamic environment while efficiently managing large state–action spaces. To accelerate convergence while effectively avoiding potential local optima, we propose the priori-assisted SAC algorithm to solve the above MDP. Compared to the classical SAC, the proposed algorithm integrates expert prior information, which assists the agent in exploring more effectively, thereby improving both the efficiency and quality of policy learning. Extensive simulation results confirm the superiority of the proposed method.
期刊介绍:
Signal Processing incorporates all aspects of the theory and practice of signal processing. It features original research work, tutorial and review articles, and accounts of practical developments. It is intended for a rapid dissemination of knowledge and experience to engineers and scientists working in the research, development or practical application of signal processing.
Subject areas covered by the journal include: Signal Theory; Stochastic Processes; Detection and Estimation; Spectral Analysis; Filtering; Signal Processing Systems; Software Developments; Image Processing; Pattern Recognition; Optical Signal Processing; Digital Signal Processing; Multi-dimensional Signal Processing; Communication Signal Processing; Biomedical Signal Processing; Geophysical and Astrophysical Signal Processing; Earth Resources Signal Processing; Acoustic and Vibration Signal Processing; Data Processing; Remote Sensing; Signal Processing Technology; Radar Signal Processing; Sonar Signal Processing; Industrial Applications; New Applications.