{"title":"基于深度强化学习的多目标跟踪联合波形自适应与数据辅助干扰抑制","authors":"Yongbing Huang;Rui Guo;Fengning Yu;Yue Zhang;Shiyou Xu;Zengping Chen","doi":"10.1109/TIM.2025.3577834","DOIUrl":null,"url":null,"abstract":"To enhance the adaptive capabilities of radar, this work focuses on the problem of joint waveform adaptation and data-assisted jamming suppression in multiple target tracking scenarios with high-power sidelobe jamming. Utilizing deep reinforcement learning (DRL) methods, we develop agents capable of decision-making based on the environmental perception information, thus enabling beamforming for jamming suppression and enhancing tracking performance. Jamming suppression faces challenges due to the computational complexity of beamforming and the large sample size of jamming. Furthermore, the multidimensionality of waveform parameters and the fluctuating states of targets pose additional challenges. To address these challenges, we employ deep reinforcement learning for jamming number perception, leverage convolutional neural networks (CNNs) to extract features, and incorporate the self-attention mechanism for jamming number decision-making. For waveform adaptation, we segment each parameter into a distinct decision space, with distinct agents making decisions using value-decomposition networks (VDNs). These agents are augmented with gated recurrent units (GRUs) to manage diverse target states and maintain historical data. The focus on tracking robustness and accuracy can be tailored by designing the reward function weights. A separate training approach is developed to obtain effective joint policies. To avoid signal-to-noise ratio (SNR) constraints and enhance applicability to real radar systems, we develop an intelligent radar tracking simulation at the signal level and evaluate the performance of the proposed method. Simulation results demonstrate that the receiver agent can balance the perception of jamming numbers and real-time capabilities to assist in jamming suppression. Furthermore, compared with criterion-based methods and an entropy reward-based method, the transmitter agent can enhance tracking robustness and accuracy by adapting waveform parameters. Moreover, field experiments are conducted to verify that the proposed DRL method can improve the actual multitarget tracking performance through waveform adaptation.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-19"},"PeriodicalIF":5.6000,"publicationDate":"2025-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Joint Waveform Adaptation and Data-Assisted Jamming Suppression for Multiple Target Tracking Based on Deep Reinforcement Learning\",\"authors\":\"Yongbing Huang;Rui Guo;Fengning Yu;Yue Zhang;Shiyou Xu;Zengping Chen\",\"doi\":\"10.1109/TIM.2025.3577834\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To enhance the adaptive capabilities of radar, this work focuses on the problem of joint waveform adaptation and data-assisted jamming suppression in multiple target tracking scenarios with high-power sidelobe jamming. Utilizing deep reinforcement learning (DRL) methods, we develop agents capable of decision-making based on the environmental perception information, thus enabling beamforming for jamming suppression and enhancing tracking performance. Jamming suppression faces challenges due to the computational complexity of beamforming and the large sample size of jamming. Furthermore, the multidimensionality of waveform parameters and the fluctuating states of targets pose additional challenges. To address these challenges, we employ deep reinforcement learning for jamming number perception, leverage convolutional neural networks (CNNs) to extract features, and incorporate the self-attention mechanism for jamming number decision-making. For waveform adaptation, we segment each parameter into a distinct decision space, with distinct agents making decisions using value-decomposition networks (VDNs). These agents are augmented with gated recurrent units (GRUs) to manage diverse target states and maintain historical data. The focus on tracking robustness and accuracy can be tailored by designing the reward function weights. A separate training approach is developed to obtain effective joint policies. To avoid signal-to-noise ratio (SNR) constraints and enhance applicability to real radar systems, we develop an intelligent radar tracking simulation at the signal level and evaluate the performance of the proposed method. Simulation results demonstrate that the receiver agent can balance the perception of jamming numbers and real-time capabilities to assist in jamming suppression. Furthermore, compared with criterion-based methods and an entropy reward-based method, the transmitter agent can enhance tracking robustness and accuracy by adapting waveform parameters. Moreover, field experiments are conducted to verify that the proposed DRL method can improve the actual multitarget tracking performance through waveform adaptation.\",\"PeriodicalId\":13341,\"journal\":{\"name\":\"IEEE Transactions on Instrumentation and Measurement\",\"volume\":\"74 \",\"pages\":\"1-19\"},\"PeriodicalIF\":5.6000,\"publicationDate\":\"2025-06-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Instrumentation and Measurement\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11028087/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11028087/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Joint Waveform Adaptation and Data-Assisted Jamming Suppression for Multiple Target Tracking Based on Deep Reinforcement Learning
To enhance the adaptive capabilities of radar, this work focuses on the problem of joint waveform adaptation and data-assisted jamming suppression in multiple target tracking scenarios with high-power sidelobe jamming. Utilizing deep reinforcement learning (DRL) methods, we develop agents capable of decision-making based on the environmental perception information, thus enabling beamforming for jamming suppression and enhancing tracking performance. Jamming suppression faces challenges due to the computational complexity of beamforming and the large sample size of jamming. Furthermore, the multidimensionality of waveform parameters and the fluctuating states of targets pose additional challenges. To address these challenges, we employ deep reinforcement learning for jamming number perception, leverage convolutional neural networks (CNNs) to extract features, and incorporate the self-attention mechanism for jamming number decision-making. For waveform adaptation, we segment each parameter into a distinct decision space, with distinct agents making decisions using value-decomposition networks (VDNs). These agents are augmented with gated recurrent units (GRUs) to manage diverse target states and maintain historical data. The focus on tracking robustness and accuracy can be tailored by designing the reward function weights. A separate training approach is developed to obtain effective joint policies. To avoid signal-to-noise ratio (SNR) constraints and enhance applicability to real radar systems, we develop an intelligent radar tracking simulation at the signal level and evaluate the performance of the proposed method. Simulation results demonstrate that the receiver agent can balance the perception of jamming numbers and real-time capabilities to assist in jamming suppression. Furthermore, compared with criterion-based methods and an entropy reward-based method, the transmitter agent can enhance tracking robustness and accuracy by adapting waveform parameters. Moreover, field experiments are conducted to verify that the proposed DRL method can improve the actual multitarget tracking performance through waveform adaptation.
期刊介绍:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.