Yun Zhang , Shixun You , Yunbin Yan , Qiaofeng Ou , Xijun Gao , Fangqing Jiang
{"title":"A transfer learning model for cognitive electronic reconnaissance of unmanned aerial vehicle: Experiments","authors":"Yun Zhang , Shixun You , Yunbin Yan , Qiaofeng Ou , Xijun Gao , Fangqing Jiang","doi":"10.1016/j.engappai.2024.109158","DOIUrl":null,"url":null,"abstract":"<div><p>Applying Deep Reinforcement Learning (DRL) technologies to Unmanned Aerial Vehicle (UAV) electronic reconnaissance is one of the current research hotspots. However, simulation and engineering practice show that due to poor generalization of DRL models, the performance of Cognitive Electronic Reconnaissance (CER) policies based on training will significantly decrease when the mission scene undergoes slight changes. To address this issue, we thoughtfully combine the mission area segmentation technique with transfer DRL and propose a difference-adaptive transfer DRL algorithm. This algorithm involves mission subarea segmentation, subarea pre-training, multi-subarea policy transfer, and multi-subarea splicing, providing an efficient solution to the convergence problem of the DRL algorithm caused by mission space expansion and reward sparsity. Additionally, a general CER transfer learning simulator is developed based on the analysis of the capabilities of the maneuvering platform and electronic reconnaissance payload. Multiple sets of CER policy transfer learning experiments are designed for different mission spaces, mission difficulties, and UAV characteristics. Compared with the algorithm baseline, our designed policy model significantly outperforms: the mission completion rate of UAVs in multi-scale mission spaces improves by up to 37.4%, reaching 97.5%, while the training time is reduced by 2.46 h. Further behavior analysis shows that this policy model enables UAVs to exhibit target tracking behaviors such as hovering and sustained approach.</p></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"137 ","pages":"Article 109158"},"PeriodicalIF":7.5000,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624013162","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Applying Deep Reinforcement Learning (DRL) technologies to Unmanned Aerial Vehicle (UAV) electronic reconnaissance is one of the current research hotspots. However, simulation and engineering practice show that due to poor generalization of DRL models, the performance of Cognitive Electronic Reconnaissance (CER) policies based on training will significantly decrease when the mission scene undergoes slight changes. To address this issue, we thoughtfully combine the mission area segmentation technique with transfer DRL and propose a difference-adaptive transfer DRL algorithm. This algorithm involves mission subarea segmentation, subarea pre-training, multi-subarea policy transfer, and multi-subarea splicing, providing an efficient solution to the convergence problem of the DRL algorithm caused by mission space expansion and reward sparsity. Additionally, a general CER transfer learning simulator is developed based on the analysis of the capabilities of the maneuvering platform and electronic reconnaissance payload. Multiple sets of CER policy transfer learning experiments are designed for different mission spaces, mission difficulties, and UAV characteristics. Compared with the algorithm baseline, our designed policy model significantly outperforms: the mission completion rate of UAVs in multi-scale mission spaces improves by up to 37.4%, reaching 97.5%, while the training time is reduced by 2.46 h. Further behavior analysis shows that this policy model enables UAVs to exhibit target tracking behaviors such as hovering and sustained approach.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.