通过多代理深度学习方法为 USV 和 UAV 集群进行基于分布式信息融合的轨迹跟踪

Q3 Earth and Planetary Sciences

Aerospace Systems Pub Date : 2024-02-23 DOI:10.1007/s42401-024-00275-4

Hongzhi Wu, Miao wang, Jingshi Wang, Guoqing wang

{"title":"通过多代理深度学习方法为 USV 和 UAV 集群进行基于分布式信息融合的轨迹跟踪","authors":"Hongzhi Wu, Miao wang, Jingshi Wang, Guoqing wang","doi":"10.1007/s42401-024-00275-4","DOIUrl":null,"url":null,"abstract":"<div><p>Considering the complexities of the modern maritime operational environment and aiming for effective safe navigation and communication maintenance, research into the collaborative trajectory tracking problem of unmanned surface vehicles (USVs) and unmanned aerial vehicles (UAVs) clusters during patrol and target tracking missions holds paramount significance. This paper proposes a multi-agent deep reinforcement learning (MADRL) approach, specifically the action-constrained multi-agent deep deterministic policy gradient (MADDPG), to efficiently solve the collaborative maritime-aerial distributed information fusion-based trajectory tracking problem. The proposed approach incorporates a constraint model based on the characteristics of maritime-aerial distributed information fusion mode and two designed reward functions—one global for target tracking and one local for cross-domain collaborative unmanned clusters. Simulation experiments under three different mission scenarios have been conducted, and results demonstrate that the proposed approach possesses excellent applicability to trajectory tracking tasks in collaborative maritime-aerial settings, exhibiting strong convergence and robustness in mobile target tracking. In a complex three-dimensional simulation environment, the improved algorithm demonstrated an 11.04% reduction in training time for convergence and an 8.03% increase in reward values compared to the original algorithm. This indicates that the introduction of attention mechanisms and the design of reward functions enable the algorithm to learn optimal strategies more quickly and effectively.</p></div>","PeriodicalId":36309,"journal":{"name":"Aerospace Systems","volume":"7 2","pages":"193 - 207"},"PeriodicalIF":0.0000,"publicationDate":"2024-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Distributed information fusion based trajectory tracking for USV and UAV clusters via multi-agent deep learning approach\",\"authors\":\"Hongzhi Wu, Miao wang, Jingshi Wang, Guoqing wang\",\"doi\":\"10.1007/s42401-024-00275-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Considering the complexities of the modern maritime operational environment and aiming for effective safe navigation and communication maintenance, research into the collaborative trajectory tracking problem of unmanned surface vehicles (USVs) and unmanned aerial vehicles (UAVs) clusters during patrol and target tracking missions holds paramount significance. This paper proposes a multi-agent deep reinforcement learning (MADRL) approach, specifically the action-constrained multi-agent deep deterministic policy gradient (MADDPG), to efficiently solve the collaborative maritime-aerial distributed information fusion-based trajectory tracking problem. The proposed approach incorporates a constraint model based on the characteristics of maritime-aerial distributed information fusion mode and two designed reward functions—one global for target tracking and one local for cross-domain collaborative unmanned clusters. Simulation experiments under three different mission scenarios have been conducted, and results demonstrate that the proposed approach possesses excellent applicability to trajectory tracking tasks in collaborative maritime-aerial settings, exhibiting strong convergence and robustness in mobile target tracking. In a complex three-dimensional simulation environment, the improved algorithm demonstrated an 11.04% reduction in training time for convergence and an 8.03% increase in reward values compared to the original algorithm. This indicates that the introduction of attention mechanisms and the design of reward functions enable the algorithm to learn optimal strategies more quickly and effectively.</p></div>\",\"PeriodicalId\":36309,\"journal\":{\"name\":\"Aerospace Systems\",\"volume\":\"7 2\",\"pages\":\"193 - 207\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Aerospace Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s42401-024-00275-4\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Earth and Planetary Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aerospace Systems","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s42401-024-00275-4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Earth and Planetary Sciences","Score":null,"Total":0}

引用次数: 0

摘要

考虑到现代海上作战环境的复杂性，并以有效的安全导航和通信维护为目标，研究无人水面飞行器（USV）和无人机（UAV）集群在巡逻和目标跟踪任务中的协同轨迹跟踪问题具有极其重要的意义。本文提出了一种多代理深度强化学习（MADRL）方法，特别是行动约束多代理深度确定性策略梯度（MADDPG），以高效解决基于分布式信息融合的海空协同轨迹跟踪问题。所提出的方法结合了基于海空分布式信息融合模式特征的约束模型和两个设计的奖励函数--一个用于目标跟踪的全局奖励函数和一个用于跨域协作无人集群的局部奖励函数。在三种不同的任务场景下进行了仿真实验，结果表明所提出的方法非常适用于海空协同环境下的轨迹跟踪任务，在移动目标跟踪方面表现出很强的收敛性和鲁棒性。在复杂的三维模拟环境中，改进算法与原始算法相比，收敛训练时间缩短了 11.04%，奖励值增加了 8.03%。这表明，注意力机制的引入和奖励函数的设计使算法能够更快、更有效地学习最优策略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Distributed information fusion based trajectory tracking for USV and UAV clusters via multi-agent deep learning approach

Considering the complexities of the modern maritime operational environment and aiming for effective safe navigation and communication maintenance, research into the collaborative trajectory tracking problem of unmanned surface vehicles (USVs) and unmanned aerial vehicles (UAVs) clusters during patrol and target tracking missions holds paramount significance. This paper proposes a multi-agent deep reinforcement learning (MADRL) approach, specifically the action-constrained multi-agent deep deterministic policy gradient (MADDPG), to efficiently solve the collaborative maritime-aerial distributed information fusion-based trajectory tracking problem. The proposed approach incorporates a constraint model based on the characteristics of maritime-aerial distributed information fusion mode and two designed reward functions—one global for target tracking and one local for cross-domain collaborative unmanned clusters. Simulation experiments under three different mission scenarios have been conducted, and results demonstrate that the proposed approach possesses excellent applicability to trajectory tracking tasks in collaborative maritime-aerial settings, exhibiting strong convergence and robustness in mobile target tracking. In a complex three-dimensional simulation environment, the improved algorithm demonstrated an 11.04% reduction in training time for convergence and an 8.03% increase in reward values compared to the original algorithm. This indicates that the introduction of attention mechanisms and the design of reward functions enable the algorithm to learn optimal strategies more quickly and effectively.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Aerospace Systems Social Sciences-Social Sciences (miscellaneous)

CiteScore

1.80

自引率

0.00%

发文量

期刊介绍： Aerospace Systems provides an international, peer-reviewed forum which focuses on system-level research and development regarding aeronautics and astronautics. The journal emphasizes the unique role and increasing importance of informatics on aerospace. It fills a gap in current publishing coverage from outer space vehicles to atmospheric vehicles by highlighting interdisciplinary science, technology and engineering. Potential topics include, but are not limited to: Trans-space vehicle systems design and integration Air vehicle systems Space vehicle systems Near-space vehicle systems Aerospace robotics and unmanned system Communication, navigation and surveillance Aerodynamics and aircraft design Dynamics and control Aerospace propulsion Avionics system Opto-electronic system Air traffic management Earth observation Deep space exploration Bionic micro-aircraft/spacecraft Intelligent sensing and Information fusion