基于GMA-TD3算法的类人质量注意力分配自主避碰决策方法

IF 4.6 2区工程技术 Q1 ENGINEERING, CIVIL

Ocean Engineering Pub Date : 2025-04-17 DOI:10.1016/j.oceaneng.2025.121118

Wuyue Rong , Jian Zheng , Yang Chen , Yang Liu , Zekun Zhang

{"title":"基于GMA-TD3算法的类人质量注意力分配自主避碰决策方法","authors":"Wuyue Rong , Jian Zheng , Yang Chen , Yang Liu , Zekun Zhang","doi":"10.1016/j.oceaneng.2025.121118","DOIUrl":null,"url":null,"abstract":"<div><div>Ensuring the high-quality operation of an autonomous collision avoidance decision-making (CADM) system for Maritime Autonomous Surface Ships (MASSs) is essential for optimizing navigation safety. However, a gap remains in addressing the sequential collision avoidance problem in multi-ship encounter scenarios, which continues to present challenges for operations. To tackle the challenge, this paper proposes an autonomous CADM method based on Gated Recurrent Unit-enhanced Multi-head Attention Twin Delayed Deep Deterministic Policy Gradient (GMA-TD3) algorithm. The CADM framework consists of two main modules, a collision risk assessment module, powered by the Gated Recurrent Unit-enhanced Multi-head attention mechanism (GMA) mechanism to obtain the priority determination of obstacles based on the identified collision risk, and a motion decision module, driven by the GMA-TD3 algorithm to generate sequential collision avoidance decision with human-like attention distribution. Besides, a dual-level reward mechanism was incorporated to balance long-term goal orientation and immediate dynamic behavior. Comparative experiments show that the GMA-TD3 algorithm achieves the fastest convergence for the targeted problem and generates the shortest and smoothest trajectories. Simulation results further confirm that the proposed system accurately identifies the highest-risk obstacles before making decisions, ensuring timely and precise collision avoidance within a safe distance while fully adhering to COLREGs.</div></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"330 ","pages":"Article 121118"},"PeriodicalIF":4.6000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Autonomous collision avoidance decision-making method with human-like attention distribution for MASSs based on GMA-TD3 algorithm\",\"authors\":\"Wuyue Rong , Jian Zheng , Yang Chen , Yang Liu , Zekun Zhang\",\"doi\":\"10.1016/j.oceaneng.2025.121118\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Ensuring the high-quality operation of an autonomous collision avoidance decision-making (CADM) system for Maritime Autonomous Surface Ships (MASSs) is essential for optimizing navigation safety. However, a gap remains in addressing the sequential collision avoidance problem in multi-ship encounter scenarios, which continues to present challenges for operations. To tackle the challenge, this paper proposes an autonomous CADM method based on Gated Recurrent Unit-enhanced Multi-head Attention Twin Delayed Deep Deterministic Policy Gradient (GMA-TD3) algorithm. The CADM framework consists of two main modules, a collision risk assessment module, powered by the Gated Recurrent Unit-enhanced Multi-head attention mechanism (GMA) mechanism to obtain the priority determination of obstacles based on the identified collision risk, and a motion decision module, driven by the GMA-TD3 algorithm to generate sequential collision avoidance decision with human-like attention distribution. Besides, a dual-level reward mechanism was incorporated to balance long-term goal orientation and immediate dynamic behavior. Comparative experiments show that the GMA-TD3 algorithm achieves the fastest convergence for the targeted problem and generates the shortest and smoothest trajectories. Simulation results further confirm that the proposed system accurately identifies the highest-risk obstacles before making decisions, ensuring timely and precise collision avoidance within a safe distance while fully adhering to COLREGs.</div></div>\",\"PeriodicalId\":19403,\"journal\":{\"name\":\"Ocean Engineering\",\"volume\":\"330 \",\"pages\":\"Article 121118\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ocean Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0029801825008315\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801825008315","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}

引用次数: 0

摘要

确保自主避碰决策（CADM）系统的高质量运行是优化海上自主水面舰艇航行安全的关键。然而，在解决多船相遇场景下的顺序避碰问题方面仍然存在差距，这将继续给操作带来挑战。为了解决这一挑战，本文提出了一种基于门控循环单元增强多头注意双延迟深度确定性策略梯度（GMA-TD3）算法的自主CADM方法。CADM框架包括两个主要模块，一个是碰撞风险评估模块，由门控循环单元增强型人头注意机制（GMA）驱动，基于识别的碰撞风险获得障碍物优先级的确定；一个是运动决策模块，由GMA- td3算法驱动，生成具有类人注意力分布的顺序避碰决策。此外，还引入了双重奖励机制来平衡长期目标导向和即时动态行为。对比实验表明，GMA-TD3算法对目标问题的收敛速度最快，生成的轨迹最短、最平滑。仿真结果进一步证实，该系统在做出决策前能够准确识别出风险最高的障碍物，确保在安全距离内及时、精确地避免碰撞，同时完全遵守COLREGs。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Autonomous collision avoidance decision-making method with human-like attention distribution for MASSs based on GMA-TD3 algorithm

Ensuring the high-quality operation of an autonomous collision avoidance decision-making (CADM) system for Maritime Autonomous Surface Ships (MASSs) is essential for optimizing navigation safety. However, a gap remains in addressing the sequential collision avoidance problem in multi-ship encounter scenarios, which continues to present challenges for operations. To tackle the challenge, this paper proposes an autonomous CADM method based on Gated Recurrent Unit-enhanced Multi-head Attention Twin Delayed Deep Deterministic Policy Gradient (GMA-TD3) algorithm. The CADM framework consists of two main modules, a collision risk assessment module, powered by the Gated Recurrent Unit-enhanced Multi-head attention mechanism (GMA) mechanism to obtain the priority determination of obstacles based on the identified collision risk, and a motion decision module, driven by the GMA-TD3 algorithm to generate sequential collision avoidance decision with human-like attention distribution. Besides, a dual-level reward mechanism was incorporated to balance long-term goal orientation and immediate dynamic behavior. Comparative experiments show that the GMA-TD3 algorithm achieves the fastest convergence for the targeted problem and generates the shortest and smoothest trajectories. Simulation results further confirm that the proposed system accurately identifies the highest-risk obstacles before making decisions, ensuring timely and precise collision avoidance within a safe distance while fully adhering to COLREGs.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Ocean Engineering 工程技术-工程：大洋

CiteScore

7.30

自引率

34.00%

发文量

2379

审稿时长

8.1 months

期刊介绍： Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.