基于GMA-TD3算法的类人质量注意力分配自主避碰决策方法

IF 4.6 2区 工程技术 Q1 ENGINEERING, CIVIL
Wuyue Rong , Jian Zheng , Yang Chen , Yang Liu , Zekun Zhang
{"title":"基于GMA-TD3算法的类人质量注意力分配自主避碰决策方法","authors":"Wuyue Rong ,&nbsp;Jian Zheng ,&nbsp;Yang Chen ,&nbsp;Yang Liu ,&nbsp;Zekun Zhang","doi":"10.1016/j.oceaneng.2025.121118","DOIUrl":null,"url":null,"abstract":"<div><div>Ensuring the high-quality operation of an autonomous collision avoidance decision-making (CADM) system for Maritime Autonomous Surface Ships (MASSs) is essential for optimizing navigation safety. However, a gap remains in addressing the sequential collision avoidance problem in multi-ship encounter scenarios, which continues to present challenges for operations. To tackle the challenge, this paper proposes an autonomous CADM method based on Gated Recurrent Unit-enhanced Multi-head Attention Twin Delayed Deep Deterministic Policy Gradient (GMA-TD3) algorithm. The CADM framework consists of two main modules, a collision risk assessment module, powered by the Gated Recurrent Unit-enhanced Multi-head attention mechanism (GMA) mechanism to obtain the priority determination of obstacles based on the identified collision risk, and a motion decision module, driven by the GMA-TD3 algorithm to generate sequential collision avoidance decision with human-like attention distribution. Besides, a dual-level reward mechanism was incorporated to balance long-term goal orientation and immediate dynamic behavior. Comparative experiments show that the GMA-TD3 algorithm achieves the fastest convergence for the targeted problem and generates the shortest and smoothest trajectories. Simulation results further confirm that the proposed system accurately identifies the highest-risk obstacles before making decisions, ensuring timely and precise collision avoidance within a safe distance while fully adhering to COLREGs.</div></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"330 ","pages":"Article 121118"},"PeriodicalIF":4.6000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Autonomous collision avoidance decision-making method with human-like attention distribution for MASSs based on GMA-TD3 algorithm\",\"authors\":\"Wuyue Rong ,&nbsp;Jian Zheng ,&nbsp;Yang Chen ,&nbsp;Yang Liu ,&nbsp;Zekun Zhang\",\"doi\":\"10.1016/j.oceaneng.2025.121118\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Ensuring the high-quality operation of an autonomous collision avoidance decision-making (CADM) system for Maritime Autonomous Surface Ships (MASSs) is essential for optimizing navigation safety. However, a gap remains in addressing the sequential collision avoidance problem in multi-ship encounter scenarios, which continues to present challenges for operations. To tackle the challenge, this paper proposes an autonomous CADM method based on Gated Recurrent Unit-enhanced Multi-head Attention Twin Delayed Deep Deterministic Policy Gradient (GMA-TD3) algorithm. The CADM framework consists of two main modules, a collision risk assessment module, powered by the Gated Recurrent Unit-enhanced Multi-head attention mechanism (GMA) mechanism to obtain the priority determination of obstacles based on the identified collision risk, and a motion decision module, driven by the GMA-TD3 algorithm to generate sequential collision avoidance decision with human-like attention distribution. Besides, a dual-level reward mechanism was incorporated to balance long-term goal orientation and immediate dynamic behavior. Comparative experiments show that the GMA-TD3 algorithm achieves the fastest convergence for the targeted problem and generates the shortest and smoothest trajectories. Simulation results further confirm that the proposed system accurately identifies the highest-risk obstacles before making decisions, ensuring timely and precise collision avoidance within a safe distance while fully adhering to COLREGs.</div></div>\",\"PeriodicalId\":19403,\"journal\":{\"name\":\"Ocean Engineering\",\"volume\":\"330 \",\"pages\":\"Article 121118\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ocean Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0029801825008315\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801825008315","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

摘要

确保自主避碰决策(CADM)系统的高质量运行是优化海上自主水面舰艇航行安全的关键。然而,在解决多船相遇场景下的顺序避碰问题方面仍然存在差距,这将继续给操作带来挑战。为了解决这一挑战,本文提出了一种基于门控循环单元增强多头注意双延迟深度确定性策略梯度(GMA-TD3)算法的自主CADM方法。CADM框架包括两个主要模块,一个是碰撞风险评估模块,由门控循环单元增强型人头注意机制(GMA)驱动,基于识别的碰撞风险获得障碍物优先级的确定;一个是运动决策模块,由GMA- td3算法驱动,生成具有类人注意力分布的顺序避碰决策。此外,还引入了双重奖励机制来平衡长期目标导向和即时动态行为。对比实验表明,GMA-TD3算法对目标问题的收敛速度最快,生成的轨迹最短、最平滑。仿真结果进一步证实,该系统在做出决策前能够准确识别出风险最高的障碍物,确保在安全距离内及时、精确地避免碰撞,同时完全遵守COLREGs。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Autonomous collision avoidance decision-making method with human-like attention distribution for MASSs based on GMA-TD3 algorithm
Ensuring the high-quality operation of an autonomous collision avoidance decision-making (CADM) system for Maritime Autonomous Surface Ships (MASSs) is essential for optimizing navigation safety. However, a gap remains in addressing the sequential collision avoidance problem in multi-ship encounter scenarios, which continues to present challenges for operations. To tackle the challenge, this paper proposes an autonomous CADM method based on Gated Recurrent Unit-enhanced Multi-head Attention Twin Delayed Deep Deterministic Policy Gradient (GMA-TD3) algorithm. The CADM framework consists of two main modules, a collision risk assessment module, powered by the Gated Recurrent Unit-enhanced Multi-head attention mechanism (GMA) mechanism to obtain the priority determination of obstacles based on the identified collision risk, and a motion decision module, driven by the GMA-TD3 algorithm to generate sequential collision avoidance decision with human-like attention distribution. Besides, a dual-level reward mechanism was incorporated to balance long-term goal orientation and immediate dynamic behavior. Comparative experiments show that the GMA-TD3 algorithm achieves the fastest convergence for the targeted problem and generates the shortest and smoothest trajectories. Simulation results further confirm that the proposed system accurately identifies the highest-risk obstacles before making decisions, ensuring timely and precise collision avoidance within a safe distance while fully adhering to COLREGs.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Ocean Engineering
Ocean Engineering 工程技术-工程:大洋
CiteScore
7.30
自引率
34.00%
发文量
2379
审稿时长
8.1 months
期刊介绍: Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信