Zhenhong Fan , Defeng Wu , Yuqin Li , Zheng You , Shangkun Zhong
{"title":"Memory-based deep reinforcement learning for COLREGs-compliant obstacle avoidance in USV with limited environmental knowledge","authors":"Zhenhong Fan , Defeng Wu , Yuqin Li , Zheng You , Shangkun Zhong","doi":"10.1016/j.oceaneng.2025.121978","DOIUrl":null,"url":null,"abstract":"<div><div>The deployment of collision avoidance algorithms for unmanned surface vehicles is often limited by their capabilities to perceive dynamic and unpredictable marine environments. Influenced by human memory mechanisms, a memory-based deep reinforcement learning (MDRL) algorithm is proposed in this study for autonomous collision avoidance given limited environmental knowledge. A memory space is established to archive historical navigation data, and gated recurrent units are used to integrate these data into short-term memory for network decision-making. Consequently, the algorithm substantially facilitates the optimization of short-term memory and immediate decision-making, further compensating for the deficiencies of immediate perceptual data, enabling precise evaluation of encounter scenarios and the development of avoidance strategies compliant with the Convention on the International Regulations for Preventing Collisions at Sea (COLREGs). Experimental results demonstrate that the MDRL algorithm significantly enhances the collision avoidance capabilities of USV with limited environmental knowledge while ensuring COLREGs compliance.</div></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"338 ","pages":"Article 121978"},"PeriodicalIF":5.5000,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801825016841","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
The deployment of collision avoidance algorithms for unmanned surface vehicles is often limited by their capabilities to perceive dynamic and unpredictable marine environments. Influenced by human memory mechanisms, a memory-based deep reinforcement learning (MDRL) algorithm is proposed in this study for autonomous collision avoidance given limited environmental knowledge. A memory space is established to archive historical navigation data, and gated recurrent units are used to integrate these data into short-term memory for network decision-making. Consequently, the algorithm substantially facilitates the optimization of short-term memory and immediate decision-making, further compensating for the deficiencies of immediate perceptual data, enabling precise evaluation of encounter scenarios and the development of avoidance strategies compliant with the Convention on the International Regulations for Preventing Collisions at Sea (COLREGs). Experimental results demonstrate that the MDRL algorithm significantly enhances the collision avoidance capabilities of USV with limited environmental knowledge while ensuring COLREGs compliance.
期刊介绍:
Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.