Zhenhong Fan , Defeng Wu , Yuqin Li , Zheng You , Shangkun Zhong
{"title":"基于记忆的深度强化学习在环境知识有限的无人驾驶车辆中进行COLREGs-compliant避障","authors":"Zhenhong Fan , Defeng Wu , Yuqin Li , Zheng You , Shangkun Zhong","doi":"10.1016/j.oceaneng.2025.121978","DOIUrl":null,"url":null,"abstract":"<div><div>The deployment of collision avoidance algorithms for unmanned surface vehicles is often limited by their capabilities to perceive dynamic and unpredictable marine environments. Influenced by human memory mechanisms, a memory-based deep reinforcement learning (MDRL) algorithm is proposed in this study for autonomous collision avoidance given limited environmental knowledge. A memory space is established to archive historical navigation data, and gated recurrent units are used to integrate these data into short-term memory for network decision-making. Consequently, the algorithm substantially facilitates the optimization of short-term memory and immediate decision-making, further compensating for the deficiencies of immediate perceptual data, enabling precise evaluation of encounter scenarios and the development of avoidance strategies compliant with the Convention on the International Regulations for Preventing Collisions at Sea (COLREGs). Experimental results demonstrate that the MDRL algorithm significantly enhances the collision avoidance capabilities of USV with limited environmental knowledge while ensuring COLREGs compliance.</div></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"338 ","pages":"Article 121978"},"PeriodicalIF":5.5000,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Memory-based deep reinforcement learning for COLREGs-compliant obstacle avoidance in USV with limited environmental knowledge\",\"authors\":\"Zhenhong Fan , Defeng Wu , Yuqin Li , Zheng You , Shangkun Zhong\",\"doi\":\"10.1016/j.oceaneng.2025.121978\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The deployment of collision avoidance algorithms for unmanned surface vehicles is often limited by their capabilities to perceive dynamic and unpredictable marine environments. Influenced by human memory mechanisms, a memory-based deep reinforcement learning (MDRL) algorithm is proposed in this study for autonomous collision avoidance given limited environmental knowledge. A memory space is established to archive historical navigation data, and gated recurrent units are used to integrate these data into short-term memory for network decision-making. Consequently, the algorithm substantially facilitates the optimization of short-term memory and immediate decision-making, further compensating for the deficiencies of immediate perceptual data, enabling precise evaluation of encounter scenarios and the development of avoidance strategies compliant with the Convention on the International Regulations for Preventing Collisions at Sea (COLREGs). Experimental results demonstrate that the MDRL algorithm significantly enhances the collision avoidance capabilities of USV with limited environmental knowledge while ensuring COLREGs compliance.</div></div>\",\"PeriodicalId\":19403,\"journal\":{\"name\":\"Ocean Engineering\",\"volume\":\"338 \",\"pages\":\"Article 121978\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-06-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ocean Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0029801825016841\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801825016841","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
Memory-based deep reinforcement learning for COLREGs-compliant obstacle avoidance in USV with limited environmental knowledge
The deployment of collision avoidance algorithms for unmanned surface vehicles is often limited by their capabilities to perceive dynamic and unpredictable marine environments. Influenced by human memory mechanisms, a memory-based deep reinforcement learning (MDRL) algorithm is proposed in this study for autonomous collision avoidance given limited environmental knowledge. A memory space is established to archive historical navigation data, and gated recurrent units are used to integrate these data into short-term memory for network decision-making. Consequently, the algorithm substantially facilitates the optimization of short-term memory and immediate decision-making, further compensating for the deficiencies of immediate perceptual data, enabling precise evaluation of encounter scenarios and the development of avoidance strategies compliant with the Convention on the International Regulations for Preventing Collisions at Sea (COLREGs). Experimental results demonstrate that the MDRL algorithm significantly enhances the collision avoidance capabilities of USV with limited environmental knowledge while ensuring COLREGs compliance.
期刊介绍:
Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.