Yuqin Li , Defeng Wu , Zheng You , Guoquan Chen , Dongjie Wu
{"title":"无人水面车辆避碰的深度强化学习:最新进展","authors":"Yuqin Li , Defeng Wu , Zheng You , Guoquan Chen , Dongjie Wu","doi":"10.1016/j.apor.2025.104778","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, deep reinforcement learning (DRL) has attracted growing attention in the field of collision avoidance for unmanned surface vehicles (USVs). The collision avoidance problem can be naturally formulated as a Markov decision process (MDP), where sequential decision-making is required under dynamic and uncertain maritime conditions. By integrating the perception capabilities of deep learning with the decision-making strengths of reinforcement learning, DRL demonstrates strong adaptability and robustness in solving such MDP-based problems. This review systematically summarizes the frameworks and principles of DRL algorithms for USVs collision avoidance, with emphasis on value-based methods, policy-based methods, and multi-agent DRL methods. Key research topics are systematically reviewed, including reward function design, state-space representation, exploration strategies, integration of safety regulations, robustness to disturbances, environment modeling, and application domains. Challenges in current studies are analyzed, and potential future research directions are proposed to advance DRL methodologies and promote their practical deployment in USVs navigation. This work aims to provide researchers with a consolidated understanding of DRL-based collision avoidance, fostering innovation and accelerating the engineering implementation of autonomous maritime systems.</div></div>","PeriodicalId":8261,"journal":{"name":"Applied Ocean Research","volume":"164 ","pages":"Article 104778"},"PeriodicalIF":4.4000,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep reinforcement learning for collision avoidance in unmanned surface vehicles: State-of-the-art\",\"authors\":\"Yuqin Li , Defeng Wu , Zheng You , Guoquan Chen , Dongjie Wu\",\"doi\":\"10.1016/j.apor.2025.104778\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In recent years, deep reinforcement learning (DRL) has attracted growing attention in the field of collision avoidance for unmanned surface vehicles (USVs). The collision avoidance problem can be naturally formulated as a Markov decision process (MDP), where sequential decision-making is required under dynamic and uncertain maritime conditions. By integrating the perception capabilities of deep learning with the decision-making strengths of reinforcement learning, DRL demonstrates strong adaptability and robustness in solving such MDP-based problems. This review systematically summarizes the frameworks and principles of DRL algorithms for USVs collision avoidance, with emphasis on value-based methods, policy-based methods, and multi-agent DRL methods. Key research topics are systematically reviewed, including reward function design, state-space representation, exploration strategies, integration of safety regulations, robustness to disturbances, environment modeling, and application domains. Challenges in current studies are analyzed, and potential future research directions are proposed to advance DRL methodologies and promote their practical deployment in USVs navigation. This work aims to provide researchers with a consolidated understanding of DRL-based collision avoidance, fostering innovation and accelerating the engineering implementation of autonomous maritime systems.</div></div>\",\"PeriodicalId\":8261,\"journal\":{\"name\":\"Applied Ocean Research\",\"volume\":\"164 \",\"pages\":\"Article 104778\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Ocean Research\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0141118725003645\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, OCEAN\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Ocean Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141118725003645","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, OCEAN","Score":null,"Total":0}
Deep reinforcement learning for collision avoidance in unmanned surface vehicles: State-of-the-art
In recent years, deep reinforcement learning (DRL) has attracted growing attention in the field of collision avoidance for unmanned surface vehicles (USVs). The collision avoidance problem can be naturally formulated as a Markov decision process (MDP), where sequential decision-making is required under dynamic and uncertain maritime conditions. By integrating the perception capabilities of deep learning with the decision-making strengths of reinforcement learning, DRL demonstrates strong adaptability and robustness in solving such MDP-based problems. This review systematically summarizes the frameworks and principles of DRL algorithms for USVs collision avoidance, with emphasis on value-based methods, policy-based methods, and multi-agent DRL methods. Key research topics are systematically reviewed, including reward function design, state-space representation, exploration strategies, integration of safety regulations, robustness to disturbances, environment modeling, and application domains. Challenges in current studies are analyzed, and potential future research directions are proposed to advance DRL methodologies and promote their practical deployment in USVs navigation. This work aims to provide researchers with a consolidated understanding of DRL-based collision avoidance, fostering innovation and accelerating the engineering implementation of autonomous maritime systems.
期刊介绍:
The aim of Applied Ocean Research is to encourage the submission of papers that advance the state of knowledge in a range of topics relevant to ocean engineering.