{"title":"基于元强化学习的自主船舶避碰","authors":"Xinyu Jia , Shu Gao , Wei He","doi":"10.1016/j.oceaneng.2025.122064","DOIUrl":null,"url":null,"abstract":"<div><div>Collision avoidance is critical for intelligent ship navigation. Ships encounter a variety of complex scenarios in real-world navigation environments, which requires improvements in the adaptability and effectiveness of collision avoidance policies. Therefore, we have innovatively proposed a meta-reinforcement learning method for solving ship collision avoidance. Inspired by meta-learning, we designed a two-layered recurrent model to enhance the adaptability and effectiveness of collision avoidance policies. Then, we created a task sampling method to train vessel agents in making collision avoidance decisions for high-risk encounter situations. The objective function and the policy gradient method for risk assessment are designed to enable vessel agents to thoroughly evaluate the risk situation of the current encounter scenario and optimize the collision avoidance policy. Lastly, we conducted simulation experiments to validate the feasibility of our work. The results indicate that collision avoidance policies outperform various comparative methods, exhibiting competitive advantages in adaptability, effectiveness, and safety in diverse encounter scenarios. Overall, our novel method provides a safer solution to enhance intelligent ship navigation.</div></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"339 ","pages":"Article 122064"},"PeriodicalIF":5.5000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Meta-reinforcement learning-based collision avoidance for autonomous ship\",\"authors\":\"Xinyu Jia , Shu Gao , Wei He\",\"doi\":\"10.1016/j.oceaneng.2025.122064\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Collision avoidance is critical for intelligent ship navigation. Ships encounter a variety of complex scenarios in real-world navigation environments, which requires improvements in the adaptability and effectiveness of collision avoidance policies. Therefore, we have innovatively proposed a meta-reinforcement learning method for solving ship collision avoidance. Inspired by meta-learning, we designed a two-layered recurrent model to enhance the adaptability and effectiveness of collision avoidance policies. Then, we created a task sampling method to train vessel agents in making collision avoidance decisions for high-risk encounter situations. The objective function and the policy gradient method for risk assessment are designed to enable vessel agents to thoroughly evaluate the risk situation of the current encounter scenario and optimize the collision avoidance policy. Lastly, we conducted simulation experiments to validate the feasibility of our work. The results indicate that collision avoidance policies outperform various comparative methods, exhibiting competitive advantages in adaptability, effectiveness, and safety in diverse encounter scenarios. Overall, our novel method provides a safer solution to enhance intelligent ship navigation.</div></div>\",\"PeriodicalId\":19403,\"journal\":{\"name\":\"Ocean Engineering\",\"volume\":\"339 \",\"pages\":\"Article 122064\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ocean Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0029801825017159\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801825017159","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
Meta-reinforcement learning-based collision avoidance for autonomous ship
Collision avoidance is critical for intelligent ship navigation. Ships encounter a variety of complex scenarios in real-world navigation environments, which requires improvements in the adaptability and effectiveness of collision avoidance policies. Therefore, we have innovatively proposed a meta-reinforcement learning method for solving ship collision avoidance. Inspired by meta-learning, we designed a two-layered recurrent model to enhance the adaptability and effectiveness of collision avoidance policies. Then, we created a task sampling method to train vessel agents in making collision avoidance decisions for high-risk encounter situations. The objective function and the policy gradient method for risk assessment are designed to enable vessel agents to thoroughly evaluate the risk situation of the current encounter scenario and optimize the collision avoidance policy. Lastly, we conducted simulation experiments to validate the feasibility of our work. The results indicate that collision avoidance policies outperform various comparative methods, exhibiting competitive advantages in adaptability, effectiveness, and safety in diverse encounter scenarios. Overall, our novel method provides a safer solution to enhance intelligent ship navigation.
期刊介绍:
Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.