{"title":"Hierarchical reinforcement learning for dynamic collision avoidance of autonomous ships under uncertain scenarios","authors":"Sijin Yu, Yunbo Li, Jiaye Gong","doi":"10.1016/j.knosys.2025.114528","DOIUrl":null,"url":null,"abstract":"<div><div>Autonomous ships hold substantial potential for enhancing navigational safety, improving collision avoidance efficiency, and increasing adaptability in complex maritime environments, thereby presenting broad prospects for intelligent shipping. This paper introduces a dynamic collision avoidance control method based on a hierarchical reinforcement learning framework for autonomous ships. By integrating high-level global intent planning with low-level fine-grained rudder control, the proposed approach markedly enhances the interpretability, stability, and behavioral consistency of the learned policy. Furthermore, a multidimensional uncertainty modeling mechanism is incorporated during training, systematically accounting for variations in initial states and obstacle behavior patterns, which effectively strengthens policy adaptability and generalization under uncertain conditions. To validate the method, simulations are conducted in representative encounter scenarios as well as in omnidirectional dynamic obstacle tests. A comprehensive evaluation is carried out using multiple control performance metrics, environmental adaptability analysis, policy consistency assessment, and equivalent energy consumption comparisons. The results confirm that the proposed approach achieves stable and reliable intelligent collision avoidance control in highly dynamic environments, offering a feasible and scalable solution for high-performance collision avoidance in intelligent maritime navigation.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"330 ","pages":"Article 114528"},"PeriodicalIF":7.6000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125015679","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Autonomous ships hold substantial potential for enhancing navigational safety, improving collision avoidance efficiency, and increasing adaptability in complex maritime environments, thereby presenting broad prospects for intelligent shipping. This paper introduces a dynamic collision avoidance control method based on a hierarchical reinforcement learning framework for autonomous ships. By integrating high-level global intent planning with low-level fine-grained rudder control, the proposed approach markedly enhances the interpretability, stability, and behavioral consistency of the learned policy. Furthermore, a multidimensional uncertainty modeling mechanism is incorporated during training, systematically accounting for variations in initial states and obstacle behavior patterns, which effectively strengthens policy adaptability and generalization under uncertain conditions. To validate the method, simulations are conducted in representative encounter scenarios as well as in omnidirectional dynamic obstacle tests. A comprehensive evaluation is carried out using multiple control performance metrics, environmental adaptability analysis, policy consistency assessment, and equivalent energy consumption comparisons. The results confirm that the proposed approach achieves stable and reliable intelligent collision avoidance control in highly dynamic environments, offering a feasible and scalable solution for high-performance collision avoidance in intelligent maritime navigation.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.