Xin Jiang , Jiawen Li , Zhenkai Huang , Ji Huang , Ronghui Li
{"title":"Exploring the performance impact of soft constraint integration on reinforcement learning-based autonomous vessel navigation: Experimental insights","authors":"Xin Jiang , Jiawen Li , Zhenkai Huang , Ji Huang , Ronghui Li","doi":"10.1016/j.ijnaoe.2024.100609","DOIUrl":null,"url":null,"abstract":"<div><p>Reinforcement learning has shown promise in enabling autonomous ship navigation, allowing vessels to adapt and make informed decisions in complex marine environments. However, the integration of soft constraints and their impact on performance in RL-based autonomous vessel navigation research remain understudied. This research addresses this gap by investigating the implications of soft constraints in the context of the risk-averse ship navigation problem. Four distinct soft constraint functions are proposed, which are integrated with two widely used RL algorithms, resulting in the creation of eight risk-averse autonomous vessel navigation models. To ensure a comprehensive evaluation of their performance, comparative analyses are conducted across seven virtual digital channel environments. Additionally, a novel metric, known as Large Helm Momentum (LHM), is introduced to quantify the smoothness of autonomous vessel navigation. Through thorough experimentation, key considerations for the design of soft constraint functions in the domain of autonomous ship navigation are identified. A comprehensive understanding of how different soft constraint functions influence autonomous driving behavior has been achieved. Key considerations for designing soft constraint functions in the domain of autonomous ship navigation have also been identified. Five principles, namely the constraint association principle, dominance of hard constraints, reward-balance principle, mapping requirement principle, and iterative improvement principle, are proposed to optimize the design of soft constraint functions for autonomous ship navigation, providing valuable guidance and insights.</p></div>","PeriodicalId":14160,"journal":{"name":"International Journal of Naval Architecture and Ocean Engineering","volume":"16 ","pages":"Article 100609"},"PeriodicalIF":2.3000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2092678224000281/pdfft?md5=88e268ac351e4b2b9b7c1582b3e4b23b&pid=1-s2.0-S2092678224000281-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Naval Architecture and Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2092678224000281","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MARINE","Score":null,"Total":0}
引用次数: 0
Abstract
Reinforcement learning has shown promise in enabling autonomous ship navigation, allowing vessels to adapt and make informed decisions in complex marine environments. However, the integration of soft constraints and their impact on performance in RL-based autonomous vessel navigation research remain understudied. This research addresses this gap by investigating the implications of soft constraints in the context of the risk-averse ship navigation problem. Four distinct soft constraint functions are proposed, which are integrated with two widely used RL algorithms, resulting in the creation of eight risk-averse autonomous vessel navigation models. To ensure a comprehensive evaluation of their performance, comparative analyses are conducted across seven virtual digital channel environments. Additionally, a novel metric, known as Large Helm Momentum (LHM), is introduced to quantify the smoothness of autonomous vessel navigation. Through thorough experimentation, key considerations for the design of soft constraint functions in the domain of autonomous ship navigation are identified. A comprehensive understanding of how different soft constraint functions influence autonomous driving behavior has been achieved. Key considerations for designing soft constraint functions in the domain of autonomous ship navigation have also been identified. Five principles, namely the constraint association principle, dominance of hard constraints, reward-balance principle, mapping requirement principle, and iterative improvement principle, are proposed to optimize the design of soft constraint functions for autonomous ship navigation, providing valuable guidance and insights.
期刊介绍:
International Journal of Naval Architecture and Ocean Engineering provides a forum for engineers and scientists from a wide range of disciplines to present and discuss various phenomena in the utilization and preservation of ocean environment. Without being limited by the traditional categorization, it is encouraged to present advanced technology development and scientific research, as long as they are aimed for more and better human engagement with ocean environment. Topics include, but not limited to: marine hydrodynamics; structural mechanics; marine propulsion system; design methodology & practice; production technology; system dynamics & control; marine equipment technology; materials science; underwater acoustics; ocean remote sensing; and information technology related to ship and marine systems; ocean energy systems; marine environmental engineering; maritime safety engineering; polar & arctic engineering; coastal & port engineering; subsea engineering; and specialized watercraft engineering.