Exploring the performance impact of soft constraint integration on reinforcement learning-based autonomous vessel navigation: Experimental insights

IF 2.3 3区 工程技术 Q2 ENGINEERING, MARINE
Xin Jiang , Jiawen Li , Zhenkai Huang , Ji Huang , Ronghui Li
{"title":"Exploring the performance impact of soft constraint integration on reinforcement learning-based autonomous vessel navigation: Experimental insights","authors":"Xin Jiang ,&nbsp;Jiawen Li ,&nbsp;Zhenkai Huang ,&nbsp;Ji Huang ,&nbsp;Ronghui Li","doi":"10.1016/j.ijnaoe.2024.100609","DOIUrl":null,"url":null,"abstract":"<div><p>Reinforcement learning has shown promise in enabling autonomous ship navigation, allowing vessels to adapt and make informed decisions in complex marine environments. However, the integration of soft constraints and their impact on performance in RL-based autonomous vessel navigation research remain understudied. This research addresses this gap by investigating the implications of soft constraints in the context of the risk-averse ship navigation problem. Four distinct soft constraint functions are proposed, which are integrated with two widely used RL algorithms, resulting in the creation of eight risk-averse autonomous vessel navigation models. To ensure a comprehensive evaluation of their performance, comparative analyses are conducted across seven virtual digital channel environments. Additionally, a novel metric, known as Large Helm Momentum (LHM), is introduced to quantify the smoothness of autonomous vessel navigation. Through thorough experimentation, key considerations for the design of soft constraint functions in the domain of autonomous ship navigation are identified. A comprehensive understanding of how different soft constraint functions influence autonomous driving behavior has been achieved. Key considerations for designing soft constraint functions in the domain of autonomous ship navigation have also been identified. Five principles, namely the constraint association principle, dominance of hard constraints, reward-balance principle, mapping requirement principle, and iterative improvement principle, are proposed to optimize the design of soft constraint functions for autonomous ship navigation, providing valuable guidance and insights.</p></div>","PeriodicalId":14160,"journal":{"name":"International Journal of Naval Architecture and Ocean Engineering","volume":"16 ","pages":"Article 100609"},"PeriodicalIF":2.3000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2092678224000281/pdfft?md5=88e268ac351e4b2b9b7c1582b3e4b23b&pid=1-s2.0-S2092678224000281-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Naval Architecture and Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2092678224000281","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MARINE","Score":null,"Total":0}
引用次数: 0

Abstract

Reinforcement learning has shown promise in enabling autonomous ship navigation, allowing vessels to adapt and make informed decisions in complex marine environments. However, the integration of soft constraints and their impact on performance in RL-based autonomous vessel navigation research remain understudied. This research addresses this gap by investigating the implications of soft constraints in the context of the risk-averse ship navigation problem. Four distinct soft constraint functions are proposed, which are integrated with two widely used RL algorithms, resulting in the creation of eight risk-averse autonomous vessel navigation models. To ensure a comprehensive evaluation of their performance, comparative analyses are conducted across seven virtual digital channel environments. Additionally, a novel metric, known as Large Helm Momentum (LHM), is introduced to quantify the smoothness of autonomous vessel navigation. Through thorough experimentation, key considerations for the design of soft constraint functions in the domain of autonomous ship navigation are identified. A comprehensive understanding of how different soft constraint functions influence autonomous driving behavior has been achieved. Key considerations for designing soft constraint functions in the domain of autonomous ship navigation have also been identified. Five principles, namely the constraint association principle, dominance of hard constraints, reward-balance principle, mapping requirement principle, and iterative improvement principle, are proposed to optimize the design of soft constraint functions for autonomous ship navigation, providing valuable guidance and insights.

探索软约束整合对基于强化学习的自主船舶导航性能的影响:实验启示
强化学习在实现船舶自主导航方面大有可为,它使船舶能够在复杂的海洋环境中适应环境并做出明智的决策。然而,在基于 RL 的自主船舶导航研究中,软约束的整合及其对性能的影响仍未得到充分研究。本研究针对这一空白,研究了软约束在规避风险的船舶导航问题中的影响。研究提出了四种不同的软约束函数,并将其与两种广泛使用的 RL 算法相结合,最终创建了八个规避风险的自主船舶导航模型。为确保对这些模型的性能进行全面评估,在七个虚拟数字航道环境中进行了比较分析。此外,还引入了一种称为大舵机动量(LHM)的新指标,用于量化自主船舶导航的平稳性。通过全面的实验,确定了自主船舶导航领域软约束函数设计的关键考虑因素。对不同软约束函数如何影响自主驾驶行为有了全面的了解。此外,还确定了在船舶自主导航领域设计软约束函数的关键考虑因素。提出了五个原则,即约束关联原则、硬约束主导原则、奖励平衡原则、映射要求原则和迭代改进原则,为优化船舶自主导航软约束函数的设计提供了有价值的指导和启示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.90
自引率
4.50%
发文量
62
审稿时长
12 months
期刊介绍: International Journal of Naval Architecture and Ocean Engineering provides a forum for engineers and scientists from a wide range of disciplines to present and discuss various phenomena in the utilization and preservation of ocean environment. Without being limited by the traditional categorization, it is encouraged to present advanced technology development and scientific research, as long as they are aimed for more and better human engagement with ocean environment. Topics include, but not limited to: marine hydrodynamics; structural mechanics; marine propulsion system; design methodology & practice; production technology; system dynamics & control; marine equipment technology; materials science; underwater acoustics; ocean remote sensing; and information technology related to ship and marine systems; ocean energy systems; marine environmental engineering; maritime safety engineering; polar & arctic engineering; coastal & port engineering; subsea engineering; and specialized watercraft engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信