基于马尔可夫过程的无人船导航决策算法研究

Maritime Safety International Conference (MASTIC 2018) Pub Date : 1900-01-01 DOI:10.23977/mastic.023

Ruolan Zhang, Masao Furushob

{"title":"基于马尔可夫过程的无人船导航决策算法研究","authors":"Ruolan Zhang, Masao Furushob","doi":"10.23977/mastic.023","DOIUrl":null,"url":null,"abstract":": In this study, the autonomous decision-making architecture of unmanned vessel navigation has been formulated. The aim of this study is the advancement of mathematical methods in the ship transportation field with relevance to collision avoidance scenario applications. The process of seafarers safely navigating a vessel at sea entails enacting appropriate decision-making at the appropriate time. In our model, we do not input the appropriate action order based on a seafarer’s experience. The model scores each step’s reward by its action behaviour and learns how to avoid obstacles by itself. By deploying decision timing, state, reward, and digitizing the seafarer’s decision, we establish a reinforcement learning algorithm based on Markov decision processes. In the model training, under a single factor influence, the vessel tends to change course with the best appropriate action behaviour, which is almost consistent with decision-making behaviour based on actual experience at sea.","PeriodicalId":200338,"journal":{"name":"Maritime Safety International Conference (MASTIC 2018)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Developing Decision-Making Algorithm for Unmanned Vessel Navigation Using Markov Processes\",\"authors\":\"Ruolan Zhang, Masao Furushob\",\"doi\":\"10.23977/mastic.023\",\"DOIUrl\":null,\"url\":null,\"abstract\":\": In this study, the autonomous decision-making architecture of unmanned vessel navigation has been formulated. The aim of this study is the advancement of mathematical methods in the ship transportation field with relevance to collision avoidance scenario applications. The process of seafarers safely navigating a vessel at sea entails enacting appropriate decision-making at the appropriate time. In our model, we do not input the appropriate action order based on a seafarer’s experience. The model scores each step’s reward by its action behaviour and learns how to avoid obstacles by itself. By deploying decision timing, state, reward, and digitizing the seafarer’s decision, we establish a reinforcement learning algorithm based on Markov decision processes. In the model training, under a single factor influence, the vessel tends to change course with the best appropriate action behaviour, which is almost consistent with decision-making behaviour based on actual experience at sea.\",\"PeriodicalId\":200338,\"journal\":{\"name\":\"Maritime Safety International Conference (MASTIC 2018)\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Maritime Safety International Conference (MASTIC 2018)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23977/mastic.023\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Maritime Safety International Conference (MASTIC 2018)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23977/mastic.023","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

:本研究制定了无人船导航自主决策体系结构。本研究的目的是促进船舶运输领域中与避碰场景应用相关的数学方法。海员在海上安全驾驶船舶的过程需要在适当的时间作出适当的决策。在我们的模型中，我们没有根据海员的经验输入适当的操作顺序。该模型根据其行动行为对每一步的奖励进行评分，并学习如何自己避开障碍。通过部署决策时间、状态、奖励和数字化海员决策，我们建立了一种基于马尔可夫决策过程的强化学习算法。在模型训练中，在单因素影响下，船舶倾向于以最合适的行动行为改变航向，这与基于海上实际经验的决策行为基本一致。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Developing Decision-Making Algorithm for Unmanned Vessel Navigation Using Markov Processes

: In this study, the autonomous decision-making architecture of unmanned vessel navigation has been formulated. The aim of this study is the advancement of mathematical methods in the ship transportation field with relevance to collision avoidance scenario applications. The process of seafarers safely navigating a vessel at sea entails enacting appropriate decision-making at the appropriate time. In our model, we do not input the appropriate action order based on a seafarer’s experience. The model scores each step’s reward by its action behaviour and learns how to avoid obstacles by itself. By deploying decision timing, state, reward, and digitizing the seafarer’s decision, we establish a reinforcement learning algorithm based on Markov decision processes. In the model training, under a single factor influence, the vessel tends to change course with the best appropriate action behaviour, which is almost consistent with decision-making behaviour based on actual experience at sea.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Maritime Safety International Conference (MASTIC 2018)

自引率

0.00%

发文量