{"title":"A path planning approach for unmanned surface vehicles based on dynamic and fast Q-learning","authors":"Bing Hao , He Du , Zheping Yan","doi":"10.1016/j.oceaneng.2023.113632","DOIUrl":null,"url":null,"abstract":"<div><p>Path planning is a critical issue for unmanned surface vehicles (USVs), and an effective path-planning algorithm enables USVs to accomplish the mission. In this paper, a novel algorithm called dynamic and fast Q-learning (DFQL) to solve the path planning problem for USV in partially known maritime environments is proposed, which combines Q-learning with artificial potential field (APF) to initialize the Q-table to provide a priori knowledge from the environment to USV. To accelerate the convergence of Q-learning to the optimal solution and avoid USV's behavior of walking randomly in the early stage of exploration, the static and dynamic rewards are proposed to motivate the USV to move toward the target. Moreover, the performance of the proposed algorithm is verified with offline and online modes for USV in different environmental conditions. By comparing with the existing methods, it shows that the proposed approach is effective for path planning of USV.</p></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"270 ","pages":"Article 113632"},"PeriodicalIF":5.5000,"publicationDate":"2023-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801823000161","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 10
Abstract
Path planning is a critical issue for unmanned surface vehicles (USVs), and an effective path-planning algorithm enables USVs to accomplish the mission. In this paper, a novel algorithm called dynamic and fast Q-learning (DFQL) to solve the path planning problem for USV in partially known maritime environments is proposed, which combines Q-learning with artificial potential field (APF) to initialize the Q-table to provide a priori knowledge from the environment to USV. To accelerate the convergence of Q-learning to the optimal solution and avoid USV's behavior of walking randomly in the early stage of exploration, the static and dynamic rewards are proposed to motivate the USV to move toward the target. Moreover, the performance of the proposed algorithm is verified with offline and online modes for USV in different environmental conditions. By comparing with the existing methods, it shows that the proposed approach is effective for path planning of USV.
期刊介绍:
Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.