{"title":"一种基于动态快速Q学习的无人机路径规划方法","authors":"Bing Hao , He Du , Zheping Yan","doi":"10.1016/j.oceaneng.2023.113632","DOIUrl":null,"url":null,"abstract":"<div><p>Path planning is a critical issue for unmanned surface vehicles (USVs), and an effective path-planning algorithm enables USVs to accomplish the mission. In this paper, a novel algorithm called dynamic and fast Q-learning (DFQL) to solve the path planning problem for USV in partially known maritime environments is proposed, which combines Q-learning with artificial potential field (APF) to initialize the Q-table to provide a priori knowledge from the environment to USV. To accelerate the convergence of Q-learning to the optimal solution and avoid USV's behavior of walking randomly in the early stage of exploration, the static and dynamic rewards are proposed to motivate the USV to move toward the target. Moreover, the performance of the proposed algorithm is verified with offline and online modes for USV in different environmental conditions. By comparing with the existing methods, it shows that the proposed approach is effective for path planning of USV.</p></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"270 ","pages":"Article 113632"},"PeriodicalIF":5.5000,"publicationDate":"2023-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"A path planning approach for unmanned surface vehicles based on dynamic and fast Q-learning\",\"authors\":\"Bing Hao , He Du , Zheping Yan\",\"doi\":\"10.1016/j.oceaneng.2023.113632\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Path planning is a critical issue for unmanned surface vehicles (USVs), and an effective path-planning algorithm enables USVs to accomplish the mission. In this paper, a novel algorithm called dynamic and fast Q-learning (DFQL) to solve the path planning problem for USV in partially known maritime environments is proposed, which combines Q-learning with artificial potential field (APF) to initialize the Q-table to provide a priori knowledge from the environment to USV. To accelerate the convergence of Q-learning to the optimal solution and avoid USV's behavior of walking randomly in the early stage of exploration, the static and dynamic rewards are proposed to motivate the USV to move toward the target. Moreover, the performance of the proposed algorithm is verified with offline and online modes for USV in different environmental conditions. By comparing with the existing methods, it shows that the proposed approach is effective for path planning of USV.</p></div>\",\"PeriodicalId\":19403,\"journal\":{\"name\":\"Ocean Engineering\",\"volume\":\"270 \",\"pages\":\"Article 113632\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2023-02-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ocean Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0029801823000161\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801823000161","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
A path planning approach for unmanned surface vehicles based on dynamic and fast Q-learning
Path planning is a critical issue for unmanned surface vehicles (USVs), and an effective path-planning algorithm enables USVs to accomplish the mission. In this paper, a novel algorithm called dynamic and fast Q-learning (DFQL) to solve the path planning problem for USV in partially known maritime environments is proposed, which combines Q-learning with artificial potential field (APF) to initialize the Q-table to provide a priori knowledge from the environment to USV. To accelerate the convergence of Q-learning to the optimal solution and avoid USV's behavior of walking randomly in the early stage of exploration, the static and dynamic rewards are proposed to motivate the USV to move toward the target. Moreover, the performance of the proposed algorithm is verified with offline and online modes for USV in different environmental conditions. By comparing with the existing methods, it shows that the proposed approach is effective for path planning of USV.
期刊介绍:
Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.