一种基于动态快速Q学习的无人机路径规划方法

IF 5.5 2区工程技术 Q1 ENGINEERING, CIVIL

Ocean Engineering Pub Date : 2023-02-15 DOI:10.1016/j.oceaneng.2023.113632

Bing Hao , He Du , Zheping Yan

{"title":"一种基于动态快速Q学习的无人机路径规划方法","authors":"Bing Hao , He Du , Zheping Yan","doi":"10.1016/j.oceaneng.2023.113632","DOIUrl":null,"url":null,"abstract":"<div><p>Path planning is a critical issue for unmanned surface vehicles (USVs), and an effective path-planning algorithm enables USVs to accomplish the mission. In this paper, a novel algorithm called dynamic and fast Q-learning (DFQL) to solve the path planning problem for USV in partially known maritime environments is proposed, which combines Q-learning with artificial potential field (APF) to initialize the Q-table to provide a priori knowledge from the environment to USV. To accelerate the convergence of Q-learning to the optimal solution and avoid USV's behavior of walking randomly in the early stage of exploration, the static and dynamic rewards are proposed to motivate the USV to move toward the target. Moreover, the performance of the proposed algorithm is verified with offline and online modes for USV in different environmental conditions. By comparing with the existing methods, it shows that the proposed approach is effective for path planning of USV.</p></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"270 ","pages":"Article 113632"},"PeriodicalIF":5.5000,"publicationDate":"2023-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"A path planning approach for unmanned surface vehicles based on dynamic and fast Q-learning\",\"authors\":\"Bing Hao , He Du , Zheping Yan\",\"doi\":\"10.1016/j.oceaneng.2023.113632\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Path planning is a critical issue for unmanned surface vehicles (USVs), and an effective path-planning algorithm enables USVs to accomplish the mission. In this paper, a novel algorithm called dynamic and fast Q-learning (DFQL) to solve the path planning problem for USV in partially known maritime environments is proposed, which combines Q-learning with artificial potential field (APF) to initialize the Q-table to provide a priori knowledge from the environment to USV. To accelerate the convergence of Q-learning to the optimal solution and avoid USV's behavior of walking randomly in the early stage of exploration, the static and dynamic rewards are proposed to motivate the USV to move toward the target. Moreover, the performance of the proposed algorithm is verified with offline and online modes for USV in different environmental conditions. By comparing with the existing methods, it shows that the proposed approach is effective for path planning of USV.</p></div>\",\"PeriodicalId\":19403,\"journal\":{\"name\":\"Ocean Engineering\",\"volume\":\"270 \",\"pages\":\"Article 113632\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2023-02-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ocean Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0029801823000161\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801823000161","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}

引用次数: 10

摘要

路径规划是无人水面飞行器的关键问题，有效的路径规划算法是无人水面飞行器完成任务的关键。针对部分已知海洋环境中无人潜航器的路径规划问题，提出了一种动态快速q -学习(DFQL)算法，该算法将q -学习与人工势场(APF)相结合，初始化q -表，为无人潜航器提供环境的先验知识。为了加速Q-learning收敛到最优解，避免USV在探索初期随机行走的行为，提出了静态和动态奖励来激励USV向目标移动。针对不同环境条件下的无人潜航器，采用离线和在线两种模式对算法的性能进行了验证。通过与现有方法的比较，表明该方法对无人潜航器路径规划是有效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A path planning approach for unmanned surface vehicles based on dynamic and fast Q-learning

Path planning is a critical issue for unmanned surface vehicles (USVs), and an effective path-planning algorithm enables USVs to accomplish the mission. In this paper, a novel algorithm called dynamic and fast Q-learning (DFQL) to solve the path planning problem for USV in partially known maritime environments is proposed, which combines Q-learning with artificial potential field (APF) to initialize the Q-table to provide a priori knowledge from the environment to USV. To accelerate the convergence of Q-learning to the optimal solution and avoid USV's behavior of walking randomly in the early stage of exploration, the static and dynamic rewards are proposed to motivate the USV to move toward the target. Moreover, the performance of the proposed algorithm is verified with offline and online modes for USV in different environmental conditions. By comparing with the existing methods, it shows that the proposed approach is effective for path planning of USV.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Ocean Engineering 工程技术-工程：大洋

CiteScore

7.30

自引率

34.00%

发文量

2379

审稿时长

8.1 months

期刊介绍： Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.