The USV Path Planning of Dueling DQN Algorithm Based on Tree Sampling Mechanism

2022 IEEE Asia-Pacific Conference on Image Processing, Electronics and Computers (IPEC) Pub Date : 2022-04-14 DOI:10.1109/IPEC54454.2022.9777508

Zhijian Huang, Sen Liu, Gui-chen Zhang

{"title":"The USV Path Planning of Dueling DQN Algorithm Based on Tree Sampling Mechanism","authors":"Zhijian Huang, Sen Liu, Gui-chen Zhang","doi":"10.1109/IPEC54454.2022.9777508","DOIUrl":null,"url":null,"abstract":"The path planning and obstacle avoidance of USV (unmanned surface vessel) has become a research hotspot in recent years. Among them, the DQN algorithm has achieved good results in the obstacle avoidance and path planning problems of unmanned surface vessel. However, the algorithm suffers from the problems that the sampling method does not make full use of the stored information and the randomness of action selection during the training process is too large and the convergence is too slow. In this paper, we propose a Dueling DQN algorithm to optimize obstacle avoidance and path planning, which based on tree sampling mechanism. The Dueling DQN algorithm will decomposes the value function Q into a state-value function (V) and a dominance function (A). Meanwhile, the absolute value of TD-error is directly used as a priority indicator for priority sampling in the sampling process. Subsequently, the network model is built and experiments are conducted on each of the four maps. As a result, the convergence steps and loss values of the proposed algorithm on the four paths are better than those of the DQN algorithm. It shows that the dueling DQN algorithm can effectively use the stored information for optimal path planning.","PeriodicalId":232563,"journal":{"name":"2022 IEEE Asia-Pacific Conference on Image Processing, Electronics and Computers (IPEC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Asia-Pacific Conference on Image Processing, Electronics and Computers (IPEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPEC54454.2022.9777508","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

The path planning and obstacle avoidance of USV (unmanned surface vessel) has become a research hotspot in recent years. Among them, the DQN algorithm has achieved good results in the obstacle avoidance and path planning problems of unmanned surface vessel. However, the algorithm suffers from the problems that the sampling method does not make full use of the stored information and the randomness of action selection during the training process is too large and the convergence is too slow. In this paper, we propose a Dueling DQN algorithm to optimize obstacle avoidance and path planning, which based on tree sampling mechanism. The Dueling DQN algorithm will decomposes the value function Q into a state-value function (V) and a dominance function (A). Meanwhile, the absolute value of TD-error is directly used as a priority indicator for priority sampling in the sampling process. Subsequently, the network model is built and experiments are conducted on each of the four maps. As a result, the convergence steps and loss values of the proposed algorithm on the four paths are better than those of the DQN algorithm. It shows that the dueling DQN algorithm can effectively use the stored information for optimal path planning.

查看原文本刊更多论文

基于树形采样机制的决斗DQN算法USV路径规划

无人水面舰艇的路径规划与避障问题已成为近年来的研究热点。其中，DQN算法在无人水面舰艇避障和路径规划问题上取得了较好的效果。但是，该算法存在采样方法没有充分利用存储的信息，训练过程中动作选择的随机性太大，收敛速度太慢等问题。本文提出了一种基于树形采样机制的Dueling DQN算法来优化避障和路径规划。Dueling DQN算法将值函数Q分解为状态-值函数(V)和优势函数(a)，同时在采样过程中直接将TD-error的绝对值作为优先级采样的优先级指标。随后，建立网络模型，并对四张地图分别进行实验。结果表明，该算法在4条路径上的收敛步数和损失值均优于DQN算法。结果表明，决斗DQN算法可以有效地利用存储的信息进行最优路径规划。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE Asia-Pacific Conference on Image Processing, Electronics and Computers (IPEC)

自引率

0.00%

发文量