The USV Path Planning of Dueling DQN Algorithm Based on Tree Sampling Mechanism

Zhijian Huang, Sen Liu, Gui-chen Zhang
{"title":"The USV Path Planning of Dueling DQN Algorithm Based on Tree Sampling Mechanism","authors":"Zhijian Huang, Sen Liu, Gui-chen Zhang","doi":"10.1109/IPEC54454.2022.9777508","DOIUrl":null,"url":null,"abstract":"The path planning and obstacle avoidance of USV (unmanned surface vessel) has become a research hotspot in recent years. Among them, the DQN algorithm has achieved good results in the obstacle avoidance and path planning problems of unmanned surface vessel. However, the algorithm suffers from the problems that the sampling method does not make full use of the stored information and the randomness of action selection during the training process is too large and the convergence is too slow. In this paper, we propose a Dueling DQN algorithm to optimize obstacle avoidance and path planning, which based on tree sampling mechanism. The Dueling DQN algorithm will decomposes the value function Q into a state-value function (V) and a dominance function (A). Meanwhile, the absolute value of TD-error is directly used as a priority indicator for priority sampling in the sampling process. Subsequently, the network model is built and experiments are conducted on each of the four maps. As a result, the convergence steps and loss values of the proposed algorithm on the four paths are better than those of the DQN algorithm. It shows that the dueling DQN algorithm can effectively use the stored information for optimal path planning.","PeriodicalId":232563,"journal":{"name":"2022 IEEE Asia-Pacific Conference on Image Processing, Electronics and Computers (IPEC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Asia-Pacific Conference on Image Processing, Electronics and Computers (IPEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPEC54454.2022.9777508","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

The path planning and obstacle avoidance of USV (unmanned surface vessel) has become a research hotspot in recent years. Among them, the DQN algorithm has achieved good results in the obstacle avoidance and path planning problems of unmanned surface vessel. However, the algorithm suffers from the problems that the sampling method does not make full use of the stored information and the randomness of action selection during the training process is too large and the convergence is too slow. In this paper, we propose a Dueling DQN algorithm to optimize obstacle avoidance and path planning, which based on tree sampling mechanism. The Dueling DQN algorithm will decomposes the value function Q into a state-value function (V) and a dominance function (A). Meanwhile, the absolute value of TD-error is directly used as a priority indicator for priority sampling in the sampling process. Subsequently, the network model is built and experiments are conducted on each of the four maps. As a result, the convergence steps and loss values of the proposed algorithm on the four paths are better than those of the DQN algorithm. It shows that the dueling DQN algorithm can effectively use the stored information for optimal path planning.
基于树形采样机制的决斗DQN算法USV路径规划
无人水面舰艇的路径规划与避障问题已成为近年来的研究热点。其中,DQN算法在无人水面舰艇避障和路径规划问题上取得了较好的效果。但是,该算法存在采样方法没有充分利用存储的信息,训练过程中动作选择的随机性太大,收敛速度太慢等问题。本文提出了一种基于树形采样机制的Dueling DQN算法来优化避障和路径规划。Dueling DQN算法将值函数Q分解为状态-值函数(V)和优势函数(a),同时在采样过程中直接将TD-error的绝对值作为优先级采样的优先级指标。随后,建立网络模型,并对四张地图分别进行实验。结果表明,该算法在4条路径上的收敛步数和损失值均优于DQN算法。结果表明,决斗DQN算法可以有效地利用存储的信息进行最优路径规划。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信