A path planning algorithm fusion of obstacle avoidance and memory functions

IF 1.2 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Qingchun Zheng, Shubo Li, Peihao Zhu, Wenpeng Ma, Yanlu Wang
{"title":"融合避障和记忆功能的路径规划算法","authors":"Qingchun Zheng,&nbsp;Shubo Li,&nbsp;Peihao Zhu,&nbsp;Wenpeng Ma,&nbsp;Yanlu Wang","doi":"10.1049/ccs2.12098","DOIUrl":null,"url":null,"abstract":"<p>In this study, to address the issues of sluggish convergence and poor learning efficiency at the initial stages of training, the authors improve and optimise the Deep Deterministic Policy Gradient (DDPG) algorithm. First, inspired by the Artificial Potential Field method, the selection strategy of DDPG has been improved to accelerate the convergence speed during the early stages of training and reduce the time it takes for the mobile robot to reach the target point. Then, optimising the neural network structure of the DDPG algorithm based on the Long Short-Term Memory accelerates the algorithm's convergence speed in complex dynamic scenes. Static and dynamic scene simulation experiments of mobile robots are carried out in ROS. Test findings demonstrate that the Artificial Potential Field method-Long Short Term Memory Deep Deterministic Policy Gradient (APF-LSTM DDPG) algorithm converges significantly faster in complex dynamic scenes. The success rate is improved by 7.3% and 3.6% in contrast to the DDPG and LSTM-DDPG algorithms. Finally, the usefulness of the method provided in this study is similarly demonstrated in real situations using real mobile robot platforms, laying the foundation for the path planning of mobile robots in complex changing conditions.</p>","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":null,"pages":null},"PeriodicalIF":1.2000,"publicationDate":"2023-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12098","citationCount":"0","resultStr":"{\"title\":\"A path planning algorithm fusion of obstacle avoidance and memory functions\",\"authors\":\"Qingchun Zheng,&nbsp;Shubo Li,&nbsp;Peihao Zhu,&nbsp;Wenpeng Ma,&nbsp;Yanlu Wang\",\"doi\":\"10.1049/ccs2.12098\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>In this study, to address the issues of sluggish convergence and poor learning efficiency at the initial stages of training, the authors improve and optimise the Deep Deterministic Policy Gradient (DDPG) algorithm. First, inspired by the Artificial Potential Field method, the selection strategy of DDPG has been improved to accelerate the convergence speed during the early stages of training and reduce the time it takes for the mobile robot to reach the target point. Then, optimising the neural network structure of the DDPG algorithm based on the Long Short-Term Memory accelerates the algorithm's convergence speed in complex dynamic scenes. Static and dynamic scene simulation experiments of mobile robots are carried out in ROS. Test findings demonstrate that the Artificial Potential Field method-Long Short Term Memory Deep Deterministic Policy Gradient (APF-LSTM DDPG) algorithm converges significantly faster in complex dynamic scenes. The success rate is improved by 7.3% and 3.6% in contrast to the DDPG and LSTM-DDPG algorithms. 
Finally, the usefulness of the method provided in this study is similarly demonstrated in real situations using real mobile robot platforms, laying the foundation for the path planning of mobile robots in complex changing conditions.</p>\",\"PeriodicalId\":33652,\"journal\":{\"name\":\"Cognitive Computation and Systems\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2023-12-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12098\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cognitive Computation and Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/ccs2.12098\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Computation and Systems","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/ccs2.12098","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Citations: 0

Abstract



To address slow convergence and poor learning efficiency in the early stages of training, this study improves and optimises the Deep Deterministic Policy Gradient (DDPG) algorithm. First, inspired by the Artificial Potential Field (APF) method, the action-selection strategy of DDPG is improved to accelerate convergence during early training and shorten the time the mobile robot needs to reach the target point. Second, the neural network structure of DDPG is optimised with Long Short-Term Memory (LSTM), which further accelerates convergence in complex dynamic scenes. Static and dynamic scene simulations of the mobile robot are carried out in ROS. The results show that the combined APF-LSTM DDPG algorithm converges significantly faster in complex dynamic scenes, improving the success rate by 7.3% and 3.6% over the DDPG and LSTM-DDPG algorithms, respectively. Finally, the method's usefulness is demonstrated on a real mobile robot platform, laying a foundation for path planning of mobile robots in complex, changing environments.
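The article itself provides no code, so the following is a minimal, hypothetical sketch of the first idea: biasing DDPG's exploratory actions with an APF prior whose influence decays as training proceeds. The function names, the gains k_att, k_rep and d0, the decay schedule beta, and the warmup length are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def apf_force(pos, goal, obstacles, k_att=1.0, k_rep=100.0, d0=2.0):
    """Classic artificial-potential-field resultant force.

    The attractive term pulls toward the goal; repulsive terms push
    away from obstacles closer than the influence radius d0.
    """
    force = k_att * (goal - pos)  # attractive component
    for obs in obstacles:
        diff = pos - obs
        d = np.linalg.norm(diff)
        if 1e-6 < d < d0:
            # Repulsive component grows sharply as the robot nears the obstacle.
            force += k_rep * (1.0 / d - 1.0 / d0) * diff / d**3
    return force

def select_action(actor, state, pos, goal, obstacles, episode,
                  warmup_episodes=200, max_speed=0.5):
    """Blend the APF heuristic with the actor's output (illustrative).

    Early in training the APF term dominates, steering exploration
    toward the goal; its weight beta decays linearly to zero, after
    which the learnt DDPG policy acts alone.
    """
    beta = max(0.0, 1.0 - episode / warmup_episodes)  # decaying APF weight
    f = apf_force(pos, goal, obstacles)
    apf_action = max_speed * f / (np.linalg.norm(f) + 1e-8)
    policy_action = actor(state)  # raw DDPG actor output, assumed callable
    return beta * apf_action + (1.0 - beta) * policy_action
```

Blending rather than hard-switching lets the heuristic dominate the earliest episodes, which is consistent with the early-training speed-up the abstract reports, while the learnt policy gradually takes over; other decay schedules would serve the same purpose.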

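On the same hedged terms, here is one plausible way to give the DDPG actor the memory function the abstract describes: an LSTM layer that summarises a short history of observations before the deterministic policy head. The layer sizes, the history length of 8, and the 28-dimensional state (for example, laser readings plus goal information) are assumptions for illustration; the paper's actual architecture may differ. PyTorch is used only for concreteness.

```python
import torch
import torch.nn as nn

class LSTMActor(nn.Module):
    """DDPG actor with an LSTM layer over recent observations (sketch).

    Input is a short sequence of states of shape (batch, seq_len,
    state_dim); the LSTM output at the last step feeds the
    deterministic policy head.
    """
    def __init__(self, state_dim, action_dim, max_action,
                 hidden_dim=128, lstm_layers=1):
        super().__init__()
        self.lstm = nn.LSTM(state_dim, hidden_dim,
                            num_layers=lstm_layers, batch_first=True)
        self.head = nn.Sequential(
            nn.Linear(hidden_dim, 64),
            nn.ReLU(),
            nn.Linear(64, action_dim),
            nn.Tanh(),  # actions scaled to [-max_action, max_action]
        )
        self.max_action = max_action

    def forward(self, state_seq, hidden=None):
        out, hidden = self.lstm(state_seq, hidden)
        action = self.max_action * self.head(out[:, -1])  # last time step
        return action, hidden

# Usage: feed the last few observations as a sequence.
actor = LSTMActor(state_dim=28, action_dim=2, max_action=1.0)
seq = torch.randn(1, 8, 28)  # batch of 1, history of 8 states
action, _ = actor(seq)
```

Conditioning the actor on a short observation history, rather than a single frame, is what lets the policy react to moving obstacles in the dynamic scenes the paper evaluates.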
Source journal
Cognitive Computation and Systems (Computer Science: Computer Science Applications)
CiteScore: 2.50
Self-citation rate: 0.00%
Annual articles: 39
Review time: 10 weeks