基于 C-APF-TD3 算法的 AUV 在三维未知环境中的避障路径规划

IF 4.6 2区 工程技术 Q1 ENGINEERING, CIVIL
Xiaohong Li , Shuanghe Yu
{"title":"基于 C-APF-TD3 算法的 AUV 在三维未知环境中的避障路径规划","authors":"Xiaohong Li ,&nbsp;Shuanghe Yu","doi":"10.1016/j.oceaneng.2024.119886","DOIUrl":null,"url":null,"abstract":"<div><div>To enhance the obstacle avoidance path planning ability of AUV in three-dimensional unknown underwater environments with obstacle constraints, an improved algorithm combining Constrained Artificial Potential Field (C-APF) and the Twin Delayed Deep Deterministic Policy Gradient algorithm (TD3) is proposed (C-APF-TD3). First, the kinematic constraints of AUV are incorporated into the APF algorithm, allowing C-APF to generate an approximate path for the AUV. Then, the AUV obstacle avoidance path planning problem is formulated as a Markov Decision Process (MDP), designing state space, action space, and reward functions. The TD3 training is guided by the approximate path planned using C-APF, ultimately resulting in a policy model for AUV path planning. Finally, various simulation experiments are established and designed for different obstacle scenarios. The experimental results show that the C-APF-TD3 algorithm produces more optimized trajectories compared to the C-APF algorithm. Furthermore, compared to the C-APF-DDPG algorithm, it achieves a policy model with efficient control performance with higher convergence efficiency and average returns, enhancing the adaptability and robustness of AUV obstacle avoidance path planning in three-dimensional unknown environments.</div></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"315 ","pages":"Article 119886"},"PeriodicalIF":4.6000,"publicationDate":"2024-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Obstacle avoidance path planning for AUVs in a three-dimensional unknown environment based on the C-APF-TD3 algorithm\",\"authors\":\"Xiaohong Li ,&nbsp;Shuanghe Yu\",\"doi\":\"10.1016/j.oceaneng.2024.119886\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>To enhance the obstacle avoidance path planning ability of AUV in three-dimensional unknown underwater environments with obstacle constraints, an improved algorithm combining Constrained Artificial Potential Field (C-APF) and the Twin Delayed Deep Deterministic Policy Gradient algorithm (TD3) is proposed (C-APF-TD3). First, the kinematic constraints of AUV are incorporated into the APF algorithm, allowing C-APF to generate an approximate path for the AUV. Then, the AUV obstacle avoidance path planning problem is formulated as a Markov Decision Process (MDP), designing state space, action space, and reward functions. The TD3 training is guided by the approximate path planned using C-APF, ultimately resulting in a policy model for AUV path planning. Finally, various simulation experiments are established and designed for different obstacle scenarios. The experimental results show that the C-APF-TD3 algorithm produces more optimized trajectories compared to the C-APF algorithm. Furthermore, compared to the C-APF-DDPG algorithm, it achieves a policy model with efficient control performance with higher convergence efficiency and average returns, enhancing the adaptability and robustness of AUV obstacle avoidance path planning in three-dimensional unknown environments.</div></div>\",\"PeriodicalId\":19403,\"journal\":{\"name\":\"Ocean Engineering\",\"volume\":\"315 \",\"pages\":\"Article 119886\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-11-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ocean Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0029801824032244\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801824032244","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

摘要

为了提高自动潜航器在有障碍物约束的三维未知水下环境中的避障路径规划能力,提出了一种将约束人工势场(C-APF)和双延迟深度确定性策略梯度算法(TD3)相结合的改进算法(C-APF-TD3)。首先,将 AUV 的运动学约束纳入 APF 算法,使 C-APF 能够为 AUV 生成近似路径。然后,将 AUV 避障路径规划问题表述为马尔可夫决策过程(MDP),设计状态空间、行动空间和奖励函数。利用 C-APF 规划的近似路径指导 TD3 训练,最终形成 AUV 路径规划的策略模型。最后,针对不同的障碍场景建立和设计了各种模拟实验。实验结果表明,与 C-APF 算法相比,C-APF-TD3 算法能生成更优化的轨迹。此外,与 C-APF-DDPG 算法相比,该算法实现了具有高效控制性能的策略模型,具有更高的收敛效率和平均收益,增强了 AUV 在三维未知环境中避障路径规划的适应性和鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Obstacle avoidance path planning for AUVs in a three-dimensional unknown environment based on the C-APF-TD3 algorithm
To enhance the obstacle avoidance path planning ability of AUV in three-dimensional unknown underwater environments with obstacle constraints, an improved algorithm combining Constrained Artificial Potential Field (C-APF) and the Twin Delayed Deep Deterministic Policy Gradient algorithm (TD3) is proposed (C-APF-TD3). First, the kinematic constraints of AUV are incorporated into the APF algorithm, allowing C-APF to generate an approximate path for the AUV. Then, the AUV obstacle avoidance path planning problem is formulated as a Markov Decision Process (MDP), designing state space, action space, and reward functions. The TD3 training is guided by the approximate path planned using C-APF, ultimately resulting in a policy model for AUV path planning. Finally, various simulation experiments are established and designed for different obstacle scenarios. The experimental results show that the C-APF-TD3 algorithm produces more optimized trajectories compared to the C-APF algorithm. Furthermore, compared to the C-APF-DDPG algorithm, it achieves a policy model with efficient control performance with higher convergence efficiency and average returns, enhancing the adaptability and robustness of AUV obstacle avoidance path planning in three-dimensional unknown environments.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Ocean Engineering
Ocean Engineering 工程技术-工程:大洋
CiteScore
7.30
自引率
34.00%
发文量
2379
审稿时长
8.1 months
期刊介绍: Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信