{"title":"基于深度强化学习算法的无人潜航器路径规划","authors":"Yu Wang, Zhenzhong Chu, Yongli Hu","doi":"10.1109/ICARM58088.2023.10218902","DOIUrl":null,"url":null,"abstract":"This paper implemented path planning for unmanned underwater vehicle (UUV) using deep reinforcement learning (DRL)algorithms with the UUV Simulator. We used three different algorithms, including twin delayed deep deterministic policy gradient (TD3), Soft Actor-Critic (SAC), and Proximal Policy Optimization (PPO). By conducting multiple experiments in a simulation environment and evaluating their results, we found that all three algorithms have good performance and robustness, and each has its own advantages in different test cases. The research results of this paper can provide some reference and guidance for UUV path planning.","PeriodicalId":220013,"journal":{"name":"2023 International Conference on Advanced Robotics and Mechatronics (ICARM)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Path Planning of Unmanned Underwater Vehicles Based on Deep Reinforcement Learning Algorithm\",\"authors\":\"Yu Wang, Zhenzhong Chu, Yongli Hu\",\"doi\":\"10.1109/ICARM58088.2023.10218902\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper implemented path planning for unmanned underwater vehicle (UUV) using deep reinforcement learning (DRL)algorithms with the UUV Simulator. We used three different algorithms, including twin delayed deep deterministic policy gradient (TD3), Soft Actor-Critic (SAC), and Proximal Policy Optimization (PPO). By conducting multiple experiments in a simulation environment and evaluating their results, we found that all three algorithms have good performance and robustness, and each has its own advantages in different test cases. The research results of this paper can provide some reference and guidance for UUV path planning.\",\"PeriodicalId\":220013,\"journal\":{\"name\":\"2023 International Conference on Advanced Robotics and Mechatronics (ICARM)\",\"volume\":\"77 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Advanced Robotics and Mechatronics (ICARM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICARM58088.2023.10218902\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Advanced Robotics and Mechatronics (ICARM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICARM58088.2023.10218902","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Path Planning of Unmanned Underwater Vehicles Based on Deep Reinforcement Learning Algorithm
This paper implemented path planning for unmanned underwater vehicle (UUV) using deep reinforcement learning (DRL)algorithms with the UUV Simulator. We used three different algorithms, including twin delayed deep deterministic policy gradient (TD3), Soft Actor-Critic (SAC), and Proximal Policy Optimization (PPO). By conducting multiple experiments in a simulation environment and evaluating their results, we found that all three algorithms have good performance and robustness, and each has its own advantages in different test cases. The research results of this paper can provide some reference and guidance for UUV path planning.