Xianghe Wang, Zezhao Meng, Xiangwang Hou, Jun Du, Ruiqi Liu, Yong Ren
{"title":"基于强化学习的动态海洋环境下AUV群避障研究","authors":"Xianghe Wang, Zezhao Meng, Xiangwang Hou, Jun Du, Ruiqi Liu, Yong Ren","doi":"10.1109/ICCCWorkshops57813.2023.10233813","DOIUrl":null,"url":null,"abstract":"As an essential component of 6G communication, the Internet of Underwater Things (IoUT) plays a crucial role in supporting various maritime activities. Due to the IoUT devices’ wide-area distribution and constrained transmit power, autonomous underwater vehicles (AUVs) have become extensively employed for data collection in underwater environments. Ensuring the reliability of underwater tasks, the autonomous obstacle avoidance of AUV swarms has emerged as a critical research area. The Multi-Agent Deep Deterministic Policy Gradient algorithm (MADDPG), which employs centralized training and distributed execution, is an effective method to address this problem. However, current studies often overlook the complex dynamic underwater environment, including the interference caused by ocean currents on AUV movement and the observation noise of objects in the ocean. To adapt to the interference of ocean currents on AUV movement, we establish a two-dimensional motion model for AUVs. This model enables us to transform the output of MADDPG into propulsion force and steering gear control force, effectively accounting for ocean current disturbances. To tackle the observation noise of obstacles in underwater environment, we first assume the obstacle is moving in uniform linear motion, and then use the Kalman filtering algorithm to process the observed signals. Our simulation results demonstrate the superiority of our scheme.","PeriodicalId":201450,"journal":{"name":"2023 IEEE/CIC International Conference on Communications in China (ICCC Workshops)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement Learning Based Obstacle Avoidance for AUV Swarm in Dynamic Ocean Environment\",\"authors\":\"Xianghe Wang, Zezhao Meng, Xiangwang Hou, Jun Du, Ruiqi Liu, Yong Ren\",\"doi\":\"10.1109/ICCCWorkshops57813.2023.10233813\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As an essential component of 6G communication, the Internet of Underwater Things (IoUT) plays a crucial role in supporting various maritime activities. Due to the IoUT devices’ wide-area distribution and constrained transmit power, autonomous underwater vehicles (AUVs) have become extensively employed for data collection in underwater environments. Ensuring the reliability of underwater tasks, the autonomous obstacle avoidance of AUV swarms has emerged as a critical research area. The Multi-Agent Deep Deterministic Policy Gradient algorithm (MADDPG), which employs centralized training and distributed execution, is an effective method to address this problem. However, current studies often overlook the complex dynamic underwater environment, including the interference caused by ocean currents on AUV movement and the observation noise of objects in the ocean. To adapt to the interference of ocean currents on AUV movement, we establish a two-dimensional motion model for AUVs. This model enables us to transform the output of MADDPG into propulsion force and steering gear control force, effectively accounting for ocean current disturbances. To tackle the observation noise of obstacles in underwater environment, we first assume the obstacle is moving in uniform linear motion, and then use the Kalman filtering algorithm to process the observed signals. Our simulation results demonstrate the superiority of our scheme.\",\"PeriodicalId\":201450,\"journal\":{\"name\":\"2023 IEEE/CIC International Conference on Communications in China (ICCC Workshops)\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE/CIC International Conference on Communications in China (ICCC Workshops)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCWorkshops57813.2023.10233813\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE/CIC International Conference on Communications in China (ICCC Workshops)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCWorkshops57813.2023.10233813","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reinforcement Learning Based Obstacle Avoidance for AUV Swarm in Dynamic Ocean Environment
As an essential component of 6G communication, the Internet of Underwater Things (IoUT) plays a crucial role in supporting various maritime activities. Due to the IoUT devices’ wide-area distribution and constrained transmit power, autonomous underwater vehicles (AUVs) have become extensively employed for data collection in underwater environments. Ensuring the reliability of underwater tasks, the autonomous obstacle avoidance of AUV swarms has emerged as a critical research area. The Multi-Agent Deep Deterministic Policy Gradient algorithm (MADDPG), which employs centralized training and distributed execution, is an effective method to address this problem. However, current studies often overlook the complex dynamic underwater environment, including the interference caused by ocean currents on AUV movement and the observation noise of objects in the ocean. To adapt to the interference of ocean currents on AUV movement, we establish a two-dimensional motion model for AUVs. This model enables us to transform the output of MADDPG into propulsion force and steering gear control force, effectively accounting for ocean current disturbances. To tackle the observation noise of obstacles in underwater environment, we first assume the obstacle is moving in uniform linear motion, and then use the Kalman filtering algorithm to process the observed signals. Our simulation results demonstrate the superiority of our scheme.