{"title":"Energy-Efficient Reinforcement Learning for Motion Planning of AUV","authors":"Jiayi Wen, Jingwei Zhu, Yejin Lin, Gui-chen Zhang","doi":"10.1109/USYS56283.2022.10073111","DOIUrl":null,"url":null,"abstract":"The accuracy of mapping results depends on the Autonomous Underwater Vehicles (AUVs) navigation errors. However, signals are attenuated drastically in water, which makes it difficult for AUVs to receive signals underwater. As a result, traditional navigation methods may become unreliable. In this paper, a terrain-aided navigation method that takes into account distance and energy consumption is proposed, which does not rely on a precise positioning system. Given the complexity of 3D terrain, this paper formulates the problem as a Markov decision process (MDP) and aims to minimise the energy cost function, where a motion planning method based on soft actor-critic (SAC) is formulated. Then a 3D energy consumption calculation method is developed for the AUV which is accurate for each time slot during the training process. Finally, experiments on the Gym platform were carried out to verify the effectiveness of the proposed method.","PeriodicalId":434350,"journal":{"name":"2022 IEEE 9th International Conference on Underwater System Technology: Theory and Applications (USYS)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 9th International Conference on Underwater System Technology: Theory and Applications (USYS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/USYS56283.2022.10073111","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The accuracy of mapping results depends on the Autonomous Underwater Vehicles (AUVs) navigation errors. However, signals are attenuated drastically in water, which makes it difficult for AUVs to receive signals underwater. As a result, traditional navigation methods may become unreliable. In this paper, a terrain-aided navigation method that takes into account distance and energy consumption is proposed, which does not rely on a precise positioning system. Given the complexity of 3D terrain, this paper formulates the problem as a Markov decision process (MDP) and aims to minimise the energy cost function, where a motion planning method based on soft actor-critic (SAC) is formulated. Then a 3D energy consumption calculation method is developed for the AUV which is accurate for each time slot during the training process. Finally, experiments on the Gym platform were carried out to verify the effectiveness of the proposed method.