利用强化学习在未知环境中进行自主水下航行器链接排列控制

IF 4.2 2区 计算机科学 Q2 ROBOTICS
Yang Weng, Sehwa Chun, Masaki Ohashi, Takumi Matsuda, Yuki Sekimori, Joni Pajarinen, Jan Peters, Toshihiro Maki
{"title":"利用强化学习在未知环境中进行自主水下航行器链接排列控制","authors":"Yang Weng,&nbsp;Sehwa Chun,&nbsp;Masaki Ohashi,&nbsp;Takumi Matsuda,&nbsp;Yuki Sekimori,&nbsp;Joni Pajarinen,&nbsp;Jan Peters,&nbsp;Toshihiro Maki","doi":"10.1002/rob.22348","DOIUrl":null,"url":null,"abstract":"<p>High-speed underwater wireless optical communication holds immense promise in ocean monitoring and surveys, providing crucial support for the real-time sharing of observational data collected by autonomous underwater vehicles (AUVs). However, due to inaccurate target information and external interference in unknown environments, link alignment is challenging and needs to be addressed. In response to these challenges, we propose a reinforcement learning-based alignment method to control the AUV to establish an optical link and maintain alignment. Our alignment control system utilizes a combination of sensors, including a depth sensor, Doppler velocity log (DVL), gyroscope, ultra-short baseline device, and acoustic modem. These sensors are used in conjunction with a particle filter to observe the environment and estimate the AUV's state accurately. The soft actor-critic algorithm is used to train a reinforcement learning-based controller in a simulated environment to reduce pointing errors and energy consumption in alignment. After experimental validation in simulation, we deployed the controller on an actual AUV called Tri-TON. In experiments at sea, Tri-TON maintained the link and angular pointing errors within 1 m and <span></span><math>\n <semantics>\n <mrow>\n \n <mrow>\n <mn>1</mn>\n \n <msup>\n <mn>0</mn>\n \n <mo>∘</mo>\n </msup>\n </mrow>\n </mrow>\n <annotation> $1{0}^{\\circ }$</annotation>\n </semantics></math>, respectively. Experimental results demonstrate that the proposed alignment control method can establish underwater optical communication between AUV fleets, thus improving the efficiency of marine surveys.</p>","PeriodicalId":192,"journal":{"name":"Journal of Field Robotics","volume":"41 6","pages":"1724-1743"},"PeriodicalIF":4.2000,"publicationDate":"2024-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/rob.22348","citationCount":"0","resultStr":"{\"title\":\"Autonomous underwater vehicle link alignment control in unknown environments using reinforcement learning\",\"authors\":\"Yang Weng,&nbsp;Sehwa Chun,&nbsp;Masaki Ohashi,&nbsp;Takumi Matsuda,&nbsp;Yuki Sekimori,&nbsp;Joni Pajarinen,&nbsp;Jan Peters,&nbsp;Toshihiro Maki\",\"doi\":\"10.1002/rob.22348\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>High-speed underwater wireless optical communication holds immense promise in ocean monitoring and surveys, providing crucial support for the real-time sharing of observational data collected by autonomous underwater vehicles (AUVs). However, due to inaccurate target information and external interference in unknown environments, link alignment is challenging and needs to be addressed. In response to these challenges, we propose a reinforcement learning-based alignment method to control the AUV to establish an optical link and maintain alignment. Our alignment control system utilizes a combination of sensors, including a depth sensor, Doppler velocity log (DVL), gyroscope, ultra-short baseline device, and acoustic modem. These sensors are used in conjunction with a particle filter to observe the environment and estimate the AUV's state accurately. The soft actor-critic algorithm is used to train a reinforcement learning-based controller in a simulated environment to reduce pointing errors and energy consumption in alignment. After experimental validation in simulation, we deployed the controller on an actual AUV called Tri-TON. In experiments at sea, Tri-TON maintained the link and angular pointing errors within 1 m and <span></span><math>\\n <semantics>\\n <mrow>\\n \\n <mrow>\\n <mn>1</mn>\\n \\n <msup>\\n <mn>0</mn>\\n \\n <mo>∘</mo>\\n </msup>\\n </mrow>\\n </mrow>\\n <annotation> $1{0}^{\\\\circ }$</annotation>\\n </semantics></math>, respectively. Experimental results demonstrate that the proposed alignment control method can establish underwater optical communication between AUV fleets, thus improving the efficiency of marine surveys.</p>\",\"PeriodicalId\":192,\"journal\":{\"name\":\"Journal of Field Robotics\",\"volume\":\"41 6\",\"pages\":\"1724-1743\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2024-04-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/rob.22348\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Field Robotics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/rob.22348\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ROBOTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Field Robotics","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/rob.22348","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}
引用次数: 0

摘要

高速水下无线光通信在海洋监测和勘测方面前景广阔,为实时共享自主潜水器(AUV)收集的观测数据提供了重要支持。然而,在未知环境中,由于目标信息不准确和外部干扰,链路对准具有挑战性,亟待解决。针对这些挑战,我们提出了一种基于强化学习的对准方法,以控制自动潜航器建立光链路并保持对准。我们的对准控制系统综合利用了多种传感器,包括深度传感器、多普勒速度记录仪(DVL)、陀螺仪、超短基线装置和声学调制解调器。这些传感器与粒子滤波器结合使用,可观测环境并准确估计 AUV 的状态。软演员批评算法用于在模拟环境中训练基于强化学习的控制器,以减少对准过程中的指向误差和能耗。经过模拟实验验证后,我们在名为 Tri-TON 的实际 AUV 上部署了控制器。在海上实验中,Tri-TON 的链路和角度指向误差分别保持在 1 米和 ,以内。实验结果表明,所提出的对准控制方法可以在 AUV 船队之间建立水下光通信,从而提高海洋勘测的效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Autonomous underwater vehicle link alignment control in unknown environments using reinforcement learning

Autonomous underwater vehicle link alignment control in unknown environments using reinforcement learning

High-speed underwater wireless optical communication holds immense promise in ocean monitoring and surveys, providing crucial support for the real-time sharing of observational data collected by autonomous underwater vehicles (AUVs). However, due to inaccurate target information and external interference in unknown environments, link alignment is challenging and needs to be addressed. In response to these challenges, we propose a reinforcement learning-based alignment method to control the AUV to establish an optical link and maintain alignment. Our alignment control system utilizes a combination of sensors, including a depth sensor, Doppler velocity log (DVL), gyroscope, ultra-short baseline device, and acoustic modem. These sensors are used in conjunction with a particle filter to observe the environment and estimate the AUV's state accurately. The soft actor-critic algorithm is used to train a reinforcement learning-based controller in a simulated environment to reduce pointing errors and energy consumption in alignment. After experimental validation in simulation, we deployed the controller on an actual AUV called Tri-TON. In experiments at sea, Tri-TON maintained the link and angular pointing errors within 1 m and 1 0 $1{0}^{\circ }$ , respectively. Experimental results demonstrate that the proposed alignment control method can establish underwater optical communication between AUV fleets, thus improving the efficiency of marine surveys.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Field Robotics
Journal of Field Robotics 工程技术-机器人学
CiteScore
15.00
自引率
3.60%
发文量
80
审稿时长
6 months
期刊介绍: The Journal of Field Robotics seeks to promote scholarly publications dealing with the fundamentals of robotics in unstructured and dynamic environments. The Journal focuses on experimental robotics and encourages publication of work that has both theoretical and practical significance.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信