Ming Li, Ke Yang, J. Qin, J. Zhong, Zipeng Jiang, Qin Su
{"title":"基于视觉的无人潜航器实时姿态估计的比较研究","authors":"Ming Li, Ke Yang, J. Qin, J. Zhong, Zipeng Jiang, Qin Su","doi":"10.12688/cobot.17642.1","DOIUrl":null,"url":null,"abstract":"Background: Navigation and localization are key to the successful execution of autonomous unmanned underwater vehicles (UUVs) in marine environmental monitoring, underwater 3D mapping, and ocean resource surveys. The estimation of the position and the orientation of autonomous UUVs are a long-standing challenging and fundamental problem. As one of the underwater sensors, camera has always been the focus of attention due to its advantages of low cost and rich content information in visibility waters, especially in the fields of visual perception of the underwater environment, target recognition and tracking. At present, the visual real-time pose estimation technology that can be used for UUVs is mainly divided into geometry-based visual positioning algorithms and deep learning-based visual positioning algorithms. Methods: In order to compare the performance of different positioning algorithms and strategies, this paper uses C++ and python, takes the ORB-SLAM3 algorithm and DF-VO algorithm as representatives to conduct a comparative experiment and analysis. Results: The geometry-based algorithm ORB-SLAM3 is less affected by illumination, performs more stably in different underwater environments, and has a shorter calculation time, but its robustness is poor in complex environments. The visual positioning algorithm DF-VO based on deep learning takes longer time to compute, and the positioning accuracy is more easily affected by illumination, especially in dark conditions. However, its robustness is better in unstructured environments such as large-scale image rotation and dynamic object interference. Conclusions: In general, the deep learning-based algorithm is more robust, but multiple deep learning networks make it need more time to compute. The geometry-based method costs less time and is more accurate in low-light and turbid underwater conditions. However, in real underwater situations, these two methods can be connected as binocular vision or methods of multi-sensor combined pose estimation.","PeriodicalId":29807,"journal":{"name":"Cobot","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparative study on real-time pose estimation of vision-based unmanned underwater vehicles\",\"authors\":\"Ming Li, Ke Yang, J. Qin, J. Zhong, Zipeng Jiang, Qin Su\",\"doi\":\"10.12688/cobot.17642.1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Navigation and localization are key to the successful execution of autonomous unmanned underwater vehicles (UUVs) in marine environmental monitoring, underwater 3D mapping, and ocean resource surveys. The estimation of the position and the orientation of autonomous UUVs are a long-standing challenging and fundamental problem. As one of the underwater sensors, camera has always been the focus of attention due to its advantages of low cost and rich content information in visibility waters, especially in the fields of visual perception of the underwater environment, target recognition and tracking. At present, the visual real-time pose estimation technology that can be used for UUVs is mainly divided into geometry-based visual positioning algorithms and deep learning-based visual positioning algorithms. Methods: In order to compare the performance of different positioning algorithms and strategies, this paper uses C++ and python, takes the ORB-SLAM3 algorithm and DF-VO algorithm as representatives to conduct a comparative experiment and analysis. Results: The geometry-based algorithm ORB-SLAM3 is less affected by illumination, performs more stably in different underwater environments, and has a shorter calculation time, but its robustness is poor in complex environments. The visual positioning algorithm DF-VO based on deep learning takes longer time to compute, and the positioning accuracy is more easily affected by illumination, especially in dark conditions. However, its robustness is better in unstructured environments such as large-scale image rotation and dynamic object interference. Conclusions: In general, the deep learning-based algorithm is more robust, but multiple deep learning networks make it need more time to compute. The geometry-based method costs less time and is more accurate in low-light and turbid underwater conditions. However, in real underwater situations, these two methods can be connected as binocular vision or methods of multi-sensor combined pose estimation.\",\"PeriodicalId\":29807,\"journal\":{\"name\":\"Cobot\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cobot\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12688/cobot.17642.1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cobot","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12688/cobot.17642.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Comparative study on real-time pose estimation of vision-based unmanned underwater vehicles
Background: Navigation and localization are key to the successful execution of autonomous unmanned underwater vehicles (UUVs) in marine environmental monitoring, underwater 3D mapping, and ocean resource surveys. The estimation of the position and the orientation of autonomous UUVs are a long-standing challenging and fundamental problem. As one of the underwater sensors, camera has always been the focus of attention due to its advantages of low cost and rich content information in visibility waters, especially in the fields of visual perception of the underwater environment, target recognition and tracking. At present, the visual real-time pose estimation technology that can be used for UUVs is mainly divided into geometry-based visual positioning algorithms and deep learning-based visual positioning algorithms. Methods: In order to compare the performance of different positioning algorithms and strategies, this paper uses C++ and python, takes the ORB-SLAM3 algorithm and DF-VO algorithm as representatives to conduct a comparative experiment and analysis. Results: The geometry-based algorithm ORB-SLAM3 is less affected by illumination, performs more stably in different underwater environments, and has a shorter calculation time, but its robustness is poor in complex environments. The visual positioning algorithm DF-VO based on deep learning takes longer time to compute, and the positioning accuracy is more easily affected by illumination, especially in dark conditions. However, its robustness is better in unstructured environments such as large-scale image rotation and dynamic object interference. Conclusions: In general, the deep learning-based algorithm is more robust, but multiple deep learning networks make it need more time to compute. The geometry-based method costs less time and is more accurate in low-light and turbid underwater conditions. However, in real underwater situations, these two methods can be connected as binocular vision or methods of multi-sensor combined pose estimation.
期刊介绍:
Cobot is a rapid multidisciplinary open access publishing platform for research focused on the interdisciplinary field of collaborative robots. The aim of Cobot is to enhance knowledge and share the results of the latest innovative technologies for the technicians, researchers and experts engaged in collaborative robot research. The platform will welcome submissions in all areas of scientific and technical research related to collaborative robots, and all articles will benefit from open peer review.
The scope of Cobot includes, but is not limited to:
● Intelligent robots
● Artificial intelligence
● Human-machine collaboration and integration
● Machine vision
● Intelligent sensing
● Smart materials
● Design, development and testing of collaborative robots
● Software for cobots
● Industrial applications of cobots
● Service applications of cobots
● Medical and health applications of cobots
● Educational applications of cobots
As well as research articles and case studies, Cobot accepts a variety of article types including method articles, study protocols, software tools, systematic reviews, data notes, brief reports, and opinion articles.