DAVO:A Monocular Visual Odometry Method Based on Dual Attention

Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence Pub Date : 2022-12-23 DOI:10.1145/3579654.3579704

Jiahao Li, Bin Zheng

{"title":"DAVO:A Monocular Visual Odometry Method Based on Dual Attention","authors":"Jiahao Li, Bin Zheng","doi":"10.1145/3579654.3579704","DOIUrl":null,"url":null,"abstract":"In recent years, Visual odometry(VO) has been widely used in fields such as autonomous driving and virtual reality. Traditional methods for solving visual odometry rely on complex processes such as feature extraction, feature matching and camera calibration, and have low robustness and serious accuracy deficiency problems in challenging environments. In this paper, we propose a dual attention monocular visual odometry model that integrates Deep Learning(DL) with Reinforcement Learning(RL), named DAVO (Dual Attention Visual Odometry). The model combines a recurrent attention network model with a self-attentive mechanism to solve the relative poses of six degrees of freedom(6-DoF) by learning the image region locations that are favorable for the model pose estimation through a reinforcement learning algorithm. Finally, the model is evaluated and compared on the publicly available dataset KITTI. Compared with other mainstream models, DAVO only inputs 14.04% of the data in the image preprocessing stage, runs faster and outperforms most of the mainstream models.","PeriodicalId":146783,"journal":{"name":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3579654.3579704","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In recent years, Visual odometry(VO) has been widely used in fields such as autonomous driving and virtual reality. Traditional methods for solving visual odometry rely on complex processes such as feature extraction, feature matching and camera calibration, and have low robustness and serious accuracy deficiency problems in challenging environments. In this paper, we propose a dual attention monocular visual odometry model that integrates Deep Learning(DL) with Reinforcement Learning(RL), named DAVO (Dual Attention Visual Odometry). The model combines a recurrent attention network model with a self-attentive mechanism to solve the relative poses of six degrees of freedom(6-DoF) by learning the image region locations that are favorable for the model pose estimation through a reinforcement learning algorithm. Finally, the model is evaluated and compared on the publicly available dataset KITTI. Compared with other mainstream models, DAVO only inputs 14.04% of the data in the image preprocessing stage, runs faster and outperforms most of the mainstream models.

查看原文本刊更多论文

DAVO:一种基于双注意的单目视觉里程计方法

近年来，视觉里程计技术在自动驾驶、虚拟现实等领域得到了广泛的应用。传统的视觉里程测量方法依赖于特征提取、特征匹配和摄像机标定等复杂过程，在具有挑战性的环境中存在鲁棒性低和精度严重不足的问题。在本文中，我们提出了一种将深度学习(DL)与强化学习(RL)相结合的双注意单目视觉里程计模型，命名为DAVO (dual attention visual odometry)。该模型将循环注意网络模型与自注意机制相结合，通过强化学习算法学习有利于模型姿态估计的图像区域位置，求解六自由度(6-DoF)的相对姿态。最后，在公开可用的数据集KITTI上对模型进行评估和比较。与其他主流模型相比，DAVO在图像预处理阶段只输入14.04%的数据，运行速度更快，性能优于大多数主流模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence

自引率

0.00%

发文量