DAVO:A Monocular Visual Odometry Method Based on Dual Attention

Jiahao Li, Bin Zheng
{"title":"DAVO:A Monocular Visual Odometry Method Based on Dual Attention","authors":"Jiahao Li, Bin Zheng","doi":"10.1145/3579654.3579704","DOIUrl":null,"url":null,"abstract":"In recent years, Visual odometry(VO) has been widely used in fields such as autonomous driving and virtual reality. Traditional methods for solving visual odometry rely on complex processes such as feature extraction, feature matching and camera calibration, and have low robustness and serious accuracy deficiency problems in challenging environments. In this paper, we propose a dual attention monocular visual odometry model that integrates Deep Learning(DL) with Reinforcement Learning(RL), named DAVO (Dual Attention Visual Odometry). The model combines a recurrent attention network model with a self-attentive mechanism to solve the relative poses of six degrees of freedom(6-DoF) by learning the image region locations that are favorable for the model pose estimation through a reinforcement learning algorithm. Finally, the model is evaluated and compared on the publicly available dataset KITTI. Compared with other mainstream models, DAVO only inputs 14.04% of the data in the image preprocessing stage, runs faster and outperforms most of the mainstream models.","PeriodicalId":146783,"journal":{"name":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3579654.3579704","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In recent years, Visual odometry(VO) has been widely used in fields such as autonomous driving and virtual reality. Traditional methods for solving visual odometry rely on complex processes such as feature extraction, feature matching and camera calibration, and have low robustness and serious accuracy deficiency problems in challenging environments. In this paper, we propose a dual attention monocular visual odometry model that integrates Deep Learning(DL) with Reinforcement Learning(RL), named DAVO (Dual Attention Visual Odometry). The model combines a recurrent attention network model with a self-attentive mechanism to solve the relative poses of six degrees of freedom(6-DoF) by learning the image region locations that are favorable for the model pose estimation through a reinforcement learning algorithm. Finally, the model is evaluated and compared on the publicly available dataset KITTI. Compared with other mainstream models, DAVO only inputs 14.04% of the data in the image preprocessing stage, runs faster and outperforms most of the mainstream models.
DAVO:一种基于双注意的单目视觉里程计方法
近年来,视觉里程计技术在自动驾驶、虚拟现实等领域得到了广泛的应用。传统的视觉里程测量方法依赖于特征提取、特征匹配和摄像机标定等复杂过程,在具有挑战性的环境中存在鲁棒性低和精度严重不足的问题。在本文中,我们提出了一种将深度学习(DL)与强化学习(RL)相结合的双注意单目视觉里程计模型,命名为DAVO (dual attention visual odometry)。该模型将循环注意网络模型与自注意机制相结合,通过强化学习算法学习有利于模型姿态估计的图像区域位置,求解六自由度(6-DoF)的相对姿态。最后,在公开可用的数据集KITTI上对模型进行评估和比较。与其他主流模型相比,DAVO在图像预处理阶段只输入14.04%的数据,运行速度更快,性能优于大多数主流模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信