{"title":"DAVO:A Monocular Visual Odometry Method Based on Dual Attention","authors":"Jiahao Li, Bin Zheng","doi":"10.1145/3579654.3579704","DOIUrl":null,"url":null,"abstract":"In recent years, Visual odometry(VO) has been widely used in fields such as autonomous driving and virtual reality. Traditional methods for solving visual odometry rely on complex processes such as feature extraction, feature matching and camera calibration, and have low robustness and serious accuracy deficiency problems in challenging environments. In this paper, we propose a dual attention monocular visual odometry model that integrates Deep Learning(DL) with Reinforcement Learning(RL), named DAVO (Dual Attention Visual Odometry). The model combines a recurrent attention network model with a self-attentive mechanism to solve the relative poses of six degrees of freedom(6-DoF) by learning the image region locations that are favorable for the model pose estimation through a reinforcement learning algorithm. Finally, the model is evaluated and compared on the publicly available dataset KITTI. Compared with other mainstream models, DAVO only inputs 14.04% of the data in the image preprocessing stage, runs faster and outperforms most of the mainstream models.","PeriodicalId":146783,"journal":{"name":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3579654.3579704","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, Visual odometry(VO) has been widely used in fields such as autonomous driving and virtual reality. Traditional methods for solving visual odometry rely on complex processes such as feature extraction, feature matching and camera calibration, and have low robustness and serious accuracy deficiency problems in challenging environments. In this paper, we propose a dual attention monocular visual odometry model that integrates Deep Learning(DL) with Reinforcement Learning(RL), named DAVO (Dual Attention Visual Odometry). The model combines a recurrent attention network model with a self-attentive mechanism to solve the relative poses of six degrees of freedom(6-DoF) by learning the image region locations that are favorable for the model pose estimation through a reinforcement learning algorithm. Finally, the model is evaluated and compared on the publicly available dataset KITTI. Compared with other mainstream models, DAVO only inputs 14.04% of the data in the image preprocessing stage, runs faster and outperforms most of the mainstream models.