ORBDeepOdometry -一种基于特征的深度学习方法的单目视觉里程测量

Karthik Sivarama Krishnan, F. Sahin
{"title":"ORBDeepOdometry -一种基于特征的深度学习方法的单目视觉里程测量","authors":"Karthik Sivarama Krishnan, F. Sahin","doi":"10.1109/SYSOSE.2019.8753848","DOIUrl":null,"url":null,"abstract":"Visual Odometry (VO) is an application of computer vision which helps in autonomous navigation of Self-Driving vehicles, Robots etc. The odometry pose of the vehicle can be estimated using the consecutive frames based on monocular or stereo camera setup mounted on the vehicle. The Simultaneous Localization and Mapping (SLAM) based approaches have shown phenomenal results in predicting the vehicle odometry. Deep Learning based approaches are emerging in the field of computer vision and are outperforming other classic algorithms. But, visual odometry is one of the field where deep learning approaches have not yet been exploited largely. This paper proposes the use of Deep Learning based pipeline approach to solve the problem of visual odometry. The paper boosts the performance of deep learning approach by extracting quite more features before training the network. This paper proposes the use of ORB based feature extractor along with Convolutional Neural Network based dimensionality reduction and stacking multiple deep LSTM's for modelling the sequential data. In this case, the KITTI vision Benchmark dataset is used to model the network. The accuracy of the network is examined by calculating the error between the predicted odometry output and the ground truth odometry output. The results are compared with different Convolutional Neural Network (CNN) architectures proposed for the same task. The average translation error from the proposed system is 11.99% and the average rotational error from the proposed system is 0.0462 degrees per meter.","PeriodicalId":133413,"journal":{"name":"2019 14th Annual Conference System of Systems Engineering (SoSE)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"ORBDeepOdometry - A Feature-Based Deep Learning Approach to Monocular Visual Odometry\",\"authors\":\"Karthik Sivarama Krishnan, F. Sahin\",\"doi\":\"10.1109/SYSOSE.2019.8753848\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Visual Odometry (VO) is an application of computer vision which helps in autonomous navigation of Self-Driving vehicles, Robots etc. The odometry pose of the vehicle can be estimated using the consecutive frames based on monocular or stereo camera setup mounted on the vehicle. The Simultaneous Localization and Mapping (SLAM) based approaches have shown phenomenal results in predicting the vehicle odometry. Deep Learning based approaches are emerging in the field of computer vision and are outperforming other classic algorithms. But, visual odometry is one of the field where deep learning approaches have not yet been exploited largely. This paper proposes the use of Deep Learning based pipeline approach to solve the problem of visual odometry. The paper boosts the performance of deep learning approach by extracting quite more features before training the network. This paper proposes the use of ORB based feature extractor along with Convolutional Neural Network based dimensionality reduction and stacking multiple deep LSTM's for modelling the sequential data. In this case, the KITTI vision Benchmark dataset is used to model the network. The accuracy of the network is examined by calculating the error between the predicted odometry output and the ground truth odometry output. The results are compared with different Convolutional Neural Network (CNN) architectures proposed for the same task. The average translation error from the proposed system is 11.99% and the average rotational error from the proposed system is 0.0462 degrees per meter.\",\"PeriodicalId\":133413,\"journal\":{\"name\":\"2019 14th Annual Conference System of Systems Engineering (SoSE)\",\"volume\":\"44 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 14th Annual Conference System of Systems Engineering (SoSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SYSOSE.2019.8753848\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 14th Annual Conference System of Systems Engineering (SoSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SYSOSE.2019.8753848","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

摘要

视觉里程计(VO)是计算机视觉的一种应用,有助于自动驾驶汽车、机器人等的自主导航。基于安装在车辆上的单目或立体摄像机的连续帧,可以估计车辆的里程计姿态。基于同时定位和映射(SLAM)的方法在预测车辆里程数方面显示出显著的效果。基于深度学习的方法正在计算机视觉领域兴起,并且优于其他经典算法。但是,视觉里程计是深度学习方法尚未得到充分利用的领域之一。本文提出使用基于深度学习的管道方法来解决视觉里程计问题。本文通过在训练网络之前提取更多的特征来提高深度学习方法的性能。本文提出利用基于ORB的特征提取器与基于卷积神经网络的降维和叠加多个深度LSTM对序列数据建模。在这种情况下,使用KITTI视觉基准数据集对网络进行建模。通过计算预测里程计输出与地真里程计输出之间的误差来检验网络的准确性。结果与针对同一任务提出的不同卷积神经网络(CNN)架构进行了比较。该系统的平均平移误差为11.99%,平均旋转误差为0.0462度/米。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
ORBDeepOdometry - A Feature-Based Deep Learning Approach to Monocular Visual Odometry
Visual Odometry (VO) is an application of computer vision which helps in autonomous navigation of Self-Driving vehicles, Robots etc. The odometry pose of the vehicle can be estimated using the consecutive frames based on monocular or stereo camera setup mounted on the vehicle. The Simultaneous Localization and Mapping (SLAM) based approaches have shown phenomenal results in predicting the vehicle odometry. Deep Learning based approaches are emerging in the field of computer vision and are outperforming other classic algorithms. But, visual odometry is one of the field where deep learning approaches have not yet been exploited largely. This paper proposes the use of Deep Learning based pipeline approach to solve the problem of visual odometry. The paper boosts the performance of deep learning approach by extracting quite more features before training the network. This paper proposes the use of ORB based feature extractor along with Convolutional Neural Network based dimensionality reduction and stacking multiple deep LSTM's for modelling the sequential data. In this case, the KITTI vision Benchmark dataset is used to model the network. The accuracy of the network is examined by calculating the error between the predicted odometry output and the ground truth odometry output. The results are compared with different Convolutional Neural Network (CNN) architectures proposed for the same task. The average translation error from the proposed system is 11.99% and the average rotational error from the proposed system is 0.0462 degrees per meter.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信