Object Detection Based on Binocular Vision with Convolutional Neural Network

Zekun Luo, Xia Wu, Qingquan Zou, Xiao Xiao
{"title":"Object Detection Based on Binocular Vision with Convolutional Neural Network","authors":"Zekun Luo, Xia Wu, Qingquan Zou, Xiao Xiao","doi":"10.1145/3297067.3297081","DOIUrl":null,"url":null,"abstract":"Autonomous vehicles are widely accepted as one of the most potential technologies in alleviating traffic problems. In most existing autonomous vehicles for object detection and distance measurement, compared with radar or LIDAR which obviously increases the cost, camera combined with Convolutional Neural Network (CNN) has advantage in accuracy and low cost. However, most object detection methods applied on camera cannot perform distance measurement. In this paper, we simultaneously carry out real-time object detection and distance measurement (DDM) in one system by utilizing CNN on a binocular camera. Firstly, a binocular camera is used to acquire disparity maps. Secondly, a set of high-quality region proposals is generated by those disparity maps and the number of region proposals is reduced. Thirdly, CNN is utilized to classify those region proposals and get the bounding box of detected objects. Consequently, those reduced region proposals generated by disparity maps lead to improved computational efficiency. Finally, the object distance is measured by the disparity map and the bounding box. The experiment results show that the proposed method can achieve an accuracy of 87.2% on KITTI dataset and an accuracy of 68% in the real environment for object detection. The average relative error of the distance measurement is 0.85% within 10 meters in real environment. The operation time of the whole DDM system is less than 80 ms.","PeriodicalId":340004,"journal":{"name":"International Conference on Signal Processing and Machine Learning","volume":"67 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Signal Processing and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3297067.3297081","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Autonomous vehicles are widely accepted as one of the most potential technologies in alleviating traffic problems. In most existing autonomous vehicles for object detection and distance measurement, compared with radar or LIDAR which obviously increases the cost, camera combined with Convolutional Neural Network (CNN) has advantage in accuracy and low cost. However, most object detection methods applied on camera cannot perform distance measurement. In this paper, we simultaneously carry out real-time object detection and distance measurement (DDM) in one system by utilizing CNN on a binocular camera. Firstly, a binocular camera is used to acquire disparity maps. Secondly, a set of high-quality region proposals is generated by those disparity maps and the number of region proposals is reduced. Thirdly, CNN is utilized to classify those region proposals and get the bounding box of detected objects. Consequently, those reduced region proposals generated by disparity maps lead to improved computational efficiency. Finally, the object distance is measured by the disparity map and the bounding box. The experiment results show that the proposed method can achieve an accuracy of 87.2% on KITTI dataset and an accuracy of 68% in the real environment for object detection. The average relative error of the distance measurement is 0.85% within 10 meters in real environment. The operation time of the whole DDM system is less than 80 ms.
基于卷积神经网络的双目视觉目标检测
自动驾驶汽车被广泛认为是缓解交通问题的最具潜力的技术之一。在现有的大多数自动驾驶汽车的目标检测和距离测量中,相对于雷达或激光雷达的成本明显增加,摄像头结合卷积神经网络(CNN)具有精度高、成本低的优势。然而,大多数应用于相机的目标检测方法都不能进行距离测量。在本文中,我们利用双目摄像机上的CNN在一个系统中同时进行实时目标检测和距离测量(DDM)。首先,利用双目摄像机获取视差图;其次,利用视差图生成一组高质量的区域建议,并减少区域建议的数量;第三,利用CNN对这些区域建议进行分类,得到检测目标的边界框。因此,视差图生成的减少区域建议提高了计算效率。最后,通过视差图和边界框测量目标距离。实验结果表明,该方法在KITTI数据集上的准确率为87.2%,在真实环境下的目标检测准确率为68%。在实际环境中,距离测量的平均相对误差为0.85%,误差范围为10米。整个DDM系统的运行时间小于80ms。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信