利用单目图像估计车辆深度的新方法

2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS) Pub Date : 2020-09-01 DOI:10.1109/CFIS49607.2020.9238702

Seyed Ali Doustdar Tousi, Javad Khorramdel, F. Lotfi, Amirhossein Nikoofard, A. Ardekani, H. Taghirad

{"title":"利用单目图像估计车辆深度的新方法","authors":"Seyed Ali Doustdar Tousi, Javad Khorramdel, F. Lotfi, Amirhossein Nikoofard, A. Ardekani, H. Taghirad","doi":"10.1109/CFIS49607.2020.9238702","DOIUrl":null,"url":null,"abstract":"Predicting scene depth from RGB images is a challenging task. Since the cameras are the most available, least restrictive and cheapest source of information for autonomous vehicles; in this work, a monocular image has been used as the only source of data to estimate the depth of the car within the frontal view. In addition to the detection of cars in the frontal image; a convolutional neural network (CNN) has been trained to detect and localize the lights corresponding to each car. This approach is less sensitive to errors due to the disposition of bounding boxes. An enhancement on the COCO dataset has also been provided by adding the car lights labels. Simulation results show that the proposed approach outperforms those who only use the height and width of bounding boxes to estimate the depth.","PeriodicalId":128323,"journal":{"name":"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A New Approach To Estimate Depth Of Cars Using A Monocular Image\",\"authors\":\"Seyed Ali Doustdar Tousi, Javad Khorramdel, F. Lotfi, Amirhossein Nikoofard, A. Ardekani, H. Taghirad\",\"doi\":\"10.1109/CFIS49607.2020.9238702\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Predicting scene depth from RGB images is a challenging task. Since the cameras are the most available, least restrictive and cheapest source of information for autonomous vehicles; in this work, a monocular image has been used as the only source of data to estimate the depth of the car within the frontal view. In addition to the detection of cars in the frontal image; a convolutional neural network (CNN) has been trained to detect and localize the lights corresponding to each car. This approach is less sensitive to errors due to the disposition of bounding boxes. An enhancement on the COCO dataset has also been provided by adding the car lights labels. Simulation results show that the proposed approach outperforms those who only use the height and width of bounding boxes to estimate the depth.\",\"PeriodicalId\":128323,\"journal\":{\"name\":\"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CFIS49607.2020.9238702\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CFIS49607.2020.9238702","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

从RGB图像中预测场景深度是一项具有挑战性的任务。由于摄像头是自动驾驶汽车最容易获得、限制最少、成本最低的信息来源;在这项工作中，单眼图像被用作估计正面视图内汽车深度的唯一数据来源。除检测汽车正面图像外;卷积神经网络(CNN)被训练来检测和定位每辆车对应的灯光。由于边界框的配置，这种方法对错误的敏感性较低。通过添加车灯标签，还对COCO数据集进行了增强。仿真结果表明，该方法优于仅使用边界框的高度和宽度来估计深度的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A New Approach To Estimate Depth Of Cars Using A Monocular Image

Predicting scene depth from RGB images is a challenging task. Since the cameras are the most available, least restrictive and cheapest source of information for autonomous vehicles; in this work, a monocular image has been used as the only source of data to estimate the depth of the car within the frontal view. In addition to the detection of cars in the frontal image; a convolutional neural network (CNN) has been trained to detect and localize the lights corresponding to each car. This approach is less sensitive to errors due to the disposition of bounding boxes. An enhancement on the COCO dataset has also been provided by adding the car lights labels. Simulation results show that the proposed approach outperforms those who only use the height and width of bounding boxes to estimate the depth.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)

自引率

0.00%

发文量