Seyed Ali Doustdar Tousi, Javad Khorramdel, F. Lotfi, Amirhossein Nikoofard, A. Ardekani, H. Taghirad
{"title":"利用单目图像估计车辆深度的新方法","authors":"Seyed Ali Doustdar Tousi, Javad Khorramdel, F. Lotfi, Amirhossein Nikoofard, A. Ardekani, H. Taghirad","doi":"10.1109/CFIS49607.2020.9238702","DOIUrl":null,"url":null,"abstract":"Predicting scene depth from RGB images is a challenging task. Since the cameras are the most available, least restrictive and cheapest source of information for autonomous vehicles; in this work, a monocular image has been used as the only source of data to estimate the depth of the car within the frontal view. In addition to the detection of cars in the frontal image; a convolutional neural network (CNN) has been trained to detect and localize the lights corresponding to each car. This approach is less sensitive to errors due to the disposition of bounding boxes. An enhancement on the COCO dataset has also been provided by adding the car lights labels. Simulation results show that the proposed approach outperforms those who only use the height and width of bounding boxes to estimate the depth.","PeriodicalId":128323,"journal":{"name":"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A New Approach To Estimate Depth Of Cars Using A Monocular Image\",\"authors\":\"Seyed Ali Doustdar Tousi, Javad Khorramdel, F. Lotfi, Amirhossein Nikoofard, A. Ardekani, H. Taghirad\",\"doi\":\"10.1109/CFIS49607.2020.9238702\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Predicting scene depth from RGB images is a challenging task. Since the cameras are the most available, least restrictive and cheapest source of information for autonomous vehicles; in this work, a monocular image has been used as the only source of data to estimate the depth of the car within the frontal view. In addition to the detection of cars in the frontal image; a convolutional neural network (CNN) has been trained to detect and localize the lights corresponding to each car. This approach is less sensitive to errors due to the disposition of bounding boxes. An enhancement on the COCO dataset has also been provided by adding the car lights labels. Simulation results show that the proposed approach outperforms those who only use the height and width of bounding boxes to estimate the depth.\",\"PeriodicalId\":128323,\"journal\":{\"name\":\"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CFIS49607.2020.9238702\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CFIS49607.2020.9238702","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A New Approach To Estimate Depth Of Cars Using A Monocular Image
Predicting scene depth from RGB images is a challenging task. Since the cameras are the most available, least restrictive and cheapest source of information for autonomous vehicles; in this work, a monocular image has been used as the only source of data to estimate the depth of the car within the frontal view. In addition to the detection of cars in the frontal image; a convolutional neural network (CNN) has been trained to detect and localize the lights corresponding to each car. This approach is less sensitive to errors due to the disposition of bounding boxes. An enhancement on the COCO dataset has also been provided by adding the car lights labels. Simulation results show that the proposed approach outperforms those who only use the height and width of bounding boxes to estimate the depth.