Simone Magistri, Francesco Sambo, Fabio Schoen, Douglas Coimbra de Andrade, Matteo Simoncini, Stefano Caprasecca, Luca Kubin, L. Bravi, L. Taccari
{"title":"A Lightweight Deep Learning Model for Vehicle Viewpoint Estimation from Dashcam Images","authors":"Simone Magistri, Francesco Sambo, Fabio Schoen, Douglas Coimbra de Andrade, Matteo Simoncini, Stefano Caprasecca, Luca Kubin, L. Bravi, L. Taccari","doi":"10.1109/ITSC45102.2020.9294672","DOIUrl":null,"url":null,"abstract":"Vehicle viewpoint estimation from vehicle cameras is a crucial component of road scene understanding.In this paper, we propose a deep lightweight method to predict vehicle viewpoint from a single RGB dashcam image. To this aim, we customize and adapt state-of-the-art deep learning techniques for general object viewpoint estimation to the vehicle viewpoint estimation task. Furthermore, we define a novel objective function that takes into account errors at different granularity to improve neural network training. To keep the model lightweight and fast, we rely upon MobileNetV2 as backbone.Tested both on benchmark viewpoint estimation data (Pascal3D+) and on actual vehicle camera data (nuScenes), our method is shown to outperform the state of the art in vehicle viewpoint estimation, in terms of both accuracy and memory footprint.","PeriodicalId":394538,"journal":{"name":"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITSC45102.2020.9294672","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Vehicle viewpoint estimation from vehicle cameras is a crucial component of road scene understanding.In this paper, we propose a deep lightweight method to predict vehicle viewpoint from a single RGB dashcam image. To this aim, we customize and adapt state-of-the-art deep learning techniques for general object viewpoint estimation to the vehicle viewpoint estimation task. Furthermore, we define a novel objective function that takes into account errors at different granularity to improve neural network training. To keep the model lightweight and fast, we rely upon MobileNetV2 as backbone.Tested both on benchmark viewpoint estimation data (Pascal3D+) and on actual vehicle camera data (nuScenes), our method is shown to outperform the state of the art in vehicle viewpoint estimation, in terms of both accuracy and memory footprint.