Shulin Li, W. Zhang, Guorong Li, Li Su, Qingming Huang
{"title":"Vehicle Detection in UAV Traffic Video Based on Convolution Neural Network","authors":"Shulin Li, W. Zhang, Guorong Li, Li Su, Qingming Huang","doi":"10.1109/MIPR.2018.00009","DOIUrl":null,"url":null,"abstract":"Vehicle detection technology is a key component of an intelligent transportation system, but most of the current vehicle detection technologies are based on road monitoring cameras. Compared with these fixed cameras, Unmanned Aerial Vehicles (UAVs) seem to have a lot of advantages such as more flexible, broader vision, higher speed, which make the vehicle detection more challenging. In this paper, a new dataset built on UAV traffic videos and a neural network which could fuse multi-layer features are proposed. Different from some networks with only a single layer, the proposed network merges the features from multiple layers firstly. Then a convolution layer is used to reduce the feature dimensions and a deconvolution layer is employed to do upsampling and enhance the response information. Finally, multiple fully connected layers are used to finish the detection. Furthermore, the proposed method combines the detecting and tracking for optimization and high detection speed. Experiments on the self-built UAV traffic video dataset demonstrate that the proposed method gets better results and higher speed.","PeriodicalId":320000,"journal":{"name":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MIPR.2018.00009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
Vehicle detection technology is a key component of an intelligent transportation system, but most of the current vehicle detection technologies are based on road monitoring cameras. Compared with these fixed cameras, Unmanned Aerial Vehicles (UAVs) seem to have a lot of advantages such as more flexible, broader vision, higher speed, which make the vehicle detection more challenging. In this paper, a new dataset built on UAV traffic videos and a neural network which could fuse multi-layer features are proposed. Different from some networks with only a single layer, the proposed network merges the features from multiple layers firstly. Then a convolution layer is used to reduce the feature dimensions and a deconvolution layer is employed to do upsampling and enhance the response information. Finally, multiple fully connected layers are used to finish the detection. Furthermore, the proposed method combines the detecting and tracking for optimization and high detection speed. Experiments on the self-built UAV traffic video dataset demonstrate that the proposed method gets better results and higher speed.