Improving Tiny Vehicle Detection in Complex Scenes

2018 IEEE International Conference on Multimedia and Expo (ICME) Pub Date : 2018-07-01 DOI:10.1109/ICME.2018.8486507

W. Liu, Shengcai Liao, W. Hu, Xuezhi Liang, Yan Zhang

{"title":"Improving Tiny Vehicle Detection in Complex Scenes","authors":"W. Liu, Shengcai Liao, W. Hu, Xuezhi Liang, Yan Zhang","doi":"10.1109/ICME.2018.8486507","DOIUrl":null,"url":null,"abstract":"Vehicle detection is still a challenge in complex traffic scenes, especially for vehicles of tiny scales. Though RCNN based two-stage detectors have demonstrated considerably good performance, less attention has been paid to the quality of the first stage, where, however, tiny vehicles are very likely to be missed. In this paper, we propose a deep network for accurate vehicle detection, with the main idea of using a relatively large feature map for proposal generation, and keeping ROI feature's spatial layout to represent and detect tiny vehicles. However, large feature maps in lower levels of a deep network generally contain limited discriminant information. To address this, we introduce a backward feature enhancement operation, which absorbs higher level information step by step to enhance the base feature map. By doing so, even with only 100 proposals, the resulting proposal network achieves an encouraging recall over 99%. Furthermore, unlike a common practice which flatten features after ROI pooling, we argue that for a better detection of tiny vehicles, the spatial layout of the ROI features should be preserved and fully integrated. Accordingly, we use a multi-path light-weight processing chain to effectively integrate ROI features, while preserving the spatial layouts. Experiments done on the challenging DETRAC vehicle detection benchmark show that the proposed method largely improves a competitive baseline (ResNet50 based Faster RCNN) by 16.5% mAP, and it outperforms all previously published and unpublished results.","PeriodicalId":426613,"journal":{"name":"2018 IEEE International Conference on Multimedia and Expo (ICME)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Multimedia and Expo (ICME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICME.2018.8486507","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 15

Abstract

Vehicle detection is still a challenge in complex traffic scenes, especially for vehicles of tiny scales. Though RCNN based two-stage detectors have demonstrated considerably good performance, less attention has been paid to the quality of the first stage, where, however, tiny vehicles are very likely to be missed. In this paper, we propose a deep network for accurate vehicle detection, with the main idea of using a relatively large feature map for proposal generation, and keeping ROI feature's spatial layout to represent and detect tiny vehicles. However, large feature maps in lower levels of a deep network generally contain limited discriminant information. To address this, we introduce a backward feature enhancement operation, which absorbs higher level information step by step to enhance the base feature map. By doing so, even with only 100 proposals, the resulting proposal network achieves an encouraging recall over 99%. Furthermore, unlike a common practice which flatten features after ROI pooling, we argue that for a better detection of tiny vehicles, the spatial layout of the ROI features should be preserved and fully integrated. Accordingly, we use a multi-path light-weight processing chain to effectively integrate ROI features, while preserving the spatial layouts. Experiments done on the challenging DETRAC vehicle detection benchmark show that the proposed method largely improves a competitive baseline (ResNet50 based Faster RCNN) by 16.5% mAP, and it outperforms all previously published and unpublished results.

查看原文本刊更多论文

改进复杂场景中的微型车辆检测

在复杂的交通场景中，车辆检测仍然是一个挑战，特别是对于小型车辆。尽管基于RCNN的两级探测器已经显示出相当好的性能，但人们对第一级的质量关注较少，然而，在第一级，微型车辆很可能被遗漏。在本文中，我们提出了一种用于精确车辆检测的深度网络，其主要思想是使用相对较大的特征图进行建议生成，并保持ROI特征的空间布局来表示和检测微型车辆。然而，深度网络低层的大型特征映射通常包含有限的判别信息。为了解决这个问题，我们引入了一种反向特征增强操作，逐步吸收更高层次的信息来增强基本特征图。通过这样做，即使只有100个提案，最终的提案网络也达到了令人鼓舞的99%以上的召回率。此外，与通常在ROI池化后将特征平坦化的做法不同，我们认为为了更好地检测微型车辆，应该保留并充分整合ROI特征的空间布局。因此，我们使用多路径轻量级处理链来有效地整合ROI特征，同时保留空间布局。在具有挑战性的DETRAC车辆检测基准上进行的实验表明，所提出的方法在很大程度上提高了竞争性基准(基于ResNet50的Faster RCNN) 16.5%的mAP，并且优于所有先前发表和未发表的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE International Conference on Multimedia and Expo (ICME)

自引率

0.00%

发文量