Xinhua Liu, Songyao Zhou, Hailan Kuang, Xiaolin Ma
{"title":"Local Adjustment Block Net for Object Detection","authors":"Xinhua Liu, Songyao Zhou, Hailan Kuang, Xiaolin Ma","doi":"10.1109/icmcce51767.2020.00363","DOIUrl":null,"url":null,"abstract":"Recently, object detectors based on Receptive Fields (RFs) in human visual systems have stronger feature extraction capabilities and have achieved great detection performance, such as Inception, ASPP and RFBNet. However, while having capabilities to extract more contextual information, these detectors also capture redundant information, which will reduce the precision of detection. In this paper, we propose a novel and lightweight block based on spatial attention mechanism to solve this problem effectively. Compared with RFB, it can better capture effective contextual information in the feature map and suppress redundant information. Moreover, we propose a local enhancement strategy, which can sparsely locate regions that contains rich feature information and enhance them locally. Experimental results show that our proposed method gains 0.6% mAP improvement on the PSACAL VOC dataset and 0.5% mAP improvement on the COC02019 test-dev set.","PeriodicalId":6712,"journal":{"name":"2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE)","volume":"37 1","pages":"1655-1658"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icmcce51767.2020.00363","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Recently, object detectors based on Receptive Fields (RFs) in human visual systems have stronger feature extraction capabilities and have achieved great detection performance, such as Inception, ASPP and RFBNet. However, while having capabilities to extract more contextual information, these detectors also capture redundant information, which will reduce the precision of detection. In this paper, we propose a novel and lightweight block based on spatial attention mechanism to solve this problem effectively. Compared with RFB, it can better capture effective contextual information in the feature map and suppress redundant information. Moreover, we propose a local enhancement strategy, which can sparsely locate regions that contains rich feature information and enhance them locally. Experimental results show that our proposed method gains 0.6% mAP improvement on the PSACAL VOC dataset and 0.5% mAP improvement on the COC02019 test-dev set.