Xinhua Liu, Songyao Zhou, Hailan Kuang, Xiaolin Ma
{"title":"局部调整块网用于目标检测","authors":"Xinhua Liu, Songyao Zhou, Hailan Kuang, Xiaolin Ma","doi":"10.1109/icmcce51767.2020.00363","DOIUrl":null,"url":null,"abstract":"Recently, object detectors based on Receptive Fields (RFs) in human visual systems have stronger feature extraction capabilities and have achieved great detection performance, such as Inception, ASPP and RFBNet. However, while having capabilities to extract more contextual information, these detectors also capture redundant information, which will reduce the precision of detection. In this paper, we propose a novel and lightweight block based on spatial attention mechanism to solve this problem effectively. Compared with RFB, it can better capture effective contextual information in the feature map and suppress redundant information. Moreover, we propose a local enhancement strategy, which can sparsely locate regions that contains rich feature information and enhance them locally. Experimental results show that our proposed method gains 0.6% mAP improvement on the PSACAL VOC dataset and 0.5% mAP improvement on the COC02019 test-dev set.","PeriodicalId":6712,"journal":{"name":"2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE)","volume":"37 1","pages":"1655-1658"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Local Adjustment Block Net for Object Detection\",\"authors\":\"Xinhua Liu, Songyao Zhou, Hailan Kuang, Xiaolin Ma\",\"doi\":\"10.1109/icmcce51767.2020.00363\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, object detectors based on Receptive Fields (RFs) in human visual systems have stronger feature extraction capabilities and have achieved great detection performance, such as Inception, ASPP and RFBNet. However, while having capabilities to extract more contextual information, these detectors also capture redundant information, which will reduce the precision of detection. In this paper, we propose a novel and lightweight block based on spatial attention mechanism to solve this problem effectively. Compared with RFB, it can better capture effective contextual information in the feature map and suppress redundant information. Moreover, we propose a local enhancement strategy, which can sparsely locate regions that contains rich feature information and enhance them locally. Experimental results show that our proposed method gains 0.6% mAP improvement on the PSACAL VOC dataset and 0.5% mAP improvement on the COC02019 test-dev set.\",\"PeriodicalId\":6712,\"journal\":{\"name\":\"2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE)\",\"volume\":\"37 1\",\"pages\":\"1655-1658\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/icmcce51767.2020.00363\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icmcce51767.2020.00363","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Recently, object detectors based on Receptive Fields (RFs) in human visual systems have stronger feature extraction capabilities and have achieved great detection performance, such as Inception, ASPP and RFBNet. However, while having capabilities to extract more contextual information, these detectors also capture redundant information, which will reduce the precision of detection. In this paper, we propose a novel and lightweight block based on spatial attention mechanism to solve this problem effectively. Compared with RFB, it can better capture effective contextual information in the feature map and suppress redundant information. Moreover, we propose a local enhancement strategy, which can sparsely locate regions that contains rich feature information and enhance them locally. Experimental results show that our proposed method gains 0.6% mAP improvement on the PSACAL VOC dataset and 0.5% mAP improvement on the COC02019 test-dev set.