Fudong Li, Dongyang Gao, Yuequan Yang, Zhiqiang Cao, Wei Wang
{"title":"ReCUS:用于目标检测的卷积和上采样网络","authors":"Fudong Li, Dongyang Gao, Yuequan Yang, Zhiqiang Cao, Wei Wang","doi":"10.1109/CCIS53392.2021.9754606","DOIUrl":null,"url":null,"abstract":"Most of the mainstream object detection models such as RetinaNet, SSD, YOLO, and Faster RCNN hardly achieve a good balance between detection accuracy and speed. A major reason is rich deep feature semantic information of images is not fully exploited. To solve this problem, a novel deep convolutional network structure termed as reconvolution and upsampling network (ReCUS) is proposed. In the ReCUS, a modified path aggregation network(mPAN) is added after the backbone, which is beneficial to strengthen the foreground salient feature information and weaken background information. Moreover, two new spatial pyramid pooling (SPP) modules are embedded before output heads for multi-scale fusion of local and global features. The experiments show that the effectiveness of our proposed ReCUS. Furthermore, the better detectability of the ReCUS network is demonstrated for both small scale objects and large scale objects.","PeriodicalId":191226,"journal":{"name":"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ReCUS: Reconvolution and Upsampling Network for Object Detection\",\"authors\":\"Fudong Li, Dongyang Gao, Yuequan Yang, Zhiqiang Cao, Wei Wang\",\"doi\":\"10.1109/CCIS53392.2021.9754606\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Most of the mainstream object detection models such as RetinaNet, SSD, YOLO, and Faster RCNN hardly achieve a good balance between detection accuracy and speed. A major reason is rich deep feature semantic information of images is not fully exploited. To solve this problem, a novel deep convolutional network structure termed as reconvolution and upsampling network (ReCUS) is proposed. In the ReCUS, a modified path aggregation network(mPAN) is added after the backbone, which is beneficial to strengthen the foreground salient feature information and weaken background information. Moreover, two new spatial pyramid pooling (SPP) modules are embedded before output heads for multi-scale fusion of local and global features. The experiments show that the effectiveness of our proposed ReCUS. Furthermore, the better detectability of the ReCUS network is demonstrated for both small scale objects and large scale objects.\",\"PeriodicalId\":191226,\"journal\":{\"name\":\"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCIS53392.2021.9754606\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCIS53392.2021.9754606","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
ReCUS: Reconvolution and Upsampling Network for Object Detection
Most of the mainstream object detection models such as RetinaNet, SSD, YOLO, and Faster RCNN hardly achieve a good balance between detection accuracy and speed. A major reason is rich deep feature semantic information of images is not fully exploited. To solve this problem, a novel deep convolutional network structure termed as reconvolution and upsampling network (ReCUS) is proposed. In the ReCUS, a modified path aggregation network(mPAN) is added after the backbone, which is beneficial to strengthen the foreground salient feature information and weaken background information. Moreover, two new spatial pyramid pooling (SPP) modules are embedded before output heads for multi-scale fusion of local and global features. The experiments show that the effectiveness of our proposed ReCUS. Furthermore, the better detectability of the ReCUS network is demonstrated for both small scale objects and large scale objects.