基于多尺度特征融合和剩余注意机制的改进SSD算法

2021 3rd International Conference on Advances in Computer Technology, Information Science and Communication (CTISC) Pub Date : 2021-04-01 DOI:10.1109/CTISC52352.2021.00024

Yongquan Zhao

{"title":"基于多尺度特征融合和剩余注意机制的改进SSD算法","authors":"Yongquan Zhao","doi":"10.1109/CTISC52352.2021.00024","DOIUrl":null,"url":null,"abstract":"Convolutional neural network (CNN) has led to significant progress in object detection. In order to detect the objects in various sizes, the object detectors often exploit the hierarchy of the multi-scale feature maps called feature pyramid, which is readily obtained by the CNN architecture. However, such feature maps do not fully consider the supplementary effect of contextual information on semantics. In this work, we proposed a feature fusion method of residual attention based on the SSD benchmark network call Improved SSD to make full use of context information to improve the characterization ability of feature maps. Besides, we use the residual attention mechanism to reinforce the key features to further improve the detector performance. The experiment result on benchmark dataset PASCAL VOC shows that the map of the proposed method with input image sizes of 300×300 and 512×512 is 78.8% and 80.7%.","PeriodicalId":268378,"journal":{"name":"2021 3rd International Conference on Advances in Computer Technology, Information Science and Communication (CTISC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved SSD Algorithm Based on Multi-scale Feature Fusion and Residual Attention Mechanism\",\"authors\":\"Yongquan Zhao\",\"doi\":\"10.1109/CTISC52352.2021.00024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Convolutional neural network (CNN) has led to significant progress in object detection. In order to detect the objects in various sizes, the object detectors often exploit the hierarchy of the multi-scale feature maps called feature pyramid, which is readily obtained by the CNN architecture. However, such feature maps do not fully consider the supplementary effect of contextual information on semantics. In this work, we proposed a feature fusion method of residual attention based on the SSD benchmark network call Improved SSD to make full use of context information to improve the characterization ability of feature maps. Besides, we use the residual attention mechanism to reinforce the key features to further improve the detector performance. The experiment result on benchmark dataset PASCAL VOC shows that the map of the proposed method with input image sizes of 300×300 and 512×512 is 78.8% and 80.7%.\",\"PeriodicalId\":268378,\"journal\":{\"name\":\"2021 3rd International Conference on Advances in Computer Technology, Information Science and Communication (CTISC)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 3rd International Conference on Advances in Computer Technology, Information Science and Communication (CTISC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CTISC52352.2021.00024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 3rd International Conference on Advances in Computer Technology, Information Science and Communication (CTISC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CTISC52352.2021.00024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

卷积神经网络(CNN)在目标检测方面取得了重大进展。为了检测不同大小的目标，目标检测器通常利用多尺度特征映射的层次结构，称为特征金字塔，这很容易通过CNN架构获得。然而，这种特征映射并没有充分考虑上下文信息对语义的补充作用。在这项工作中，我们提出了一种基于SSD基准网络call Improved SSD的剩余注意力特征融合方法，充分利用上下文信息来提高特征映射的表征能力。此外，我们使用剩余注意机制来强化关键特征，进一步提高检测器的性能。在基准数据集PASCAL VOC上的实验结果表明，该方法在输入图像尺寸分别为300×300和512×512时的映射率分别为78.8%和80.7%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Improved SSD Algorithm Based on Multi-scale Feature Fusion and Residual Attention Mechanism

Convolutional neural network (CNN) has led to significant progress in object detection. In order to detect the objects in various sizes, the object detectors often exploit the hierarchy of the multi-scale feature maps called feature pyramid, which is readily obtained by the CNN architecture. However, such feature maps do not fully consider the supplementary effect of contextual information on semantics. In this work, we proposed a feature fusion method of residual attention based on the SSD benchmark network call Improved SSD to make full use of context information to improve the characterization ability of feature maps. Besides, we use the residual attention mechanism to reinforce the key features to further improve the detector performance. The experiment result on benchmark dataset PASCAL VOC shows that the map of the proposed method with input image sizes of 300×300 and 512×512 is 78.8% and 80.7%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 3rd International Conference on Advances in Computer Technology, Information Science and Communication (CTISC)

自引率

0.00%

发文量