An Anchor-Free Target Detection Algorithm Combining Attention and Dilation Convolution

2022 International Conference on Frontiers of Artificial Intelligence and Machine Learning (FAIML) Pub Date : 2022-06-01 DOI:10.1109/FAIML57028.2022.00016

Lei Xiong, Fengsui Wang, Yaping Qian, Yue Xu

{"title":"An Anchor-Free Target Detection Algorithm Combining Attention and Dilation Convolution","authors":"Lei Xiong, Fengsui Wang, Yaping Qian, Yue Xu","doi":"10.1109/FAIML57028.2022.00016","DOIUrl":null,"url":null,"abstract":"Aiming at the problem of insufficient target detection capability in CenterNet, an improved target detection model combining attention and cavity convolution is proposed. Firstly, in order to improve the ability of the network to obtain the semantic and location features of the target, an improved nonlocal attention mechanism module (CANL) is designed to capture the remote dependence of the target in the image along the channel domain and the spatial domain, respectively. Secondly, a multi-scale feature extraction network based on dilation convolution (MSNet) is designed to improve the expression ability of the network to different scale targets, the residual structure is used to fuse the receptive field features of multiple scales in parallel, and the feature information obtained by the target in the image at multiple scales is retained. Finally, the proposed algorithm is verified on PASCAL VOC dataset. The detection accuracy of the proposed algorithm is 2.65 % higher than that of the baseline algorithm CenterNet, which effectively improves the performance of the anchorless object detection algorithm.","PeriodicalId":307172,"journal":{"name":"2022 International Conference on Frontiers of Artificial Intelligence and Machine Learning (FAIML)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Frontiers of Artificial Intelligence and Machine Learning (FAIML)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FAIML57028.2022.00016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Aiming at the problem of insufficient target detection capability in CenterNet, an improved target detection model combining attention and cavity convolution is proposed. Firstly, in order to improve the ability of the network to obtain the semantic and location features of the target, an improved nonlocal attention mechanism module (CANL) is designed to capture the remote dependence of the target in the image along the channel domain and the spatial domain, respectively. Secondly, a multi-scale feature extraction network based on dilation convolution (MSNet) is designed to improve the expression ability of the network to different scale targets, the residual structure is used to fuse the receptive field features of multiple scales in parallel, and the feature information obtained by the target in the image at multiple scales is retained. Finally, the proposed algorithm is verified on PASCAL VOC dataset. The detection accuracy of the proposed algorithm is 2.65 % higher than that of the baseline algorithm CenterNet, which effectively improves the performance of the anchorless object detection algorithm.

查看原文本刊更多论文

一种结合注意卷积和扩张卷积的无锚目标检测算法

针对CenterNet中目标检测能力不足的问题，提出了一种结合注意和空腔卷积的改进目标检测模型。首先，为了提高网络获取目标语义特征和位置特征的能力，设计了改进的非局部注意机制模块(CANL)，分别沿信道域和空间域捕获图像中目标的远程依赖性;其次，设计基于扩张卷积的多尺度特征提取网络(MSNet)，提高网络对不同尺度目标的表达能力，利用残差结构并行融合多尺度的感受野特征，保留图像中目标在多尺度下获得的特征信息;最后，在PASCAL VOC数据集上对该算法进行了验证。该算法的检测精度比基线算法CenterNet提高2.65%，有效提高了无锚点目标检测算法的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 International Conference on Frontiers of Artificial Intelligence and Machine Learning (FAIML)

自引率

0.00%

发文量