{"title":"An Anchor-Free Target Detection Algorithm Combining Attention and Dilation Convolution","authors":"Lei Xiong, Fengsui Wang, Yaping Qian, Yue Xu","doi":"10.1109/FAIML57028.2022.00016","DOIUrl":null,"url":null,"abstract":"Aiming at the problem of insufficient target detection capability in CenterNet, an improved target detection model combining attention and cavity convolution is proposed. Firstly, in order to improve the ability of the network to obtain the semantic and location features of the target, an improved nonlocal attention mechanism module (CANL) is designed to capture the remote dependence of the target in the image along the channel domain and the spatial domain, respectively. Secondly, a multi-scale feature extraction network based on dilation convolution (MSNet) is designed to improve the expression ability of the network to different scale targets, the residual structure is used to fuse the receptive field features of multiple scales in parallel, and the feature information obtained by the target in the image at multiple scales is retained. Finally, the proposed algorithm is verified on PASCAL VOC dataset. The detection accuracy of the proposed algorithm is 2.65 % higher than that of the baseline algorithm CenterNet, which effectively improves the performance of the anchorless object detection algorithm.","PeriodicalId":307172,"journal":{"name":"2022 International Conference on Frontiers of Artificial Intelligence and Machine Learning (FAIML)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Frontiers of Artificial Intelligence and Machine Learning (FAIML)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FAIML57028.2022.00016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Aiming at the problem of insufficient target detection capability in CenterNet, an improved target detection model combining attention and cavity convolution is proposed. Firstly, in order to improve the ability of the network to obtain the semantic and location features of the target, an improved nonlocal attention mechanism module (CANL) is designed to capture the remote dependence of the target in the image along the channel domain and the spatial domain, respectively. Secondly, a multi-scale feature extraction network based on dilation convolution (MSNet) is designed to improve the expression ability of the network to different scale targets, the residual structure is used to fuse the receptive field features of multiple scales in parallel, and the feature information obtained by the target in the image at multiple scales is retained. Finally, the proposed algorithm is verified on PASCAL VOC dataset. The detection accuracy of the proposed algorithm is 2.65 % higher than that of the baseline algorithm CenterNet, which effectively improves the performance of the anchorless object detection algorithm.