Learning Channel Attention in Frequency Domain for Visual Tracking

2021 International Conference on Information Technology and Biomedical Engineering (ICITBE) Pub Date : 2021-12-01 DOI:10.1109/ICITBE54178.2021.00034

Jun Wang, Pei Zhang, Chenchen Meng, Limin Zhang, Yuanyun Wang

{"title":"Learning Channel Attention in Frequency Domain for Visual Tracking","authors":"Jun Wang, Pei Zhang, Chenchen Meng, Limin Zhang, Yuanyun Wang","doi":"10.1109/ICITBE54178.2021.00034","DOIUrl":null,"url":null,"abstract":"Visual tracking based on attention mechanism, especially channel attention, has achieved great success. However, they do not make full use of the correlation between the feature map channels and use the feature information to model the target appearance. In this paper, under the Siamese network framework, we propose a simple and effective tracking algorithm based on channel attention mechanism in the frequency domain, referred to as SiamFCA. The frequency channel attention preprocesses the feature map obtained through the fully convolutional network (FCN). It divides and weights the feature maps in the frequency domain, and recalibrates the channel feature response by modeling the correlation between the feature channels. The proposed model further enhances the feature information of the extracted target image, reduces the model complexity and computational burden. The results of the experiment on the two benchmark datasets of OTB2015 and VOT2016 show that the proposed tracker based on the frequency channel attention mechanism is better than many state-of-the-art trackers.","PeriodicalId":207276,"journal":{"name":"2021 International Conference on Information Technology and Biomedical Engineering (ICITBE)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Information Technology and Biomedical Engineering (ICITBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITBE54178.2021.00034","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Visual tracking based on attention mechanism, especially channel attention, has achieved great success. However, they do not make full use of the correlation between the feature map channels and use the feature information to model the target appearance. In this paper, under the Siamese network framework, we propose a simple and effective tracking algorithm based on channel attention mechanism in the frequency domain, referred to as SiamFCA. The frequency channel attention preprocesses the feature map obtained through the fully convolutional network (FCN). It divides and weights the feature maps in the frequency domain, and recalibrates the channel feature response by modeling the correlation between the feature channels. The proposed model further enhances the feature information of the extracted target image, reduces the model complexity and computational burden. The results of the experiment on the two benchmark datasets of OTB2015 and VOT2016 show that the proposed tracker based on the frequency channel attention mechanism is better than many state-of-the-art trackers.

查看原文本刊更多论文

基于频域的信道注意学习视觉跟踪

基于注意机制的视觉跟踪，特别是基于通道注意的视觉跟踪，已经取得了很大的成功。然而，它们没有充分利用特征映射通道之间的相关性，利用特征信息对目标的外观进行建模。本文在Siamese网络框架下，提出了一种基于频域信道注意机制的简单有效的跟踪算法，称为SiamFCA。频率通道注意对通过全卷积网络(FCN)得到的特征图进行预处理。在频域对特征映射进行划分和加权，通过对特征通道之间的相关性建模，重新校准通道特征响应。该模型进一步增强了提取目标图像的特征信息，降低了模型复杂度和计算量。在OTB2015和VOT2016两个基准数据集上的实验结果表明，本文提出的基于频率通道注意机制的跟踪器优于许多最先进的跟踪器。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 International Conference on Information Technology and Biomedical Engineering (ICITBE)

自引率

0.00%

发文量