{"title":"Enhanced Semantic Features via Attention for Real-Time Visual Tracking","authors":"M. Geng, Haiying Wang, Yingsen Zeng","doi":"10.1109/VCIP47243.2019.8965870","DOIUrl":null,"url":null,"abstract":"The key to balance the tracking accuracy and speed for object tracking algorithms is to learn powerful features via offline training in a lightweight tracking framework. With the development of attention mechanisms, it’s facile to apply attention to enhance the features without modifying the basic structure of the network. In this paper, a novel combination of different attention modules is implemented into a siamese-based tracker and boosts the tracking performance with little computational burden. In particular, by applying non-local self-attention and dual pooling channel attention, the extracted features tend to be more discriminative and adaptive due to the offline learning with tracking targets of different classes. Meanwhile, an Index-Difference-weight boosts the performance and reduces overfitting when full occlusion occurs. Our experimental results on OTB2013 and OTB2015 show that the tracker using the proposal to implement the attention modules can achieve state-of-the-art performance with a speed of 49 frames per second.","PeriodicalId":388109,"journal":{"name":"2019 IEEE Visual Communications and Image Processing (VCIP)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Visual Communications and Image Processing (VCIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VCIP47243.2019.8965870","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The key to balance the tracking accuracy and speed for object tracking algorithms is to learn powerful features via offline training in a lightweight tracking framework. With the development of attention mechanisms, it’s facile to apply attention to enhance the features without modifying the basic structure of the network. In this paper, a novel combination of different attention modules is implemented into a siamese-based tracker and boosts the tracking performance with little computational burden. In particular, by applying non-local self-attention and dual pooling channel attention, the extracted features tend to be more discriminative and adaptive due to the offline learning with tracking targets of different classes. Meanwhile, an Index-Difference-weight boosts the performance and reduces overfitting when full occlusion occurs. Our experimental results on OTB2013 and OTB2015 show that the tracker using the proposal to implement the attention modules can achieve state-of-the-art performance with a speed of 49 frames per second.