{"title":"Seeing Clear before Visual Tracking","authors":"Ximing Zhang, Yuanbo Wang, Hui Zhao, Xuewu Fan","doi":"10.1109/ICCC56324.2022.10066016","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a two-stages visual tracking method mainly based on two branches including image deblurring and visual tracking. Our main motivation is to achieve the robust visual tracking when the tracker is suffering fast motion blur. Firstly, we present the hierarchical model based on Spatial Pyramid Matching that performs the fine-to-coarse deblurring and exploits localized-to-coarse operations. After achieving the deblurred images, the proposed method use transformer framework with spatial and channel attention for extracting features in order to obtain the spatial and channel features simultaneously to obtain the fast visual tracking with the balance of accuracy and robustness. We first train the one-stage deblurring network in the dataset of Gopro. Then, we train the second stage visusal tracking branch. Lastly, we conduct extensive ablation studies to demonstrate the effectiveness of the proposed tracker, which obtains currently the outperforming results on large tracking benchmarks, we also validate the effectiveness of our method against the fast motion blurring.","PeriodicalId":263098,"journal":{"name":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","volume":"276 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCC56324.2022.10066016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we propose a two-stages visual tracking method mainly based on two branches including image deblurring and visual tracking. Our main motivation is to achieve the robust visual tracking when the tracker is suffering fast motion blur. Firstly, we present the hierarchical model based on Spatial Pyramid Matching that performs the fine-to-coarse deblurring and exploits localized-to-coarse operations. After achieving the deblurred images, the proposed method use transformer framework with spatial and channel attention for extracting features in order to obtain the spatial and channel features simultaneously to obtain the fast visual tracking with the balance of accuracy and robustness. We first train the one-stage deblurring network in the dataset of Gopro. Then, we train the second stage visusal tracking branch. Lastly, we conduct extensive ablation studies to demonstrate the effectiveness of the proposed tracker, which obtains currently the outperforming results on large tracking benchmarks, we also validate the effectiveness of our method against the fast motion blurring.