{"title":"在视觉跟踪之前看到清晰","authors":"Ximing Zhang, Yuanbo Wang, Hui Zhao, Xuewu Fan","doi":"10.1109/ICCC56324.2022.10066016","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a two-stages visual tracking method mainly based on two branches including image deblurring and visual tracking. Our main motivation is to achieve the robust visual tracking when the tracker is suffering fast motion blur. Firstly, we present the hierarchical model based on Spatial Pyramid Matching that performs the fine-to-coarse deblurring and exploits localized-to-coarse operations. After achieving the deblurred images, the proposed method use transformer framework with spatial and channel attention for extracting features in order to obtain the spatial and channel features simultaneously to obtain the fast visual tracking with the balance of accuracy and robustness. We first train the one-stage deblurring network in the dataset of Gopro. Then, we train the second stage visusal tracking branch. Lastly, we conduct extensive ablation studies to demonstrate the effectiveness of the proposed tracker, which obtains currently the outperforming results on large tracking benchmarks, we also validate the effectiveness of our method against the fast motion blurring.","PeriodicalId":263098,"journal":{"name":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","volume":"276 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Seeing Clear before Visual Tracking\",\"authors\":\"Ximing Zhang, Yuanbo Wang, Hui Zhao, Xuewu Fan\",\"doi\":\"10.1109/ICCC56324.2022.10066016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a two-stages visual tracking method mainly based on two branches including image deblurring and visual tracking. Our main motivation is to achieve the robust visual tracking when the tracker is suffering fast motion blur. Firstly, we present the hierarchical model based on Spatial Pyramid Matching that performs the fine-to-coarse deblurring and exploits localized-to-coarse operations. After achieving the deblurred images, the proposed method use transformer framework with spatial and channel attention for extracting features in order to obtain the spatial and channel features simultaneously to obtain the fast visual tracking with the balance of accuracy and robustness. We first train the one-stage deblurring network in the dataset of Gopro. Then, we train the second stage visusal tracking branch. Lastly, we conduct extensive ablation studies to demonstrate the effectiveness of the proposed tracker, which obtains currently the outperforming results on large tracking benchmarks, we also validate the effectiveness of our method against the fast motion blurring.\",\"PeriodicalId\":263098,\"journal\":{\"name\":\"2022 IEEE 8th International Conference on Computer and Communications (ICCC)\",\"volume\":\"276 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 8th International Conference on Computer and Communications (ICCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCC56324.2022.10066016\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCC56324.2022.10066016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
In this paper, we propose a two-stages visual tracking method mainly based on two branches including image deblurring and visual tracking. Our main motivation is to achieve the robust visual tracking when the tracker is suffering fast motion blur. Firstly, we present the hierarchical model based on Spatial Pyramid Matching that performs the fine-to-coarse deblurring and exploits localized-to-coarse operations. After achieving the deblurred images, the proposed method use transformer framework with spatial and channel attention for extracting features in order to obtain the spatial and channel features simultaneously to obtain the fast visual tracking with the balance of accuracy and robustness. We first train the one-stage deblurring network in the dataset of Gopro. Then, we train the second stage visusal tracking branch. Lastly, we conduct extensive ablation studies to demonstrate the effectiveness of the proposed tracker, which obtains currently the outperforming results on large tracking benchmarks, we also validate the effectiveness of our method against the fast motion blurring.