在视觉跟踪之前看到清晰

2022 IEEE 8th International Conference on Computer and Communications (ICCC) Pub Date : 2022-12-09 DOI:10.1109/ICCC56324.2022.10066016

Ximing Zhang, Yuanbo Wang, Hui Zhao, Xuewu Fan

{"title":"在视觉跟踪之前看到清晰","authors":"Ximing Zhang, Yuanbo Wang, Hui Zhao, Xuewu Fan","doi":"10.1109/ICCC56324.2022.10066016","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a two-stages visual tracking method mainly based on two branches including image deblurring and visual tracking. Our main motivation is to achieve the robust visual tracking when the tracker is suffering fast motion blur. Firstly, we present the hierarchical model based on Spatial Pyramid Matching that performs the fine-to-coarse deblurring and exploits localized-to-coarse operations. After achieving the deblurred images, the proposed method use transformer framework with spatial and channel attention for extracting features in order to obtain the spatial and channel features simultaneously to obtain the fast visual tracking with the balance of accuracy and robustness. We first train the one-stage deblurring network in the dataset of Gopro. Then, we train the second stage visusal tracking branch. Lastly, we conduct extensive ablation studies to demonstrate the effectiveness of the proposed tracker, which obtains currently the outperforming results on large tracking benchmarks, we also validate the effectiveness of our method against the fast motion blurring.","PeriodicalId":263098,"journal":{"name":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","volume":"276 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Seeing Clear before Visual Tracking\",\"authors\":\"Ximing Zhang, Yuanbo Wang, Hui Zhao, Xuewu Fan\",\"doi\":\"10.1109/ICCC56324.2022.10066016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a two-stages visual tracking method mainly based on two branches including image deblurring and visual tracking. Our main motivation is to achieve the robust visual tracking when the tracker is suffering fast motion blur. Firstly, we present the hierarchical model based on Spatial Pyramid Matching that performs the fine-to-coarse deblurring and exploits localized-to-coarse operations. After achieving the deblurred images, the proposed method use transformer framework with spatial and channel attention for extracting features in order to obtain the spatial and channel features simultaneously to obtain the fast visual tracking with the balance of accuracy and robustness. We first train the one-stage deblurring network in the dataset of Gopro. Then, we train the second stage visusal tracking branch. Lastly, we conduct extensive ablation studies to demonstrate the effectiveness of the proposed tracker, which obtains currently the outperforming results on large tracking benchmarks, we also validate the effectiveness of our method against the fast motion blurring.\",\"PeriodicalId\":263098,\"journal\":{\"name\":\"2022 IEEE 8th International Conference on Computer and Communications (ICCC)\",\"volume\":\"276 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 8th International Conference on Computer and Communications (ICCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCC56324.2022.10066016\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 8th International Conference on Computer and Communications (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCC56324.2022.10066016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文提出了一种基于图像去模糊和视觉跟踪两个分支的两阶段视觉跟踪方法。我们的主要动机是在跟踪器遭受快速运动模糊时实现鲁棒的视觉跟踪。首先，我们提出了基于空间金字塔匹配的分层模型，该模型实现了从精细到粗的去模糊，并利用了从局部到粗的操作。在对图像进行去模糊处理后，采用具有空间和通道关注的变换框架进行特征提取，以同时获取空间和通道特征，从而获得精度和鲁棒性兼顾的快速视觉跟踪。我们首先在Gopro的数据集上训练一步去模糊网络。然后，训练第二阶段的视觉跟踪分支。最后，我们进行了广泛的消融研究，以证明所提出的跟踪器的有效性，该跟踪器在大型跟踪基准上获得了目前优异的结果，我们还验证了我们的方法对快速运动模糊的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Seeing Clear before Visual Tracking

In this paper, we propose a two-stages visual tracking method mainly based on two branches including image deblurring and visual tracking. Our main motivation is to achieve the robust visual tracking when the tracker is suffering fast motion blur. Firstly, we present the hierarchical model based on Spatial Pyramid Matching that performs the fine-to-coarse deblurring and exploits localized-to-coarse operations. After achieving the deblurred images, the proposed method use transformer framework with spatial and channel attention for extracting features in order to obtain the spatial and channel features simultaneously to obtain the fast visual tracking with the balance of accuracy and robustness. We first train the one-stage deblurring network in the dataset of Gopro. Then, we train the second stage visusal tracking branch. Lastly, we conduct extensive ablation studies to demonstrate the effectiveness of the proposed tracker, which obtains currently the outperforming results on large tracking benchmarks, we also validate the effectiveness of our method against the fast motion blurring.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE 8th International Conference on Computer and Communications (ICCC)

自引率

0.00%

发文量