基于三维卷积Siamese网络的目标跟踪模板更新

2021 2nd Asia Conference on Computers and Communications (ACCC) Pub Date : 2021-09-01 DOI:10.1109/ACCC54619.2021.00016

Xiaofeng Lu, Xuan Wang, Zhengyan Wang, Xinhong Hei

{"title":"基于三维卷积Siamese网络的目标跟踪模板更新","authors":"Xiaofeng Lu, Xuan Wang, Zhengyan Wang, Xinhong Hei","doi":"10.1109/ACCC54619.2021.00016","DOIUrl":null,"url":null,"abstract":"Object tracking is an important research area in the field of computer vision. In the past two years, the object tracking algorithms based on Siamese network have yielded brilliant results in CVPR. However, previous algorithms only extract the object feature of the first frame as a tracking template. In the process of tracking the object, the object template remains unchanged, leading to poor tracking accuracy. In view of this, the present paper proposes a new, end-to-end trained, fully convolutional 3D Siamese network-based tracking algorithm to extract multiple features. Logistic loss function and SGD are used to train the network. The trained network realizes the use of multi-frame features to update the object template in the process of tracking the video's object. The tracker in this paper can run at real-time frame-rates in OTB, VOT, and GOT-10k. The algorithm is applied to SiamFC, its accuracy is improved by 4% and 3% on the OTB and VOT-2016 data sets, respectively.","PeriodicalId":215546,"journal":{"name":"2021 2nd Asia Conference on Computers and Communications (ACCC)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Template Update Based on 3D-Convolutional Siamese Network for Object Tracking\",\"authors\":\"Xiaofeng Lu, Xuan Wang, Zhengyan Wang, Xinhong Hei\",\"doi\":\"10.1109/ACCC54619.2021.00016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Object tracking is an important research area in the field of computer vision. In the past two years, the object tracking algorithms based on Siamese network have yielded brilliant results in CVPR. However, previous algorithms only extract the object feature of the first frame as a tracking template. In the process of tracking the object, the object template remains unchanged, leading to poor tracking accuracy. In view of this, the present paper proposes a new, end-to-end trained, fully convolutional 3D Siamese network-based tracking algorithm to extract multiple features. Logistic loss function and SGD are used to train the network. The trained network realizes the use of multi-frame features to update the object template in the process of tracking the video's object. The tracker in this paper can run at real-time frame-rates in OTB, VOT, and GOT-10k. The algorithm is applied to SiamFC, its accuracy is improved by 4% and 3% on the OTB and VOT-2016 data sets, respectively.\",\"PeriodicalId\":215546,\"journal\":{\"name\":\"2021 2nd Asia Conference on Computers and Communications (ACCC)\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 2nd Asia Conference on Computers and Communications (ACCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACCC54619.2021.00016\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 2nd Asia Conference on Computers and Communications (ACCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACCC54619.2021.00016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

目标跟踪是计算机视觉领域的一个重要研究方向。在过去的两年中，基于Siamese网络的目标跟踪算法在CVPR中取得了辉煌的成果。然而，以往的算法只提取第一帧的目标特征作为跟踪模板。在跟踪目标的过程中，目标模板保持不变，导致跟踪精度较差。鉴于此，本文提出了一种新的、端到端训练的、基于全卷积3D Siamese网络的跟踪算法来提取多个特征。利用Logistic损失函数和SGD对网络进行训练。训练后的网络实现了在跟踪视频对象过程中利用多帧特征更新对象模板。本文的跟踪器可以在OTB、VOT和GOT-10k的实时帧速率下运行。将该算法应用于SiamFC，在OTB和VOT-2016数据集上，其准确率分别提高了4%和3%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Template Update Based on 3D-Convolutional Siamese Network for Object Tracking

Object tracking is an important research area in the field of computer vision. In the past two years, the object tracking algorithms based on Siamese network have yielded brilliant results in CVPR. However, previous algorithms only extract the object feature of the first frame as a tracking template. In the process of tracking the object, the object template remains unchanged, leading to poor tracking accuracy. In view of this, the present paper proposes a new, end-to-end trained, fully convolutional 3D Siamese network-based tracking algorithm to extract multiple features. Logistic loss function and SGD are used to train the network. The trained network realizes the use of multi-frame features to update the object template in the process of tracking the video's object. The tracker in this paper can run at real-time frame-rates in OTB, VOT, and GOT-10k. The algorithm is applied to SiamFC, its accuracy is improved by 4% and 3% on the OTB and VOT-2016 data sets, respectively.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 2nd Asia Conference on Computers and Communications (ACCC)

自引率

0.00%

发文量