Xiaofeng Lu, Xuan Wang, Zhengyan Wang, Xinhong Hei
{"title":"Template Update Based on 3D-Convolutional Siamese Network for Object Tracking","authors":"Xiaofeng Lu, Xuan Wang, Zhengyan Wang, Xinhong Hei","doi":"10.1109/ACCC54619.2021.00016","DOIUrl":null,"url":null,"abstract":"Object tracking is an important research area in the field of computer vision. In the past two years, the object tracking algorithms based on Siamese network have yielded brilliant results in CVPR. However, previous algorithms only extract the object feature of the first frame as a tracking template. In the process of tracking the object, the object template remains unchanged, leading to poor tracking accuracy. In view of this, the present paper proposes a new, end-to-end trained, fully convolutional 3D Siamese network-based tracking algorithm to extract multiple features. Logistic loss function and SGD are used to train the network. The trained network realizes the use of multi-frame features to update the object template in the process of tracking the video's object. The tracker in this paper can run at real-time frame-rates in OTB, VOT, and GOT-10k. The algorithm is applied to SiamFC, its accuracy is improved by 4% and 3% on the OTB and VOT-2016 data sets, respectively.","PeriodicalId":215546,"journal":{"name":"2021 2nd Asia Conference on Computers and Communications (ACCC)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 2nd Asia Conference on Computers and Communications (ACCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACCC54619.2021.00016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Object tracking is an important research area in the field of computer vision. In the past two years, the object tracking algorithms based on Siamese network have yielded brilliant results in CVPR. However, previous algorithms only extract the object feature of the first frame as a tracking template. In the process of tracking the object, the object template remains unchanged, leading to poor tracking accuracy. In view of this, the present paper proposes a new, end-to-end trained, fully convolutional 3D Siamese network-based tracking algorithm to extract multiple features. Logistic loss function and SGD are used to train the network. The trained network realizes the use of multi-frame features to update the object template in the process of tracking the video's object. The tracker in this paper can run at real-time frame-rates in OTB, VOT, and GOT-10k. The algorithm is applied to SiamFC, its accuracy is improved by 4% and 3% on the OTB and VOT-2016 data sets, respectively.