Xiaofeng Lu, Xuan Wang, Zhengyan Wang, Xinhong Hei
{"title":"基于三维卷积Siamese网络的目标跟踪模板更新","authors":"Xiaofeng Lu, Xuan Wang, Zhengyan Wang, Xinhong Hei","doi":"10.1109/ACCC54619.2021.00016","DOIUrl":null,"url":null,"abstract":"Object tracking is an important research area in the field of computer vision. In the past two years, the object tracking algorithms based on Siamese network have yielded brilliant results in CVPR. However, previous algorithms only extract the object feature of the first frame as a tracking template. In the process of tracking the object, the object template remains unchanged, leading to poor tracking accuracy. In view of this, the present paper proposes a new, end-to-end trained, fully convolutional 3D Siamese network-based tracking algorithm to extract multiple features. Logistic loss function and SGD are used to train the network. The trained network realizes the use of multi-frame features to update the object template in the process of tracking the video's object. The tracker in this paper can run at real-time frame-rates in OTB, VOT, and GOT-10k. The algorithm is applied to SiamFC, its accuracy is improved by 4% and 3% on the OTB and VOT-2016 data sets, respectively.","PeriodicalId":215546,"journal":{"name":"2021 2nd Asia Conference on Computers and Communications (ACCC)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Template Update Based on 3D-Convolutional Siamese Network for Object Tracking\",\"authors\":\"Xiaofeng Lu, Xuan Wang, Zhengyan Wang, Xinhong Hei\",\"doi\":\"10.1109/ACCC54619.2021.00016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Object tracking is an important research area in the field of computer vision. In the past two years, the object tracking algorithms based on Siamese network have yielded brilliant results in CVPR. However, previous algorithms only extract the object feature of the first frame as a tracking template. In the process of tracking the object, the object template remains unchanged, leading to poor tracking accuracy. In view of this, the present paper proposes a new, end-to-end trained, fully convolutional 3D Siamese network-based tracking algorithm to extract multiple features. Logistic loss function and SGD are used to train the network. The trained network realizes the use of multi-frame features to update the object template in the process of tracking the video's object. The tracker in this paper can run at real-time frame-rates in OTB, VOT, and GOT-10k. The algorithm is applied to SiamFC, its accuracy is improved by 4% and 3% on the OTB and VOT-2016 data sets, respectively.\",\"PeriodicalId\":215546,\"journal\":{\"name\":\"2021 2nd Asia Conference on Computers and Communications (ACCC)\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 2nd Asia Conference on Computers and Communications (ACCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACCC54619.2021.00016\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 2nd Asia Conference on Computers and Communications (ACCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACCC54619.2021.00016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Template Update Based on 3D-Convolutional Siamese Network for Object Tracking
Object tracking is an important research area in the field of computer vision. In the past two years, the object tracking algorithms based on Siamese network have yielded brilliant results in CVPR. However, previous algorithms only extract the object feature of the first frame as a tracking template. In the process of tracking the object, the object template remains unchanged, leading to poor tracking accuracy. In view of this, the present paper proposes a new, end-to-end trained, fully convolutional 3D Siamese network-based tracking algorithm to extract multiple features. Logistic loss function and SGD are used to train the network. The trained network realizes the use of multi-frame features to update the object template in the process of tracking the video's object. The tracker in this paper can run at real-time frame-rates in OTB, VOT, and GOT-10k. The algorithm is applied to SiamFC, its accuracy is improved by 4% and 3% on the OTB and VOT-2016 data sets, respectively.