Qi Xu , Zhuoming Xu , Huabin Wang , Yun Chen , Liang Tao
{"title":"Online learning discriminative sparse convolution networks for robust UAV object tracking","authors":"Qi Xu , Zhuoming Xu , Huabin Wang , Yun Chen , Liang Tao","doi":"10.1016/j.knosys.2024.112742","DOIUrl":null,"url":null,"abstract":"<div><div>Despite the remarkable empirical success for UAV object tracking, current convolutional networks usually have three unavoidable limitations: (1) The feature maps produced by convolutional layers are difficult to interpret. (2) The network needs to be trained offline on a large-scale auxiliary training set, resulting in the feature extraction ability of the trained network depending on the categories of the training set. (3) The performance of networks suffers from sensitivity to hyper-parameters (such as learning rate and weight decay) when the network needs online fine-tuning. To overcome the three limitations, this paper proposes a Discriminative Sparse Convolutional Network (DSCN) that exhibits good layer-wise interpretability and can be trained online without requiring any auxiliary training data. By imposing sparsity constraints on the convolutional kernels, DSCN furnishes the convolution layer with an explicit data meaning, thus enhancing the interpretability of the feature maps. These convolutional kernels are directly learned online from image blocks, which eliminates the offline training process on auxiliary data sets. Moreover, a simple yet effective online tuning method with few hyper-parameters is proposed to fine-tune fully connected layers online. We have successfully applied DSCN to UAV object tracking and conducted extensive experiments on six mainstream UAV datasets. The experimental results demonstrate that our method performs favorably against several state-of-the-art tracking algorithms in terms of tracking accuracy and robustness.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"308 ","pages":"Article 112742"},"PeriodicalIF":7.2000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705124013765","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Despite the remarkable empirical success for UAV object tracking, current convolutional networks usually have three unavoidable limitations: (1) The feature maps produced by convolutional layers are difficult to interpret. (2) The network needs to be trained offline on a large-scale auxiliary training set, resulting in the feature extraction ability of the trained network depending on the categories of the training set. (3) The performance of networks suffers from sensitivity to hyper-parameters (such as learning rate and weight decay) when the network needs online fine-tuning. To overcome the three limitations, this paper proposes a Discriminative Sparse Convolutional Network (DSCN) that exhibits good layer-wise interpretability and can be trained online without requiring any auxiliary training data. By imposing sparsity constraints on the convolutional kernels, DSCN furnishes the convolution layer with an explicit data meaning, thus enhancing the interpretability of the feature maps. These convolutional kernels are directly learned online from image blocks, which eliminates the offline training process on auxiliary data sets. Moreover, a simple yet effective online tuning method with few hyper-parameters is proposed to fine-tune fully connected layers online. We have successfully applied DSCN to UAV object tracking and conducted extensive experiments on six mainstream UAV datasets. The experimental results demonstrate that our method performs favorably against several state-of-the-art tracking algorithms in terms of tracking accuracy and robustness.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.