{"title":"Visual object tracking: Review and challenges","authors":"Zeshi Chen , Caiping Peng , Shuai Liu , Weiping Ding","doi":"10.1016/j.asoc.2025.113140","DOIUrl":null,"url":null,"abstract":"<div><div>Visual object tracking is a challenging research topic in computer vision. Numerous visual tracking algorithms have been proposed to solve this problem and achieved promising results. Traditional visual tracking algorithms can be categorized into generative and discriminative algorithms. Recently, deep learning based visual tracking algorithms attracted great attention from researchers due to their excellent performance. In order to summarize the development of visual object tracking, some studys have analyzed non-deep learning and deep learning visual tracking algorithms. In this paper, the most advanced tracking algorithms are comprehensively summarized, including both non-deep learning and deep learning based algorithms. First, traditional non-deep learning based tracking algorithms are categorized into generative and discriminative methods. The generative algorithms are summarized from three perspectives: kernel series, subspace series and sparse representation series, and the discriminative algorithms are summarized from two perspectives: correlation filtering series and deep features series. Then, deep learning based algorithms are divided into Siamese network series and Transformer series. Siamese network based algorithms are summarized from different innovation directions, and Transformer based algorithms are summarized from two perspectives: CNN-Transformer and Fully-Transformer. Moreover, the commonly used datasets and evaluation indicators are introduced in visual object tracking, as well as the results and analysis of representative algorithms. Finally, the challenges faced in visual object tracking were summarized and its future development trends were pointed out.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"177 ","pages":"Article 113140"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S156849462500451X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Visual object tracking is a challenging research topic in computer vision. Numerous visual tracking algorithms have been proposed to solve this problem and achieved promising results. Traditional visual tracking algorithms can be categorized into generative and discriminative algorithms. Recently, deep learning based visual tracking algorithms attracted great attention from researchers due to their excellent performance. In order to summarize the development of visual object tracking, some studys have analyzed non-deep learning and deep learning visual tracking algorithms. In this paper, the most advanced tracking algorithms are comprehensively summarized, including both non-deep learning and deep learning based algorithms. First, traditional non-deep learning based tracking algorithms are categorized into generative and discriminative methods. The generative algorithms are summarized from three perspectives: kernel series, subspace series and sparse representation series, and the discriminative algorithms are summarized from two perspectives: correlation filtering series and deep features series. Then, deep learning based algorithms are divided into Siamese network series and Transformer series. Siamese network based algorithms are summarized from different innovation directions, and Transformer based algorithms are summarized from two perspectives: CNN-Transformer and Fully-Transformer. Moreover, the commonly used datasets and evaluation indicators are introduced in visual object tracking, as well as the results and analysis of representative algorithms. Finally, the challenges faced in visual object tracking were summarized and its future development trends were pointed out.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.