Reciprocal interlayer-temporal discriminative target model for robust visual tracking

IF 2.7 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

International Journal of Machine Learning and Cybernetics Pub Date : 2024-08-15 DOI:10.1007/s13042-024-02296-z

Huanlong Zhang, Zonghao Ma, Yanchun Zhao, Yong Wang, Bin Jiang

{"title":"Reciprocal interlayer-temporal discriminative target model for robust visual tracking","authors":"Huanlong Zhang, Zonghao Ma, Yanchun Zhao, Yong Wang, Bin Jiang","doi":"10.1007/s13042-024-02296-z","DOIUrl":null,"url":null,"abstract":"<p>Most Siamese algorithms take little account of the information interaction between the target and search region, leading to tracking results that are often disturbed by the cumulative effect of target-like distractors between layers. In this paper, we propose a reciprocal interlayer-temporal discriminative target model for robust visual tracking. Firstly, an interlayer target-aware enhancement model is constructed, which establishes pixel-by-pixel correlation between the template and search region to achieve interlayer feature information interaction. This alleviates the cumulative error caused by the blindness of the target to search region during feature extraction, enhancing target perception. Secondly, to weaken the impact of interference, a temporal interference evaluation strategy is designed. It utilizes the interframe candidate propagation module to build associations among multi-candidates in the current frame and the previous frame. Then, the similar distractors are eliminated by using object inference constraint, so as to obtain a more accurate target location. Finally, we integrate the interlayer target-aware enhancement model and temporal interference evaluation strategy into the Siamese framework to achieve reciprocity for robust target tracking. Experimental results show that our tracking approach performs well, especially on seven benchmark datasets, including OTB-100, TC-128, DTB, UAV-123, VOT-2016, VOT-2018 and GOT-10k.</p>","PeriodicalId":51327,"journal":{"name":"International Journal of Machine Learning and Cybernetics","volume":"47 1","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Machine Learning and Cybernetics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s13042-024-02296-z","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Most Siamese algorithms take little account of the information interaction between the target and search region, leading to tracking results that are often disturbed by the cumulative effect of target-like distractors between layers. In this paper, we propose a reciprocal interlayer-temporal discriminative target model for robust visual tracking. Firstly, an interlayer target-aware enhancement model is constructed, which establishes pixel-by-pixel correlation between the template and search region to achieve interlayer feature information interaction. This alleviates the cumulative error caused by the blindness of the target to search region during feature extraction, enhancing target perception. Secondly, to weaken the impact of interference, a temporal interference evaluation strategy is designed. It utilizes the interframe candidate propagation module to build associations among multi-candidates in the current frame and the previous frame. Then, the similar distractors are eliminated by using object inference constraint, so as to obtain a more accurate target location. Finally, we integrate the interlayer target-aware enhancement model and temporal interference evaluation strategy into the Siamese framework to achieve reciprocity for robust target tracking. Experimental results show that our tracking approach performs well, especially on seven benchmark datasets, including OTB-100, TC-128, DTB, UAV-123, VOT-2016, VOT-2018 and GOT-10k.

Abstract Image

查看原文本刊更多论文

用于稳健视觉跟踪的互惠层间时空判别目标模型

大多数连体算法很少考虑目标和搜索区域之间的信息交互，导致跟踪结果经常受到层间目标类干扰物累积效应的干扰。在本文中，我们提出了一种用于稳健视觉跟踪的互惠层间时态判别目标模型。首先，构建层间目标感知增强模型，在模板和搜索区域之间建立逐像素关联，实现层间特征信息交互。这就减轻了特征提取过程中目标对搜索区域的盲区所造成的累积误差，增强了目标感知能力。其次，为了削弱干扰的影响，设计了一种时间干扰评估策略。它利用帧间候选物传播模块，在当前帧和上一帧的多个候选物之间建立关联。然后，利用对象推理约束剔除相似的干扰项，从而获得更准确的目标位置。最后，我们将层间目标感知增强模型和时空干扰评估策略整合到连体框架中，实现互惠的鲁棒目标跟踪。实验结果表明，我们的跟踪方法性能良好，尤其是在七个基准数据集上，包括 OTB-100、TC-128、DTB、UAV-123、VOT-2016、VOT-2018 和 GOT-10k。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Machine Learning and Cybernetics COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

7.90

自引率

10.70%

发文量

225

期刊介绍： Cybernetics is concerned with describing complex interactions and interrelationships between systems which are omnipresent in our daily life. Machine Learning discovers fundamental functional relationships between variables and ensembles of variables in systems. The merging of the disciplines of Machine Learning and Cybernetics is aimed at the discovery of various forms of interaction between systems through diverse mechanisms of learning from data. The International Journal of Machine Learning and Cybernetics (IJMLC) focuses on the key research problems emerging at the junction of machine learning and cybernetics and serves as a broad forum for rapid dissemination of the latest advancements in the area. The emphasis of IJMLC is on the hybrid development of machine learning and cybernetics schemes inspired by different contributing disciplines such as engineering, mathematics, cognitive sciences, and applications. New ideas, design alternatives, implementations and case studies pertaining to all the aspects of machine learning and cybernetics fall within the scope of the IJMLC. Key research areas to be covered by the journal include: Machine Learning for modeling interactions between systems Pattern Recognition technology to support discovery of system-environment interaction Control of system-environment interactions Biochemical interaction in biological and biologically-inspired systems Learning for improvement of communication schemes between systems