Yanchao Wang , Run Li , Dawei Zhang , Minglu Li , Jinli Cao , Zhonglong Zheng
{"title":"CATrack:利用时间增强型外观特征进行条件感知多目标跟踪","authors":"Yanchao Wang , Run Li , Dawei Zhang , Minglu Li , Jinli Cao , Zhonglong Zheng","doi":"10.1016/j.knosys.2024.112760","DOIUrl":null,"url":null,"abstract":"<div><div>Multiple Object Tracking (MOT) is a critical task in computer vision with a wide range of practical applications. However, current methods often use a uniform approach for associating all targets, overlooking the varying conditions of each target. This can lead to performance degradation, especially in crowded scenes with dense targets. To address this issue, we propose a novel Condition-Aware Tracking method (CATrack) to differentiate the appearance feature flow for targets under different conditions. Specifically, we propose three designs for data association and feature update. First, we develop an Adaptive Appearance Association Module (AAAM) that selects suitable track templates based on detection conditions, reducing association errors in long-tail cases like occlusions or motion blur. Second, we design an ambiguous track filtering Selective Update strategy (SU) that filters out potential low-quality embeddings. Thus, the noise accumulation in the maintained track feature will also be reduced. Meanwhile, we propose a confidence-based Adaptive Exponential Moving Average (AEMA) method for the feature state transition. By adaptively adjusting the weights of track and detection embeddings, our AEMA better preserves high-quality target features. By integrating the above modules, CATrack enhances the discriminative capability of appearance features and improves the robustness of appearance-based associations. Extensive experiments on the MOT17 and MOT20 benchmarks validate the effectiveness of the proposed CATrack. Notably, the state-of-the-art results on MOT20 demonstrate the superiority of our method in highly crowded scenarios.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"308 ","pages":"Article 112760"},"PeriodicalIF":7.2000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CATrack: Condition-aware multi-object tracking with temporally enhanced appearance features\",\"authors\":\"Yanchao Wang , Run Li , Dawei Zhang , Minglu Li , Jinli Cao , Zhonglong Zheng\",\"doi\":\"10.1016/j.knosys.2024.112760\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Multiple Object Tracking (MOT) is a critical task in computer vision with a wide range of practical applications. However, current methods often use a uniform approach for associating all targets, overlooking the varying conditions of each target. This can lead to performance degradation, especially in crowded scenes with dense targets. To address this issue, we propose a novel Condition-Aware Tracking method (CATrack) to differentiate the appearance feature flow for targets under different conditions. Specifically, we propose three designs for data association and feature update. First, we develop an Adaptive Appearance Association Module (AAAM) that selects suitable track templates based on detection conditions, reducing association errors in long-tail cases like occlusions or motion blur. Second, we design an ambiguous track filtering Selective Update strategy (SU) that filters out potential low-quality embeddings. Thus, the noise accumulation in the maintained track feature will also be reduced. Meanwhile, we propose a confidence-based Adaptive Exponential Moving Average (AEMA) method for the feature state transition. By adaptively adjusting the weights of track and detection embeddings, our AEMA better preserves high-quality target features. By integrating the above modules, CATrack enhances the discriminative capability of appearance features and improves the robustness of appearance-based associations. Extensive experiments on the MOT17 and MOT20 benchmarks validate the effectiveness of the proposed CATrack. Notably, the state-of-the-art results on MOT20 demonstrate the superiority of our method in highly crowded scenarios.</div></div>\",\"PeriodicalId\":49939,\"journal\":{\"name\":\"Knowledge-Based Systems\",\"volume\":\"308 \",\"pages\":\"Article 112760\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2024-11-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knowledge-Based Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0950705124013947\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705124013947","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
CATrack: Condition-aware multi-object tracking with temporally enhanced appearance features
Multiple Object Tracking (MOT) is a critical task in computer vision with a wide range of practical applications. However, current methods often use a uniform approach for associating all targets, overlooking the varying conditions of each target. This can lead to performance degradation, especially in crowded scenes with dense targets. To address this issue, we propose a novel Condition-Aware Tracking method (CATrack) to differentiate the appearance feature flow for targets under different conditions. Specifically, we propose three designs for data association and feature update. First, we develop an Adaptive Appearance Association Module (AAAM) that selects suitable track templates based on detection conditions, reducing association errors in long-tail cases like occlusions or motion blur. Second, we design an ambiguous track filtering Selective Update strategy (SU) that filters out potential low-quality embeddings. Thus, the noise accumulation in the maintained track feature will also be reduced. Meanwhile, we propose a confidence-based Adaptive Exponential Moving Average (AEMA) method for the feature state transition. By adaptively adjusting the weights of track and detection embeddings, our AEMA better preserves high-quality target features. By integrating the above modules, CATrack enhances the discriminative capability of appearance features and improves the robustness of appearance-based associations. Extensive experiments on the MOT17 and MOT20 benchmarks validate the effectiveness of the proposed CATrack. Notably, the state-of-the-art results on MOT20 demonstrate the superiority of our method in highly crowded scenarios.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.