{"title":"用于弱监督多目标跟踪的小波域特征解耦","authors":"Yu-Lei Li, Yan Yan, Yang Lu, Hanzi Wang","doi":"10.1007/s11432-022-4097-y","DOIUrl":null,"url":null,"abstract":"<p>We present a wavelet-domain feature-decoupling Transformer-based tracking network for the weakly supervised MOT task (FDMOT). Our FDMOT has two improvements over the previous weakly supervised methods. First, FDMOT decouples noisy intermediate features caused by noisy pseudo identity labels in the wavelet domain, extracting discriminative features for accurately detecting and identifying multiple targets. Second, FDMOT further improves the noise-decoupled embedding features into the well-refined ones with the cooperation of the three feature-decoupling Transformer-based branches, which can accurately identify and track heavily occluded targets in crowded scenes. Experimental results show the superiority of FDMOT compared with several state-of-the-art supervised and weakly supervised MOT methods.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"35 1","pages":""},"PeriodicalIF":7.3000,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Wavelet-domain feature decoupling for weakly supervised multi-object tracking\",\"authors\":\"Yu-Lei Li, Yan Yan, Yang Lu, Hanzi Wang\",\"doi\":\"10.1007/s11432-022-4097-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>We present a wavelet-domain feature-decoupling Transformer-based tracking network for the weakly supervised MOT task (FDMOT). Our FDMOT has two improvements over the previous weakly supervised methods. First, FDMOT decouples noisy intermediate features caused by noisy pseudo identity labels in the wavelet domain, extracting discriminative features for accurately detecting and identifying multiple targets. Second, FDMOT further improves the noise-decoupled embedding features into the well-refined ones with the cooperation of the three feature-decoupling Transformer-based branches, which can accurately identify and track heavily occluded targets in crowded scenes. Experimental results show the superiority of FDMOT compared with several state-of-the-art supervised and weakly supervised MOT methods.</p>\",\"PeriodicalId\":21618,\"journal\":{\"name\":\"Science China Information Sciences\",\"volume\":\"35 1\",\"pages\":\"\"},\"PeriodicalIF\":7.3000,\"publicationDate\":\"2024-07-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Science China Information Sciences\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s11432-022-4097-y\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science China Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11432-022-4097-y","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
摘要
我们提出了一种基于小波域特征解耦变换器的跟踪网络,用于弱监督 MOT 任务(FDMOT)。与之前的弱监督方法相比,我们的 FDMOT 有两点改进。首先,FDMOT 在小波域中解耦了由噪声伪身份标签引起的噪声中间特征,提取了用于准确检测和识别多个目标的判别特征。其次,在基于变换器的三个特征解耦分支的配合下,FDMOT 进一步将噪声解耦嵌入特征改进为精炼特征,可在拥挤场景中准确识别和跟踪重度遮挡目标。实验结果表明,与几种最先进的监督式和弱监督式 MOT 方法相比,FDMOT 更具优势。
Wavelet-domain feature decoupling for weakly supervised multi-object tracking
We present a wavelet-domain feature-decoupling Transformer-based tracking network for the weakly supervised MOT task (FDMOT). Our FDMOT has two improvements over the previous weakly supervised methods. First, FDMOT decouples noisy intermediate features caused by noisy pseudo identity labels in the wavelet domain, extracting discriminative features for accurately detecting and identifying multiple targets. Second, FDMOT further improves the noise-decoupled embedding features into the well-refined ones with the cooperation of the three feature-decoupling Transformer-based branches, which can accurately identify and track heavily occluded targets in crowded scenes. Experimental results show the superiority of FDMOT compared with several state-of-the-art supervised and weakly supervised MOT methods.
期刊介绍:
Science China Information Sciences is a dedicated journal that showcases high-quality, original research across various domains of information sciences. It encompasses Computer Science & Technologies, Control Science & Engineering, Information & Communication Engineering, Microelectronics & Solid-State Electronics, and Quantum Information, providing a platform for the dissemination of significant contributions in these fields.