基于最优运输和协调注意机制的多目标跟踪

IF 3.4 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC
Wenjuan Shi, Xiangwei Zheng, Lifeng Zhang, Cun Ji, Yuang Zhang, Ji Bian
{"title":"基于最优运输和协调注意机制的多目标跟踪","authors":"Wenjuan Shi,&nbsp;Xiangwei Zheng,&nbsp;Lifeng Zhang,&nbsp;Cun Ji,&nbsp;Yuang Zhang,&nbsp;Ji Bian","doi":"10.1016/j.sigpro.2025.110058","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-Object Tracking (MOT) has currently attracted significant interest due to its wide applications in various fields, such as autonomous driving, intelligent surveillance, and behavior recognition. However, appearance similarity of different objects results in low accuracy of target matching and difficulties in data association. In this paper, we propose a Multi-Object Tracking based on Optimal Transport and Coordinate Attention Mechanism (MOT2A), which addresses above challenges by integrating the attention mechanism with optimal transport. These strategies effectively enhance the extraction of discriminative appearance features and improve target matching between different frames. Firstly, we construct a novel Coordinate attention module (CASA), which models the interdependence between the channel domain and the spatial domain of the feature map. Secondly, a Triplet loss with optimal transport (SK-Triplet) is designed to adjust the distance matrix for effective clustering of positive and negative samples during loss calculation. Finally, extensive experiments are conducted on MOT17 and MOT20. For MOT17: 79.4 MOTA, 78.9 IDF1, and 63.9 HOTA; For MOT20: 77.0 MOTA, 76.3 IDF1, and 62.3 HOTA are achieved, respectively. Compared to existing MOT methods, our method shows significant improvements in accuracy and stability. The code is available at: <span><span>https://github.com/420-s/MOT2A</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49523,"journal":{"name":"Signal Processing","volume":"236 ","pages":"Article 110058"},"PeriodicalIF":3.4000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-Object Tracking based on Optimal Transport and Coordinate Attention Mechanism\",\"authors\":\"Wenjuan Shi,&nbsp;Xiangwei Zheng,&nbsp;Lifeng Zhang,&nbsp;Cun Ji,&nbsp;Yuang Zhang,&nbsp;Ji Bian\",\"doi\":\"10.1016/j.sigpro.2025.110058\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Multi-Object Tracking (MOT) has currently attracted significant interest due to its wide applications in various fields, such as autonomous driving, intelligent surveillance, and behavior recognition. However, appearance similarity of different objects results in low accuracy of target matching and difficulties in data association. In this paper, we propose a Multi-Object Tracking based on Optimal Transport and Coordinate Attention Mechanism (MOT2A), which addresses above challenges by integrating the attention mechanism with optimal transport. These strategies effectively enhance the extraction of discriminative appearance features and improve target matching between different frames. Firstly, we construct a novel Coordinate attention module (CASA), which models the interdependence between the channel domain and the spatial domain of the feature map. Secondly, a Triplet loss with optimal transport (SK-Triplet) is designed to adjust the distance matrix for effective clustering of positive and negative samples during loss calculation. Finally, extensive experiments are conducted on MOT17 and MOT20. For MOT17: 79.4 MOTA, 78.9 IDF1, and 63.9 HOTA; For MOT20: 77.0 MOTA, 76.3 IDF1, and 62.3 HOTA are achieved, respectively. Compared to existing MOT methods, our method shows significant improvements in accuracy and stability. The code is available at: <span><span>https://github.com/420-s/MOT2A</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":49523,\"journal\":{\"name\":\"Signal Processing\",\"volume\":\"236 \",\"pages\":\"Article 110058\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0165168425001720\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165168425001720","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

多目标跟踪(MOT)由于其在自动驾驶、智能监控和行为识别等各个领域的广泛应用,目前引起了人们极大的兴趣。然而,不同目标的外观相似性导致目标匹配精度低,数据关联困难。在本文中,我们提出了一种基于最优运输和协调注意机制(MOT2A)的多目标跟踪,通过将注意机制与最优运输相结合来解决上述挑战。这些策略有效地增强了识别性外观特征的提取,提高了不同帧之间的目标匹配。首先,我们构建了一种新的坐标关注模块(CASA),该模块对特征映射的通道域和空间域之间的相互依存关系进行了建模。其次,设计了具有最优输运的三重态损耗(SK-Triplet),在损耗计算过程中调整距离矩阵,实现正、负样本的有效聚类。最后,对MOT17和MOT20进行了大量的实验。mo17: 79.4 MOTA, 78.9 IDF1, 63.9 HOTA;MOTA为77.0,IDF1为76.3,HOTA为62.3。与现有的MOT方法相比,我们的方法在精度和稳定性上有了显著的提高。代码可从https://github.com/420-s/MOT2A获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-Object Tracking based on Optimal Transport and Coordinate Attention Mechanism
Multi-Object Tracking (MOT) has currently attracted significant interest due to its wide applications in various fields, such as autonomous driving, intelligent surveillance, and behavior recognition. However, appearance similarity of different objects results in low accuracy of target matching and difficulties in data association. In this paper, we propose a Multi-Object Tracking based on Optimal Transport and Coordinate Attention Mechanism (MOT2A), which addresses above challenges by integrating the attention mechanism with optimal transport. These strategies effectively enhance the extraction of discriminative appearance features and improve target matching between different frames. Firstly, we construct a novel Coordinate attention module (CASA), which models the interdependence between the channel domain and the spatial domain of the feature map. Secondly, a Triplet loss with optimal transport (SK-Triplet) is designed to adjust the distance matrix for effective clustering of positive and negative samples during loss calculation. Finally, extensive experiments are conducted on MOT17 and MOT20. For MOT17: 79.4 MOTA, 78.9 IDF1, and 63.9 HOTA; For MOT20: 77.0 MOTA, 76.3 IDF1, and 62.3 HOTA are achieved, respectively. Compared to existing MOT methods, our method shows significant improvements in accuracy and stability. The code is available at: https://github.com/420-s/MOT2A.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Signal Processing
Signal Processing 工程技术-工程:电子与电气
CiteScore
9.20
自引率
9.10%
发文量
309
审稿时长
41 days
期刊介绍: Signal Processing incorporates all aspects of the theory and practice of signal processing. It features original research work, tutorial and review articles, and accounts of practical developments. It is intended for a rapid dissemination of knowledge and experience to engineers and scientists working in the research, development or practical application of signal processing. Subject areas covered by the journal include: Signal Theory; Stochastic Processes; Detection and Estimation; Spectral Analysis; Filtering; Signal Processing Systems; Software Developments; Image Processing; Pattern Recognition; Optical Signal Processing; Digital Signal Processing; Multi-dimensional Signal Processing; Communication Signal Processing; Biomedical Signal Processing; Geophysical and Astrophysical Signal Processing; Earth Resources Signal Processing; Acoustic and Vibration Signal Processing; Data Processing; Remote Sensing; Signal Processing Technology; Radar Signal Processing; Sonar Signal Processing; Industrial Applications; New Applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信