A Multi-Modal Fusion-Based 3D Multi-Object Tracking Framework With Joint Detection

IF 4.6 2区 计算机科学 Q2 ROBOTICS
Xiyang Wang;Chunyun Fu;Jiawei He;Mingguang Huang;Ting Meng;Siyu Zhang;Hangning Zhou;Ziyao Xu;Chi Zhang
{"title":"A Multi-Modal Fusion-Based 3D Multi-Object Tracking Framework With Joint Detection","authors":"Xiyang Wang;Chunyun Fu;Jiawei He;Mingguang Huang;Ting Meng;Siyu Zhang;Hangning Zhou;Ziyao Xu;Chi Zhang","doi":"10.1109/LRA.2024.3511438","DOIUrl":null,"url":null,"abstract":"In the classical tracking-by-detection (TBD) paradigm, detection and tracking are separately and sequentially conducted, and data association must be properly performed to achieve satisfactory tracking performance. In this letter, a new multi-object tracking framework is proposed, which integrates object detection and multi-object tracking into a single model. The proposed tracking framework eliminates the complex data association process in the classical TBD paradigm, and requires no additional training. Secondly, the regression confidence of historical trajectories is investigated, and the possible states of a trajectory (weak object or strong object) in the current frame are predicted. Then, a confidence fusion module is designed to guide non-maximum suppression for trajectories and detections to achieve ordered and robust tracking. Thirdly, by integrating historical trajectory features, the regression performance of the detector is enhanced, which better reflects the occlusion and disappearance patterns of objects in real world. Lastly, extensive experiments are conducted on the commonly used KITTI and Waymo datasets. The results show that the proposed framework can achieve robust tracking by using only a 2D detector and a 3D detector, and it is proven more accurate than many of the state-of-the-art TBD-based multi-modal tracking methods.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"10 1","pages":"532-539"},"PeriodicalIF":4.6000,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Robotics and Automation Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10777493/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}
引用次数: 0

Abstract

In the classical tracking-by-detection (TBD) paradigm, detection and tracking are separately and sequentially conducted, and data association must be properly performed to achieve satisfactory tracking performance. In this letter, a new multi-object tracking framework is proposed, which integrates object detection and multi-object tracking into a single model. The proposed tracking framework eliminates the complex data association process in the classical TBD paradigm, and requires no additional training. Secondly, the regression confidence of historical trajectories is investigated, and the possible states of a trajectory (weak object or strong object) in the current frame are predicted. Then, a confidence fusion module is designed to guide non-maximum suppression for trajectories and detections to achieve ordered and robust tracking. Thirdly, by integrating historical trajectory features, the regression performance of the detector is enhanced, which better reflects the occlusion and disappearance patterns of objects in real world. Lastly, extensive experiments are conducted on the commonly used KITTI and Waymo datasets. The results show that the proposed framework can achieve robust tracking by using only a 2D detector and a 3D detector, and it is proven more accurate than many of the state-of-the-art TBD-based multi-modal tracking methods.
在经典的 "通过检测跟踪"(TBD)范例中,检测和跟踪是分开并按顺序进行的,必须正确进行数据关联才能获得令人满意的跟踪性能。本文提出了一种新的多目标跟踪框架,它将目标检测和多目标跟踪整合到一个模型中。所提出的跟踪框架省去了经典 TBD 范式中复杂的数据关联过程,而且不需要额外的训练。其次,研究历史轨迹的回归置信度,预测当前帧中轨迹的可能状态(弱目标或强目标)。然后,设计一个置信度融合模块,引导对轨迹和检测进行非最大化抑制,从而实现有序和稳健的跟踪。第三,通过整合历史轨迹特征,提高检测器的回归性能,从而更好地反映真实世界中物体的遮挡和消失模式。最后,我们在常用的 KITTI 和 Waymo 数据集上进行了大量实验。实验结果表明,所提出的框架只需使用一个二维检测器和一个三维检测器就能实现鲁棒跟踪,而且比许多最先进的基于 TBD 的多模态跟踪方法更加精确。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Robotics and Automation Letters
IEEE Robotics and Automation Letters Computer Science-Computer Science Applications
CiteScore
9.60
自引率
15.40%
发文量
1428
期刊介绍: The scope of this journal is to publish peer-reviewed articles that provide a timely and concise account of innovative research ideas and application results, reporting significant theoretical findings and application case studies in areas of robotics and automation.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信