DB-Tracker: Multi-Object Tracking for Drone Aerial Video Based on Box-MeMBer and MB-OSNet

IF 4.4 2区 地球科学 Q1 REMOTE SENSING
Drones Pub Date : 2023-09-27 DOI:10.3390/drones7100607
Yubin Yuan, Yiquan Wu, Langyue Zhao, Jinlin Chen, Qichang Zhao
{"title":"DB-Tracker: Multi-Object Tracking for Drone Aerial Video Based on Box-MeMBer and MB-OSNet","authors":"Yubin Yuan, Yiquan Wu, Langyue Zhao, Jinlin Chen, Qichang Zhao","doi":"10.3390/drones7100607","DOIUrl":null,"url":null,"abstract":"Drone aerial videos offer a promising future in modern digital media and remote sensing applications, but effectively tracking several objects in these recordings is difficult. Drone aerial footage typically includes complicated sceneries with moving objects, such as people, vehicles, and animals. Complicated scenarios such as large-scale viewing angle shifts and object crossings may occur simultaneously. Random finite sets are mixed in a detection-based tracking framework, taking the object’s location and appearance into account. It maintains the detection box information of the detected object and constructs the Box-MeMBer object position prediction framework based on the MeMBer random finite set point object tracking. We develop a hierarchical connection structure in the OSNet network, build MB-OSNet to get the object appearance information, and connect feature maps of different levels through the hierarchy such that the network may obtain rich semantic information at different sizes. Similarity measurements are computed and collected for all detections and trajectories in a cost matrix that estimates the likelihood of all possible matches. The cost matrix entries compare the similarity of tracks and detections in terms of position and appearance. The DB-Tracker algorithm performs excellently in multi-target tracking of drone aerial videos, achieving MOTA of 37.4% and 46.2% on the VisDrone and UAVDT data sets, respectively. DB-Tracker achieves high robustness by comprehensively considering the object position and appearance information, especially in handling complex scenes and target occlusion. This makes DB-Tracker a powerful tool in challenging applications such as drone aerial videos.","PeriodicalId":36448,"journal":{"name":"Drones","volume":"38 1","pages":"0"},"PeriodicalIF":4.4000,"publicationDate":"2023-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Drones","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/drones7100607","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}
引用次数: 0

Abstract

Drone aerial videos offer a promising future in modern digital media and remote sensing applications, but effectively tracking several objects in these recordings is difficult. Drone aerial footage typically includes complicated sceneries with moving objects, such as people, vehicles, and animals. Complicated scenarios such as large-scale viewing angle shifts and object crossings may occur simultaneously. Random finite sets are mixed in a detection-based tracking framework, taking the object’s location and appearance into account. It maintains the detection box information of the detected object and constructs the Box-MeMBer object position prediction framework based on the MeMBer random finite set point object tracking. We develop a hierarchical connection structure in the OSNet network, build MB-OSNet to get the object appearance information, and connect feature maps of different levels through the hierarchy such that the network may obtain rich semantic information at different sizes. Similarity measurements are computed and collected for all detections and trajectories in a cost matrix that estimates the likelihood of all possible matches. The cost matrix entries compare the similarity of tracks and detections in terms of position and appearance. The DB-Tracker algorithm performs excellently in multi-target tracking of drone aerial videos, achieving MOTA of 37.4% and 46.2% on the VisDrone and UAVDT data sets, respectively. DB-Tracker achieves high robustness by comprehensively considering the object position and appearance information, especially in handling complex scenes and target occlusion. This makes DB-Tracker a powerful tool in challenging applications such as drone aerial videos.
基于Box-MeMBer和MB-OSNet的无人机航拍视频多目标跟踪
无人机航拍视频在现代数字媒体和遥感应用中具有广阔的前景,但在这些记录中有效跟踪多个目标是困难的。无人机的航拍镜头通常包括带有移动物体的复杂场景,如人、车辆和动物。复杂的场景,如大范围的视角移动和物体交叉可能同时发生。随机有限集混合在基于检测的跟踪框架中,考虑到目标的位置和外观。维护被检测目标的检测框信息,并基于成员随机有限设定点目标跟踪构造box -MeMBer目标位置预测框架。我们在OSNet网络中开发了层次连接结构,构建MB-OSNet获取对象外观信息,并通过层次连接不同层次的特征图,使网络可以获得不同规模的丰富语义信息。在成本矩阵中计算和收集所有检测和轨迹的相似性测量值,以估计所有可能匹配的可能性。成本矩阵条目比较轨道和检测在位置和外观方面的相似性。DB-Tracker算法在无人机航拍视频的多目标跟踪中表现优异,在VisDrone和UAVDT数据集上的MOTA分别达到37.4%和46.2%。DB-Tracker通过综合考虑目标位置和外观信息,实现了较高的鲁棒性,特别是在处理复杂场景和目标遮挡时。这使得DB-Tracker成为具有挑战性的应用程序(如无人机航拍视频)的强大工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Drones
Drones Engineering-Aerospace Engineering
CiteScore
5.60
自引率
18.80%
发文量
331
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信