ShipsMOT: A Comprehensive Benchmark and Framework for Multiobject Tracking of Ships

IF 1.3 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Fang Luo, Pengju Jiang, George To Sum Ho, Wenjing Zeng
{"title":"ShipsMOT: A Comprehensive Benchmark and Framework for Multiobject Tracking of Ships","authors":"Fang Luo,&nbsp;Pengju Jiang,&nbsp;George To Sum Ho,&nbsp;Wenjing Zeng","doi":"10.1049/cvi2.70042","DOIUrl":null,"url":null,"abstract":"<p>Multiobject tracking of ships is crucial for various applications, such as maritime security and the development of ship autopilot systems. However, existing ship visual datasets primarily focus on ship detection tasks, lacking a fully open-source dataset for multiobject tracking research. Furthermore, current methods often struggle with extracting appearance features under complex sea conditions, varying scales and different ship types, affecting tracking precision. To address these issues, we propose ShipsMOT, a new benchmark dataset containing 121 video sequences with an average of 15.45 s per sequence, covering 15 distinct ship types and a total of 237,999 annotated bounding boxes. Additionally, we propose JDR-CSTrack, a ship multiobject tracking framework that improves feature extraction at different scales by optimising a joint detection and Re-ID network. JDR-CSTrack utilises the fusion of appearance and motion features for multilevel data association, thereby minimising track loss and ID switches. Experimental results confirm that ShipsMOT can serve as a benchmark for future research in ship multiobject tracking and validate the superiority of the proposed JDR-CSTrack framework. The dataset and code can be found on https://github.com/jpj0916/ShipsMOT.</p>","PeriodicalId":56304,"journal":{"name":"IET Computer Vision","volume":"19 1","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/cvi2.70042","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Computer Vision","FirstCategoryId":"94","ListUrlMain":"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/cvi2.70042","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Multiobject tracking of ships is crucial for various applications, such as maritime security and the development of ship autopilot systems. However, existing ship visual datasets primarily focus on ship detection tasks, lacking a fully open-source dataset for multiobject tracking research. Furthermore, current methods often struggle with extracting appearance features under complex sea conditions, varying scales and different ship types, affecting tracking precision. To address these issues, we propose ShipsMOT, a new benchmark dataset containing 121 video sequences with an average of 15.45 s per sequence, covering 15 distinct ship types and a total of 237,999 annotated bounding boxes. Additionally, we propose JDR-CSTrack, a ship multiobject tracking framework that improves feature extraction at different scales by optimising a joint detection and Re-ID network. JDR-CSTrack utilises the fusion of appearance and motion features for multilevel data association, thereby minimising track loss and ID switches. Experimental results confirm that ShipsMOT can serve as a benchmark for future research in ship multiobject tracking and validate the superiority of the proposed JDR-CSTrack framework. The dataset and code can be found on https://github.com/jpj0916/ShipsMOT.

Abstract Image

ShipsMOT:船舶多目标跟踪的综合基准和框架
船舶的多目标跟踪对于各种应用至关重要,例如海上安全和船舶自动驾驶系统的开发。然而,现有的船舶视觉数据集主要集中在船舶检测任务上,缺乏一个完全开源的多目标跟踪研究数据集。此外,目前的方法往往难以在复杂的海况、不同的尺度和不同的船型下提取外观特征,影响了跟踪精度。为了解决这些问题,我们提出了ShipsMOT,这是一个新的基准数据集,包含121个视频序列,每个序列平均15.45秒,涵盖15种不同的船舶类型和总共237,999个带注释的边界框。此外,我们提出了JDR-CSTrack,这是一种船舶多目标跟踪框架,通过优化联合检测和Re-ID网络来改进不同尺度下的特征提取。JDR-CSTrack利用外观和运动特征的融合进行多级数据关联,从而最大限度地减少轨道损失和ID切换。实验结果表明,ShipsMOT可以作为舰船多目标跟踪研究的基准,并验证了所提出的JDR-CSTrack框架的优越性。数据集和代码可以在https://github.com/jpj0916/ShipsMOT上找到。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IET Computer Vision
IET Computer Vision 工程技术-工程:电子与电气
CiteScore
3.30
自引率
11.80%
发文量
76
审稿时长
3.4 months
期刊介绍: IET Computer Vision seeks original research papers in a wide range of areas of computer vision. The vision of the journal is to publish the highest quality research work that is relevant and topical to the field, but not forgetting those works that aim to introduce new horizons and set the agenda for future avenues of research in computer vision. IET Computer Vision welcomes submissions on the following topics: Biologically and perceptually motivated approaches to low level vision (feature detection, etc.); Perceptual grouping and organisation Representation, analysis and matching of 2D and 3D shape Shape-from-X Object recognition Image understanding Learning with visual inputs Motion analysis and object tracking Multiview scene analysis Cognitive approaches in low, mid and high level vision Control in visual systems Colour, reflectance and light Statistical and probabilistic models Face and gesture Surveillance Biometrics and security Robotics Vehicle guidance Automatic model aquisition Medical image analysis and understanding Aerial scene analysis and remote sensing Deep learning models in computer vision Both methodological and applications orientated papers are welcome. Manuscripts submitted are expected to include a detailed and analytical review of the literature and state-of-the-art exposition of the original proposed research and its methodology, its thorough experimental evaluation, and last but not least, comparative evaluation against relevant and state-of-the-art methods. Submissions not abiding by these minimum requirements may be returned to authors without being sent to review. Special Issues Current Call for Papers: Computer Vision for Smart Cameras and Camera Networks - https://digital-library.theiet.org/files/IET_CVI_SC.pdf Computer Vision for the Creative Industries - https://digital-library.theiet.org/files/IET_CVI_CVCI.pdf
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信