{"title":"PUTrack:通过渐进提示改进水下目标跟踪","authors":"Qiuyang Zhang;Wei Song;Cong Liu;Minghua Zhang","doi":"10.1109/TII.2025.3538056","DOIUrl":null,"url":null,"abstract":"Existing underwater tracking methods can be categorized into two paradigms: first, “enhance-then-track”—first enhancing the quality of the input image, then employing an open-air tracker; second, “track-then-process”—initially using an open-air tracker, followed by calibrating the prediction box. These methods that lack unified objectives among the modules impair tracking performance. To overcome this, we propose a novel end-to-end framework called prompting underwater tracking (PUTrack). It adapts the open-air tracker to the scenario-specific (underwater) tracking task by deploying a set of underwater prompters at the lateral side of an existing open-air tracker, and injecting the generated prompts layer-by-layer into the encoder. Experiments on various underwater tracking datasets demonstrate that the method significantly improves the underwater performance of the tracker by introducing only 0.6 M trainable parameters (0.4% of total parameters). Moreover, to drive the development of underwater tracking, we construct a high quality underwater tracking dataset that is manually annotated with 139 k frames, which exceeds the total number of frames in previous underwater tracking datasets. It provides 90 test sets rich in challenge properties and 200 training sets of diverse kinds.","PeriodicalId":13301,"journal":{"name":"IEEE Transactions on Industrial Informatics","volume":"21 5","pages":"3986-3995"},"PeriodicalIF":9.9000,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PUTrack: Improved Underwater Object Tracking via Progressive Prompting\",\"authors\":\"Qiuyang Zhang;Wei Song;Cong Liu;Minghua Zhang\",\"doi\":\"10.1109/TII.2025.3538056\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Existing underwater tracking methods can be categorized into two paradigms: first, “enhance-then-track”—first enhancing the quality of the input image, then employing an open-air tracker; second, “track-then-process”—initially using an open-air tracker, followed by calibrating the prediction box. These methods that lack unified objectives among the modules impair tracking performance. To overcome this, we propose a novel end-to-end framework called prompting underwater tracking (PUTrack). It adapts the open-air tracker to the scenario-specific (underwater) tracking task by deploying a set of underwater prompters at the lateral side of an existing open-air tracker, and injecting the generated prompts layer-by-layer into the encoder. Experiments on various underwater tracking datasets demonstrate that the method significantly improves the underwater performance of the tracker by introducing only 0.6 M trainable parameters (0.4% of total parameters). Moreover, to drive the development of underwater tracking, we construct a high quality underwater tracking dataset that is manually annotated with 139 k frames, which exceeds the total number of frames in previous underwater tracking datasets. It provides 90 test sets rich in challenge properties and 200 training sets of diverse kinds.\",\"PeriodicalId\":13301,\"journal\":{\"name\":\"IEEE Transactions on Industrial Informatics\",\"volume\":\"21 5\",\"pages\":\"3986-3995\"},\"PeriodicalIF\":9.9000,\"publicationDate\":\"2025-02-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Industrial Informatics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10892342/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Industrial Informatics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10892342/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
PUTrack: Improved Underwater Object Tracking via Progressive Prompting
Existing underwater tracking methods can be categorized into two paradigms: first, “enhance-then-track”—first enhancing the quality of the input image, then employing an open-air tracker; second, “track-then-process”—initially using an open-air tracker, followed by calibrating the prediction box. These methods that lack unified objectives among the modules impair tracking performance. To overcome this, we propose a novel end-to-end framework called prompting underwater tracking (PUTrack). It adapts the open-air tracker to the scenario-specific (underwater) tracking task by deploying a set of underwater prompters at the lateral side of an existing open-air tracker, and injecting the generated prompts layer-by-layer into the encoder. Experiments on various underwater tracking datasets demonstrate that the method significantly improves the underwater performance of the tracker by introducing only 0.6 M trainable parameters (0.4% of total parameters). Moreover, to drive the development of underwater tracking, we construct a high quality underwater tracking dataset that is manually annotated with 139 k frames, which exceeds the total number of frames in previous underwater tracking datasets. It provides 90 test sets rich in challenge properties and 200 training sets of diverse kinds.
期刊介绍:
The IEEE Transactions on Industrial Informatics is a multidisciplinary journal dedicated to publishing technical papers that connect theory with practical applications of informatics in industrial settings. It focuses on the utilization of information in intelligent, distributed, and agile industrial automation and control systems. The scope includes topics such as knowledge-based and AI-enhanced automation, intelligent computer control systems, flexible and collaborative manufacturing, industrial informatics in software-defined vehicles and robotics, computer vision, industrial cyber-physical and industrial IoT systems, real-time and networked embedded systems, security in industrial processes, industrial communications, systems interoperability, and human-machine interaction.