UVTD: A Large-Scale Multilabel Data Set for Underwater Vision Tasks

IF 5.3 2区工程技术 Q1 ENGINEERING, CIVIL

IEEE Journal of Oceanic Engineering Pub Date : 2025-01-23 DOI:10.1109/JOE.2024.3503664

Zhengyong Wang;Meng Yu;Lei Cao;Pengyu Liu;Linfeng Wang;Xiang Li;Yixiao Hong;Chang He;Liquan Shen

{"title":"UVTD: A Large-Scale Multilabel Data Set for Underwater Vision Tasks","authors":"Zhengyong Wang;Meng Yu;Lei Cao;Pengyu Liu;Linfeng Wang;Xiang Li;Yixiao Hong;Chang He;Liquan Shen","doi":"10.1109/JOE.2024.3503664","DOIUrl":null,"url":null,"abstract":"With the exploration of ocean resources, underwater vision tasks (UVTs) have attracted increasing interest in recent years. However, the advancement of UVTs is hampered by several challenges, prominently the difficulty in acquiring large-scale data sets with accurate annotations, and the absence of a unified multi-label data set for underwater multitask learning (UMTL). Motivated by the critical need for a large-scale, multilabel data set tailored for UVTs, we present a large-scale multilabel data set for underwater vision tasks (UVTD), which offers solutions to several bottlenecks in UVT research: first, supporting diverse applications. With annotations for two underwater low-level vision tasks (i.e., underwater image enhancement and underwater image quality assessment) and three high-level vision tasks (i.e., underwater semantic segmentation, underwater object detection, and underwater salient object detection), UVTD supports a wide range of underwater applications. Second, enhancing model generalization. UVTD comprises 5380 real underwater images, covering diverse scenes and varied degradation characteristics, improving the robustness of vision algorithms. Finally, facilitating multitask learning. Our multilabel data set enables researchers to explore the correlations between tasks and develop robust UMTL algorithms. Based on UVTD, we propose two UMTL networks tailored to the low-level and high-level tasks separately, serving as benchmarks for future research in the UMTL field. Extensive experiments demonstrate UVTD's superiority across multiple UVTs, and the proposed UMTL networks exhibit competitive performance on these tasks, implying the significant implications of UVTD for future research.","PeriodicalId":13191,"journal":{"name":"IEEE Journal of Oceanic Engineering","volume":"50 2","pages":"898-918"},"PeriodicalIF":5.3000,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Oceanic Engineering","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10850640/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}

引用次数: 0

Abstract

With the exploration of ocean resources, underwater vision tasks (UVTs) have attracted increasing interest in recent years. However, the advancement of UVTs is hampered by several challenges, prominently the difficulty in acquiring large-scale data sets with accurate annotations, and the absence of a unified multi-label data set for underwater multitask learning (UMTL). Motivated by the critical need for a large-scale, multilabel data set tailored for UVTs, we present a large-scale multilabel data set for underwater vision tasks (UVTD), which offers solutions to several bottlenecks in UVT research: first, supporting diverse applications. With annotations for two underwater low-level vision tasks (i.e., underwater image enhancement and underwater image quality assessment) and three high-level vision tasks (i.e., underwater semantic segmentation, underwater object detection, and underwater salient object detection), UVTD supports a wide range of underwater applications. Second, enhancing model generalization. UVTD comprises 5380 real underwater images, covering diverse scenes and varied degradation characteristics, improving the robustness of vision algorithms. Finally, facilitating multitask learning. Our multilabel data set enables researchers to explore the correlations between tasks and develop robust UMTL algorithms. Based on UVTD, we propose two UMTL networks tailored to the low-level and high-level tasks separately, serving as benchmarks for future research in the UMTL field. Extensive experiments demonstrate UVTD's superiority across multiple UVTs, and the proposed UMTL networks exhibit competitive performance on these tasks, implying the significant implications of UVTD for future research.

查看原文本刊更多论文

UVTD：用于水下视觉任务的大规模多标签数据集

随着海洋资源的开发，水下视觉任务（UVTs）近年来受到越来越多的关注。然而，uvt的发展受到一些挑战的阻碍，特别是难以获取具有准确注释的大规模数据集，以及缺乏用于水下多任务学习（UMTL）的统一多标签数据集。由于迫切需要为UVT量身定制的大规模多标签数据集，我们提出了一个用于水下视觉任务（UVTD）的大规模多标签数据集，它为UVT研究中的几个瓶颈提供了解决方案：首先，支持多样化的应用。UVTD通过对两项水下低层次视觉任务（即水下图像增强和水下图像质量评估）和三个高级视觉任务（即水下语义分割、水下目标检测和水下显著目标检测）的注释，支持广泛的水下应用。二是增强模型泛化能力。UVTD包含5380幅真实水下图像，涵盖了多种场景和多种退化特性，提高了视觉算法的鲁棒性。最后，促进多任务学习。我们的多标签数据集使研究人员能够探索任务之间的相关性并开发健壮的UMTL算法。在UVTD的基础上，我们提出了分别针对低级和高级任务定制的两种UMTL网络，作为未来UMTL领域研究的基准。广泛的实验证明了UVTD在多个uvt中的优势，并且所提出的UMTL网络在这些任务中表现出竞争性的性能，这意味着UVTD对未来研究的重要意义。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Journal of Oceanic Engineering 工程技术-工程：大洋

CiteScore

9.60

自引率

12.20%

发文量

审稿时长

12 months

期刊介绍： The IEEE Journal of Oceanic Engineering (ISSN 0364-9059) is the online-only quarterly publication of the IEEE Oceanic Engineering Society (IEEE OES). The scope of the Journal is the field of interest of the IEEE OES, which encompasses all aspects of science, engineering, and technology that address research, development, and operations pertaining to all bodies of water. This includes the creation of new capabilities and technologies from concept design through prototypes, testing, and operational systems to sense, explore, understand, develop, use, and responsibly manage natural resources.