Meng Wu , Yifeng Cui , Rong Min , Shanghang Jiang , Lei Zhang , Jing Yu
{"title":"WaterBox: Weakly supervised underwater instance segmentation and a new benchmark","authors":"Meng Wu , Yifeng Cui , Rong Min , Shanghang Jiang , Lei Zhang , Jing Yu","doi":"10.1016/j.neucom.2025.131582","DOIUrl":null,"url":null,"abstract":"<div><div>Box-supervised instance segmentation has gained increasing attention due to its reliance on weak box annotations, which are considerably less expensive than pixel-wise mask annotations. Despite the advantage, existing methods in this category often struggle in complex underwater scenes, where degraded image quality causes foreground objects to become heavily entangled with the background. To address this issue, we propose WaterBox, a cost-effective box-supervised underwater instance segmentation method. Considering the intrinsic characteristics of underwater imaging, we introduce a novel pairwise loss function that leverages a mixed color affinity map with a dynamic threshold to effectively disambiguate foreground and background boundaries. Additionally, we devise a bounding box refinement strategy that generates tight and accurate bounding boxes for each instance, alleviating the negative impact of imprecise box annotations on segmentation performance. Furthermore, to fill in the gaps caused by data scarcity, we construct the first diver instance segmentation dataset, DSeg, which consists of 2000 underwater images with high-quality instance masks. Extensive experiments on two underwater datasets demonstrate the superiority of our approach over the state-of-the-art (SOTA) weakly supervised methods. The code and dataset will be made publicly available.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"657 ","pages":"Article 131582"},"PeriodicalIF":6.5000,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225022544","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Box-supervised instance segmentation has gained increasing attention due to its reliance on weak box annotations, which are considerably less expensive than pixel-wise mask annotations. Despite the advantage, existing methods in this category often struggle in complex underwater scenes, where degraded image quality causes foreground objects to become heavily entangled with the background. To address this issue, we propose WaterBox, a cost-effective box-supervised underwater instance segmentation method. Considering the intrinsic characteristics of underwater imaging, we introduce a novel pairwise loss function that leverages a mixed color affinity map with a dynamic threshold to effectively disambiguate foreground and background boundaries. Additionally, we devise a bounding box refinement strategy that generates tight and accurate bounding boxes for each instance, alleviating the negative impact of imprecise box annotations on segmentation performance. Furthermore, to fill in the gaps caused by data scarcity, we construct the first diver instance segmentation dataset, DSeg, which consists of 2000 underwater images with high-quality instance masks. Extensive experiments on two underwater datasets demonstrate the superiority of our approach over the state-of-the-art (SOTA) weakly supervised methods. The code and dataset will be made publicly available.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.