海参检测的域泛化：处理水产养殖环境中的背景颜色变化

IF 2.2 3区农林科学 Q2 FISHERIES

Aquaculture International Pub Date : 2025-05-14 DOI:10.1007/s10499-025-02022-8

Fangqun Niu, Yifan Sheng, Junyi Wang, Xinyu Zheng, Kexin Liu, Yuanshan Lin, Wei Wang, GuoDong Li

{"title":"海参检测的域泛化：处理水产养殖环境中的背景颜色变化","authors":"Fangqun Niu, Yifan Sheng, Junyi Wang, Xinyu Zheng, Kexin Liu, Yuanshan Lin, Wei Wang, GuoDong Li","doi":"10.1007/s10499-025-02022-8","DOIUrl":null,"url":null,"abstract":"<div><p>In underwater aquaculture environments, variations in image styles caused by factors such as lighting conditions, water quality, and plankton presence introduce significant domain shift challenges for object detection tasks. To address these challenges, this paper proposes a novel sea cucumber detection model, UICTDG-YOLO, which utilizes color domain generalization techniques. Specifically, we employ a frequency-domain enhancement method based on the Fourier transform. This technique reconstructs images by perturbing the amplitude spectrum while retaining the original phase spectrum, effectively enhancing color consistency across diverse aquatic environments. Additionally, a parameterized compensation mechanism is integrated to preserve target information, thereby augmenting the dataset, increasing domain diversity, and improving the model’s generalization capability. One of the major challenges in sea cucumber detection is distinguishing target features from complex background elements. To address this, we integrate a SENetv2-based compression and aggregation network into the model backbone, enhancing its ability to extract key target features from cluttered underwater environments. Furthermore, considering the substantial shape and scale variations of sea cucumbers across different aquaculture environments, as well as the presence of background objects with textures resembling sea cucumbers, we incorporate a Bidirectional Feature Pyramid Network (BiFPN) module into the network’s neck. This module facilitates multi-scale feature fusion, improving detection accuracy across varying scales and effectively reducing background interference. Given the class imbalance in the dataset, we employ the Focal-GIoU loss function to address the imbalance between positive and negative samples while improving the accuracy of bounding box regression. Experimental results demonstrate that UICTDG-YOLO significantly outperforms the baseline model, achieving a 5.8% improvement in mean average precision (mAP), a 5.8% improvement in precision (<i>P</i>), and a 6.2% improvement in recall (<i>R</i>). The model consists of 10.4 million parameters, with a computational load of 27.5 GFLOPs and a detection rate of 28.5 frames per second. When compared to prominent object detection models, including Faster R-CNN, YOLO series models, WQTDG-YOLO, and OA-DG, UICTDG-YOLO demonstrates clear advantages in sea cucumber detection tasks within real aquaculture environments. This model provides valuable technical insights and practical applications for the scientific farming of sea cucumbers.</p></div>","PeriodicalId":8122,"journal":{"name":"Aquaculture International","volume":"33 5","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Domain generalization for sea cucumber detection: Tackling background color variability in aquaculture settings\",\"authors\":\"Fangqun Niu, Yifan Sheng, Junyi Wang, Xinyu Zheng, Kexin Liu, Yuanshan Lin, Wei Wang, GuoDong Li\",\"doi\":\"10.1007/s10499-025-02022-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In underwater aquaculture environments, variations in image styles caused by factors such as lighting conditions, water quality, and plankton presence introduce significant domain shift challenges for object detection tasks. To address these challenges, this paper proposes a novel sea cucumber detection model, UICTDG-YOLO, which utilizes color domain generalization techniques. Specifically, we employ a frequency-domain enhancement method based on the Fourier transform. This technique reconstructs images by perturbing the amplitude spectrum while retaining the original phase spectrum, effectively enhancing color consistency across diverse aquatic environments. Additionally, a parameterized compensation mechanism is integrated to preserve target information, thereby augmenting the dataset, increasing domain diversity, and improving the model’s generalization capability. One of the major challenges in sea cucumber detection is distinguishing target features from complex background elements. To address this, we integrate a SENetv2-based compression and aggregation network into the model backbone, enhancing its ability to extract key target features from cluttered underwater environments. Furthermore, considering the substantial shape and scale variations of sea cucumbers across different aquaculture environments, as well as the presence of background objects with textures resembling sea cucumbers, we incorporate a Bidirectional Feature Pyramid Network (BiFPN) module into the network’s neck. This module facilitates multi-scale feature fusion, improving detection accuracy across varying scales and effectively reducing background interference. Given the class imbalance in the dataset, we employ the Focal-GIoU loss function to address the imbalance between positive and negative samples while improving the accuracy of bounding box regression. Experimental results demonstrate that UICTDG-YOLO significantly outperforms the baseline model, achieving a 5.8% improvement in mean average precision (mAP), a 5.8% improvement in precision (<i>P</i>), and a 6.2% improvement in recall (<i>R</i>). The model consists of 10.4 million parameters, with a computational load of 27.5 GFLOPs and a detection rate of 28.5 frames per second. When compared to prominent object detection models, including Faster R-CNN, YOLO series models, WQTDG-YOLO, and OA-DG, UICTDG-YOLO demonstrates clear advantages in sea cucumber detection tasks within real aquaculture environments. This model provides valuable technical insights and practical applications for the scientific farming of sea cucumbers.</p></div>\",\"PeriodicalId\":8122,\"journal\":{\"name\":\"Aquaculture International\",\"volume\":\"33 5\",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-05-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Aquaculture International\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10499-025-02022-8\",\"RegionNum\":3,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"FISHERIES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aquaculture International","FirstCategoryId":"97","ListUrlMain":"https://link.springer.com/article/10.1007/s10499-025-02022-8","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"FISHERIES","Score":null,"Total":0}

引用次数: 0

摘要

在水下水产养殖环境中，由光照条件、水质和浮游生物存在等因素引起的图像样式变化给目标检测任务带来了重大的域漂移挑战。为了解决这些问题，本文提出了一种利用色域泛化技术的新型海参检测模型UICTDG-YOLO。具体来说，我们采用了一种基于傅里叶变换的频域增强方法。该技术通过扰动振幅谱来重建图像，同时保留原始相位谱，有效增强了不同水生环境的颜色一致性。此外，还集成了参数化补偿机制来保留目标信息，从而增强了数据集，增加了领域多样性，提高了模型的泛化能力。如何从复杂的背景元素中区分目标特征是海参检测的主要挑战之一。为了解决这个问题，我们将基于senetv2的压缩和聚合网络集成到模型骨干中，增强其从混乱的水下环境中提取关键目标特征的能力。此外，考虑到海参在不同养殖环境中的形状和尺度的巨大变化，以及具有类似海参纹理的背景物体的存在，我们将双向特征金字塔网络（Bidirectional Feature Pyramid Network, BiFPN）模块纳入网络颈部。该模块促进了多尺度特征融合，提高了不同尺度的检测精度，有效降低了背景干扰。考虑到数据集中的类不平衡，我们采用Focal-GIoU损失函数来解决正负样本之间的不平衡，同时提高了边界盒回归的精度。实验结果表明，UICTDG-YOLO显著优于基线模型，平均精度（mAP）提高5.8%，精度(P)提高5.8%，召回率(R)提高6.2%。该模型包含1040万个参数，计算负载为27.5 GFLOPs，检测速率为28.5帧/秒。与Faster R-CNN、YOLO系列模型、WQTDG-YOLO、OA-DG等知名目标检测模型相比，UICTDG-YOLO在真实水产养殖环境下的海参检测任务中表现出明显的优势。该模型为科学养殖海参提供了有价值的技术见解和实际应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Domain generalization for sea cucumber detection: Tackling background color variability in aquaculture settings

In underwater aquaculture environments, variations in image styles caused by factors such as lighting conditions, water quality, and plankton presence introduce significant domain shift challenges for object detection tasks. To address these challenges, this paper proposes a novel sea cucumber detection model, UICTDG-YOLO, which utilizes color domain generalization techniques. Specifically, we employ a frequency-domain enhancement method based on the Fourier transform. This technique reconstructs images by perturbing the amplitude spectrum while retaining the original phase spectrum, effectively enhancing color consistency across diverse aquatic environments. Additionally, a parameterized compensation mechanism is integrated to preserve target information, thereby augmenting the dataset, increasing domain diversity, and improving the model’s generalization capability. One of the major challenges in sea cucumber detection is distinguishing target features from complex background elements. To address this, we integrate a SENetv2-based compression and aggregation network into the model backbone, enhancing its ability to extract key target features from cluttered underwater environments. Furthermore, considering the substantial shape and scale variations of sea cucumbers across different aquaculture environments, as well as the presence of background objects with textures resembling sea cucumbers, we incorporate a Bidirectional Feature Pyramid Network (BiFPN) module into the network’s neck. This module facilitates multi-scale feature fusion, improving detection accuracy across varying scales and effectively reducing background interference. Given the class imbalance in the dataset, we employ the Focal-GIoU loss function to address the imbalance between positive and negative samples while improving the accuracy of bounding box regression. Experimental results demonstrate that UICTDG-YOLO significantly outperforms the baseline model, achieving a 5.8% improvement in mean average precision (mAP), a 5.8% improvement in precision (P), and a 6.2% improvement in recall (R). The model consists of 10.4 million parameters, with a computational load of 27.5 GFLOPs and a detection rate of 28.5 frames per second. When compared to prominent object detection models, including Faster R-CNN, YOLO series models, WQTDG-YOLO, and OA-DG, UICTDG-YOLO demonstrates clear advantages in sea cucumber detection tasks within real aquaculture environments. This model provides valuable technical insights and practical applications for the scientific farming of sea cucumbers.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Aquaculture International 农林科学-渔业

CiteScore

5.10

自引率

6.90%

发文量

204

审稿时长

1.0 months

期刊介绍： Aquaculture International is an international journal publishing original research papers, short communications, technical notes and review papers on all aspects of aquaculture. The Journal covers topics such as the biology, physiology, pathology and genetics of cultured fish, crustaceans, molluscs and plants, especially new species; water quality of supply systems, fluctuations in water quality within farms and the environmental impacts of aquacultural operations; nutrition, feeding and stocking practices, especially as they affect the health and growth rates of cultured species; sustainable production techniques; bioengineering studies on the design and management of offshore and land-based systems; the improvement of quality and marketing of farmed products; sociological and societal impacts of aquaculture, and more. This is the official Journal of the European Aquaculture Society.