Unsupervised domain adaptation framework with global-local adversarial learning and masked image consistency for fish counting in deep-sea aquaculture

Impact Factor: 8.0 · CAS Tier 2 (Computer Science) · JCR Q1 (Automation & Control Systems)
Hanchi Liu, Xin Ma
{"title":"Unsupervised domain adaptation framework with global-local adversarial learning and masked image consistency for fish counting in deep-sea aquaculture","authors":"Hanchi Liu,&nbsp;Xin Ma","doi":"10.1016/j.engappai.2025.111735","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate fish counting is crucial for effective management in deep-sea aquaculture. However, the diverse deep-sea aquaculture environments pose significant challenges to the generalizability of fish counting models. Current methods rely heavily on extensive labeled datasets and struggle to adapt to unseen scenarios. To address these limitations, this study proposes an unsupervised domain adaptation framework that leverages global-local adversarial learning and masked image consistency for cross-domain fish counting. A global-local discriminator is designed to extract the domain-invariant features by aligning the density map prediction across domains at the image and patch levels. Additionally, a masked image consistency module is designed to enhance the utilization of spatial context in the target image by enforcing prediction consistency between masked and complete target images. To validate our approach, we established four fish counting datasets using cameras with varying lighting conditions and viewpoints from two actual deep-sea cages. The framework was evaluated on three domain adaptation tasks: (1) natural to artificial lighting, (2) fixed to free viewpoints, and (3) “Shenlan 1” to “Genghai 1” cages. Experimental results demonstrate that the proposed method significantly improves generalization to diverse aquaculture scenarios without requiring additional data annotations. It outperformed state-of-the-art methods, reducing the mean absolute percentage error by 17.34 % for cross-view counting and 6.77 % for cross-cage counting.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111735"},"PeriodicalIF":8.0000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625017373","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Citations: 0

Abstract

Accurate fish counting is crucial for effective management in deep-sea aquaculture. However, the diverse deep-sea aquaculture environments pose significant challenges to the generalizability of fish counting models. Current methods rely heavily on extensive labeled datasets and struggle to adapt to unseen scenarios. To address these limitations, this study proposes an unsupervised domain adaptation framework that leverages global-local adversarial learning and masked image consistency for cross-domain fish counting. A global-local discriminator is designed to extract the domain-invariant features by aligning the density map prediction across domains at the image and patch levels. Additionally, a masked image consistency module is designed to enhance the utilization of spatial context in the target image by enforcing prediction consistency between masked and complete target images. To validate our approach, we established four fish counting datasets using cameras with varying lighting conditions and viewpoints from two actual deep-sea cages. The framework was evaluated on three domain adaptation tasks: (1) natural to artificial lighting, (2) fixed to free viewpoints, and (3) “Shenlan 1” to “Genghai 1” cages. Experimental results demonstrate that the proposed method significantly improves generalization to diverse aquaculture scenarios without requiring additional data annotations. It outperformed state-of-the-art methods, reducing the mean absolute percentage error by 17.34 % for cross-view counting and 6.77 % for cross-cage counting.
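The abstract names two unsupervised components: a global-local discriminator that aligns density-map predictions across domains at the image and patch levels, and a masked image consistency term on unlabeled target images. The sketch below, in PyTorch, is a minimal illustration of how such loss terms could be wired together; the tiny networks, the least-squares GAN labels, the patch size of 32, and the 0.5 mask ratio are assumptions made for illustration and are not taken from the paper.

```python
# Minimal sketch of global-local adversarial alignment plus masked image
# consistency for a density-map counting network. All shapes and hyperparameters
# are illustrative assumptions, not values from the paper.
import torch
import torch.nn as nn
import torch.nn.functional as F


class TinyDensityNet(nn.Module):
    """Placeholder density-map regressor (stand-in for the paper's counting network)."""

    def __init__(self):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 16, 3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 1, 1),
        )

    def forward(self, x):
        return self.body(x)  # (B, 1, H, W) predicted density map


class GlobalLocalDiscriminator(nn.Module):
    """Scores a density map as source vs. target at the image level (global head)
    and per spatial cell (local head); both heads share a small conv trunk."""

    def __init__(self):
        super().__init__()
        self.trunk = nn.Sequential(
            nn.Conv2d(1, 16, 3, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.LeakyReLU(0.2),
        )
        self.local_head = nn.Conv2d(32, 1, 1)   # one logit per spatial cell (patch level)
        self.global_head = nn.Linear(32, 1)     # one logit per image (image level)

    def forward(self, density):
        feat = self.trunk(density)
        local_logits = self.local_head(feat)                      # (B, 1, h, w)
        global_logits = self.global_head(feat.mean(dim=(2, 3)))   # (B, 1)
        return global_logits, local_logits


def discriminator_loss(disc, src_density, tgt_density):
    """Discriminator side (least-squares GAN labels assumed):
    source density maps -> 1, target density maps -> 0, at both levels."""
    g_s, l_s = disc(src_density.detach())
    g_t, l_t = disc(tgt_density.detach())
    return (F.mse_loss(g_s, torch.ones_like(g_s)) + F.mse_loss(l_s, torch.ones_like(l_s))
            + F.mse_loss(g_t, torch.zeros_like(g_t)) + F.mse_loss(l_t, torch.zeros_like(l_t)))


def alignment_loss(disc, tgt_density):
    """Counting-network side: push target-domain density maps to be scored
    as 'source' by both heads, encouraging domain-invariant predictions."""
    g_t, l_t = disc(tgt_density)
    return F.mse_loss(g_t, torch.ones_like(g_t)) + F.mse_loss(l_t, torch.ones_like(l_t))


def random_patch_mask(images, patch=32, mask_ratio=0.5):
    """Zero out a random subset of non-overlapping patches in each image."""
    b, _, h, w = images.shape
    keep = (torch.rand(b, 1, h // patch, w // patch, device=images.device) > mask_ratio).float()
    return images * F.interpolate(keep, size=(h, w), mode="nearest")


def masked_consistency_loss(model, target_images):
    """Match the prediction on a masked target image to the prediction on the
    complete image, which is treated as a fixed pseudo-label."""
    with torch.no_grad():
        full_pred = model(target_images)
    return F.mse_loss(model(random_patch_mask(target_images)), full_pred)


if __name__ == "__main__":
    model, disc = TinyDensityNet(), GlobalLocalDiscriminator()
    src = torch.rand(2, 3, 128, 128)   # labeled source-domain batch (labels omitted here)
    tgt = torch.rand(2, 3, 128, 128)   # unlabeled target-domain batch

    # In training, the counting network and discriminator would be updated in
    # alternation (or via a gradient-reversal layer); here we only evaluate
    # the unsupervised loss terms once.
    d_loss = discriminator_loss(disc, model(src), model(tgt))
    g_loss = alignment_loss(disc, model(tgt)) + masked_consistency_loss(model, tgt)
    g_loss.backward()
    print(f"discriminator loss: {d_loss.item():.4f}, adaptation loss: {g_loss.item():.4f}")
```

In a full pipeline these unsupervised terms would be added to the supervised density-regression loss on the labeled source domain, with the discriminator and counting network optimized in alternation.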
Source journal

Engineering Applications of Artificial Intelligence (Engineering & Technology — Engineering: Electrical & Electronic)
CiteScore: 9.60
Self-citation rate: 10.00%
Articles per year: 505
Review time: 68 days
Journal description: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.