Deep Neural Network With Local–Global Context Aggregation and Self-Distillation for Fish Counting in Deep-Sea Aquaculture

IF 4.3 2区 综合性期刊 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Hanchi Liu;Xin Ma;Haoran Li
{"title":"Deep Neural Network With Local–Global Context Aggregation and Self-Distillation for Fish Counting in Deep-Sea Aquaculture","authors":"Hanchi Liu;Xin Ma;Haoran Li","doi":"10.1109/JSEN.2025.3550322","DOIUrl":null,"url":null,"abstract":"Vision-based fish counting plays a vital role in monitoring breeding density, optimizing feeding strategies, and planning marketing schedules in deep-sea aquaculture. However, large-scale variations in fish and nonuniform background illumination in underwater images make it challenging to accurately count fish in deep-sea cages. Aiming to solve these issues, this study proposes a deep neural network (DNN) with local-global context aggregation and self-distillation called LGSDNet for fish counting and density estimation in deep-sea aquaculture. First, a local-global context aggregation module (LGCAM) is designed to aggregate the dense local multiscale context and global context in images, enabling the network to capture robust feature representations for fish with various scales under various background illumination conditions. Then, a self-distillation module (SDM) is designed to leverage information from the deep layers of the network to guide the learning of the shallow layers, enhancing the representation learning of the network without increasing the inference time. Extensive comparative experiments on the fish counting dataset collected from a deep-sea cage demonstrate the effectiveness of LGSDNet. It achieves a mean absolute error (MAE) of 5.68, a root-mean-squared error (RMSE) of 7.38, and a mean absolute percentage error (MAPE) of 3.27%, outperforming the Baseline with a reduction in the aforementioned metrics by 6.75, 7.92, and 3.42%, respectively. In addition, LGSDNet outperforms state-of-the-art fish and crowd counting methods on the dataset while having only 13.03 M parameters. Generalization experiments further demonstrate the adaptability of LGSDNet to diverse aquaculture environments.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 9","pages":"16411-16424"},"PeriodicalIF":4.3000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10934728/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Vision-based fish counting plays a vital role in monitoring breeding density, optimizing feeding strategies, and planning marketing schedules in deep-sea aquaculture. However, large-scale variations in fish and nonuniform background illumination in underwater images make it challenging to accurately count fish in deep-sea cages. Aiming to solve these issues, this study proposes a deep neural network (DNN) with local-global context aggregation and self-distillation called LGSDNet for fish counting and density estimation in deep-sea aquaculture. First, a local-global context aggregation module (LGCAM) is designed to aggregate the dense local multiscale context and global context in images, enabling the network to capture robust feature representations for fish with various scales under various background illumination conditions. Then, a self-distillation module (SDM) is designed to leverage information from the deep layers of the network to guide the learning of the shallow layers, enhancing the representation learning of the network without increasing the inference time. Extensive comparative experiments on the fish counting dataset collected from a deep-sea cage demonstrate the effectiveness of LGSDNet. It achieves a mean absolute error (MAE) of 5.68, a root-mean-squared error (RMSE) of 7.38, and a mean absolute percentage error (MAPE) of 3.27%, outperforming the Baseline with a reduction in the aforementioned metrics by 6.75, 7.92, and 3.42%, respectively. In addition, LGSDNet outperforms state-of-the-art fish and crowd counting methods on the dataset while having only 13.03 M parameters. Generalization experiments further demonstrate the adaptability of LGSDNet to diverse aquaculture environments.
基于局部全局上下文聚合和自蒸馏的深度神经网络在深海水产养殖中的鱼类计数
在深海水产养殖中,基于视觉的鱼类计数在监测养殖密度、优化饲养策略和规划销售时间表方面发挥着至关重要的作用。然而,由于水下图像中鱼类的大规模变化和背景光照的不均匀性,对深海网箱中的鱼类进行准确计数具有挑战性。为了解决这些问题,本研究提出了一种具有局部-全局上下文聚合和自蒸馏的深度神经网络(DNN),称为LGSDNet,用于深海水产养殖中鱼类计数和密度估计。首先,设计了局部-全局上下文聚合模块(LGCAM),对图像中的密集局部多尺度上下文和全局上下文进行聚合,使网络能够在不同背景光照条件下捕获不同鳞片鱼类的鲁棒特征表示。然后,设计了一个自蒸馏模块(SDM),利用网络深层的信息来指导浅层的学习,在不增加推理时间的情况下增强网络的表示学习。在深海网箱中收集的鱼类计数数据集上进行了大量的对比实验,证明了LGSDNet的有效性。它的平均绝对误差(MAE)为5.68,均方根误差(RMSE)为7.38,平均绝对百分比误差(MAPE)为3.27%,优于基线,分别减少了6.75,7.92和3.42%。此外,LGSDNet在数据集上优于最先进的鱼类和人群计数方法,而只有13.03 M个参数。推广实验进一步证明了LGSDNet对不同水产养殖环境的适应性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Sensors Journal
IEEE Sensors Journal 工程技术-工程:电子与电气
CiteScore
7.70
自引率
14.00%
发文量
2058
审稿时长
5.2 months
期刊介绍: The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following: -Sensor Phenomenology, Modelling, and Evaluation -Sensor Materials, Processing, and Fabrication -Chemical and Gas Sensors -Microfluidics and Biosensors -Optical Sensors -Physical Sensors: Temperature, Mechanical, Magnetic, and others -Acoustic and Ultrasonic Sensors -Sensor Packaging -Sensor Networks -Sensor Applications -Sensor Systems: Signals, Processing, and Interfaces -Actuators and Sensor Power Systems -Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting -Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data) -Sensors in Industrial Practice
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信