基于局部全局上下文聚合和自蒸馏的深度神经网络在深海水产养殖中的鱼类计数

IF 4.3 2区综合性期刊 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Sensors Journal Pub Date : 2025-03-21 DOI:10.1109/JSEN.2025.3550322

Hanchi Liu;Xin Ma;Haoran Li

{"title":"基于局部全局上下文聚合和自蒸馏的深度神经网络在深海水产养殖中的鱼类计数","authors":"Hanchi Liu;Xin Ma;Haoran Li","doi":"10.1109/JSEN.2025.3550322","DOIUrl":null,"url":null,"abstract":"Vision-based fish counting plays a vital role in monitoring breeding density, optimizing feeding strategies, and planning marketing schedules in deep-sea aquaculture. However, large-scale variations in fish and nonuniform background illumination in underwater images make it challenging to accurately count fish in deep-sea cages. Aiming to solve these issues, this study proposes a deep neural network (DNN) with local-global context aggregation and self-distillation called LGSDNet for fish counting and density estimation in deep-sea aquaculture. First, a local-global context aggregation module (LGCAM) is designed to aggregate the dense local multiscale context and global context in images, enabling the network to capture robust feature representations for fish with various scales under various background illumination conditions. Then, a self-distillation module (SDM) is designed to leverage information from the deep layers of the network to guide the learning of the shallow layers, enhancing the representation learning of the network without increasing the inference time. Extensive comparative experiments on the fish counting dataset collected from a deep-sea cage demonstrate the effectiveness of LGSDNet. It achieves a mean absolute error (MAE) of 5.68, a root-mean-squared error (RMSE) of 7.38, and a mean absolute percentage error (MAPE) of 3.27%, outperforming the Baseline with a reduction in the aforementioned metrics by 6.75, 7.92, and 3.42%, respectively. In addition, LGSDNet outperforms state-of-the-art fish and crowd counting methods on the dataset while having only 13.03 M parameters. Generalization experiments further demonstrate the adaptability of LGSDNet to diverse aquaculture environments.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 9","pages":"16411-16424"},"PeriodicalIF":4.3000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep Neural Network With Local–Global Context Aggregation and Self-Distillation for Fish Counting in Deep-Sea Aquaculture\",\"authors\":\"Hanchi Liu;Xin Ma;Haoran Li\",\"doi\":\"10.1109/JSEN.2025.3550322\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Vision-based fish counting plays a vital role in monitoring breeding density, optimizing feeding strategies, and planning marketing schedules in deep-sea aquaculture. However, large-scale variations in fish and nonuniform background illumination in underwater images make it challenging to accurately count fish in deep-sea cages. Aiming to solve these issues, this study proposes a deep neural network (DNN) with local-global context aggregation and self-distillation called LGSDNet for fish counting and density estimation in deep-sea aquaculture. First, a local-global context aggregation module (LGCAM) is designed to aggregate the dense local multiscale context and global context in images, enabling the network to capture robust feature representations for fish with various scales under various background illumination conditions. Then, a self-distillation module (SDM) is designed to leverage information from the deep layers of the network to guide the learning of the shallow layers, enhancing the representation learning of the network without increasing the inference time. Extensive comparative experiments on the fish counting dataset collected from a deep-sea cage demonstrate the effectiveness of LGSDNet. It achieves a mean absolute error (MAE) of 5.68, a root-mean-squared error (RMSE) of 7.38, and a mean absolute percentage error (MAPE) of 3.27%, outperforming the Baseline with a reduction in the aforementioned metrics by 6.75, 7.92, and 3.42%, respectively. In addition, LGSDNet outperforms state-of-the-art fish and crowd counting methods on the dataset while having only 13.03 M parameters. Generalization experiments further demonstrate the adaptability of LGSDNet to diverse aquaculture environments.\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"25 9\",\"pages\":\"16411-16424\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-03-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10934728/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10934728/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

在深海水产养殖中，基于视觉的鱼类计数在监测养殖密度、优化饲养策略和规划销售时间表方面发挥着至关重要的作用。然而，由于水下图像中鱼类的大规模变化和背景光照的不均匀性，对深海网箱中的鱼类进行准确计数具有挑战性。为了解决这些问题，本研究提出了一种具有局部-全局上下文聚合和自蒸馏的深度神经网络（DNN），称为LGSDNet，用于深海水产养殖中鱼类计数和密度估计。首先，设计了局部-全局上下文聚合模块（LGCAM），对图像中的密集局部多尺度上下文和全局上下文进行聚合，使网络能够在不同背景光照条件下捕获不同鳞片鱼类的鲁棒特征表示。然后，设计了一个自蒸馏模块（SDM），利用网络深层的信息来指导浅层的学习，在不增加推理时间的情况下增强网络的表示学习。在深海网箱中收集的鱼类计数数据集上进行了大量的对比实验，证明了LGSDNet的有效性。它的平均绝对误差（MAE）为5.68，均方根误差（RMSE）为7.38，平均绝对百分比误差（MAPE）为3.27%，优于基线，分别减少了6.75,7.92和3.42%。此外，LGSDNet在数据集上优于最先进的鱼类和人群计数方法，而只有13.03 M个参数。推广实验进一步证明了LGSDNet对不同水产养殖环境的适应性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Deep Neural Network With Local–Global Context Aggregation and Self-Distillation for Fish Counting in Deep-Sea Aquaculture

Vision-based fish counting plays a vital role in monitoring breeding density, optimizing feeding strategies, and planning marketing schedules in deep-sea aquaculture. However, large-scale variations in fish and nonuniform background illumination in underwater images make it challenging to accurately count fish in deep-sea cages. Aiming to solve these issues, this study proposes a deep neural network (DNN) with local-global context aggregation and self-distillation called LGSDNet for fish counting and density estimation in deep-sea aquaculture. First, a local-global context aggregation module (LGCAM) is designed to aggregate the dense local multiscale context and global context in images, enabling the network to capture robust feature representations for fish with various scales under various background illumination conditions. Then, a self-distillation module (SDM) is designed to leverage information from the deep layers of the network to guide the learning of the shallow layers, enhancing the representation learning of the network without increasing the inference time. Extensive comparative experiments on the fish counting dataset collected from a deep-sea cage demonstrate the effectiveness of LGSDNet. It achieves a mean absolute error (MAE) of 5.68, a root-mean-squared error (RMSE) of 7.38, and a mean absolute percentage error (MAPE) of 3.27%, outperforming the Baseline with a reduction in the aforementioned metrics by 6.75, 7.92, and 3.42%, respectively. In addition, LGSDNet outperforms state-of-the-art fish and crowd counting methods on the dataset while having only 13.03 M parameters. Generalization experiments further demonstrate the adaptability of LGSDNet to diverse aquaculture environments.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Sensors Journal 工程技术-工程：电子与电气

CiteScore

7.70

自引率

14.00%

发文量

2058

审稿时长

5.2 months

期刊介绍： The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following: -Sensor Phenomenology, Modelling, and Evaluation -Sensor Materials, Processing, and Fabrication -Chemical and Gas Sensors -Microfluidics and Biosensors -Optical Sensors -Physical Sensors: Temperature, Mechanical, Magnetic, and others -Acoustic and Ultrasonic Sensors -Sensor Packaging -Sensor Networks -Sensor Applications -Sensor Systems: Signals, Processing, and Interfaces -Actuators and Sensor Power Systems -Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting -Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data) -Sensors in Industrial Practice