{"title":"基于局部全局上下文聚合和自蒸馏的深度神经网络在深海水产养殖中的鱼类计数","authors":"Hanchi Liu;Xin Ma;Haoran Li","doi":"10.1109/JSEN.2025.3550322","DOIUrl":null,"url":null,"abstract":"Vision-based fish counting plays a vital role in monitoring breeding density, optimizing feeding strategies, and planning marketing schedules in deep-sea aquaculture. However, large-scale variations in fish and nonuniform background illumination in underwater images make it challenging to accurately count fish in deep-sea cages. Aiming to solve these issues, this study proposes a deep neural network (DNN) with local-global context aggregation and self-distillation called LGSDNet for fish counting and density estimation in deep-sea aquaculture. First, a local-global context aggregation module (LGCAM) is designed to aggregate the dense local multiscale context and global context in images, enabling the network to capture robust feature representations for fish with various scales under various background illumination conditions. Then, a self-distillation module (SDM) is designed to leverage information from the deep layers of the network to guide the learning of the shallow layers, enhancing the representation learning of the network without increasing the inference time. Extensive comparative experiments on the fish counting dataset collected from a deep-sea cage demonstrate the effectiveness of LGSDNet. It achieves a mean absolute error (MAE) of 5.68, a root-mean-squared error (RMSE) of 7.38, and a mean absolute percentage error (MAPE) of 3.27%, outperforming the Baseline with a reduction in the aforementioned metrics by 6.75, 7.92, and 3.42%, respectively. In addition, LGSDNet outperforms state-of-the-art fish and crowd counting methods on the dataset while having only 13.03 M parameters. Generalization experiments further demonstrate the adaptability of LGSDNet to diverse aquaculture environments.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 9","pages":"16411-16424"},"PeriodicalIF":4.3000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep Neural Network With Local–Global Context Aggregation and Self-Distillation for Fish Counting in Deep-Sea Aquaculture\",\"authors\":\"Hanchi Liu;Xin Ma;Haoran Li\",\"doi\":\"10.1109/JSEN.2025.3550322\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Vision-based fish counting plays a vital role in monitoring breeding density, optimizing feeding strategies, and planning marketing schedules in deep-sea aquaculture. However, large-scale variations in fish and nonuniform background illumination in underwater images make it challenging to accurately count fish in deep-sea cages. Aiming to solve these issues, this study proposes a deep neural network (DNN) with local-global context aggregation and self-distillation called LGSDNet for fish counting and density estimation in deep-sea aquaculture. First, a local-global context aggregation module (LGCAM) is designed to aggregate the dense local multiscale context and global context in images, enabling the network to capture robust feature representations for fish with various scales under various background illumination conditions. Then, a self-distillation module (SDM) is designed to leverage information from the deep layers of the network to guide the learning of the shallow layers, enhancing the representation learning of the network without increasing the inference time. Extensive comparative experiments on the fish counting dataset collected from a deep-sea cage demonstrate the effectiveness of LGSDNet. It achieves a mean absolute error (MAE) of 5.68, a root-mean-squared error (RMSE) of 7.38, and a mean absolute percentage error (MAPE) of 3.27%, outperforming the Baseline with a reduction in the aforementioned metrics by 6.75, 7.92, and 3.42%, respectively. In addition, LGSDNet outperforms state-of-the-art fish and crowd counting methods on the dataset while having only 13.03 M parameters. Generalization experiments further demonstrate the adaptability of LGSDNet to diverse aquaculture environments.\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"25 9\",\"pages\":\"16411-16424\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-03-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10934728/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10934728/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Deep Neural Network With Local–Global Context Aggregation and Self-Distillation for Fish Counting in Deep-Sea Aquaculture
Vision-based fish counting plays a vital role in monitoring breeding density, optimizing feeding strategies, and planning marketing schedules in deep-sea aquaculture. However, large-scale variations in fish and nonuniform background illumination in underwater images make it challenging to accurately count fish in deep-sea cages. Aiming to solve these issues, this study proposes a deep neural network (DNN) with local-global context aggregation and self-distillation called LGSDNet for fish counting and density estimation in deep-sea aquaculture. First, a local-global context aggregation module (LGCAM) is designed to aggregate the dense local multiscale context and global context in images, enabling the network to capture robust feature representations for fish with various scales under various background illumination conditions. Then, a self-distillation module (SDM) is designed to leverage information from the deep layers of the network to guide the learning of the shallow layers, enhancing the representation learning of the network without increasing the inference time. Extensive comparative experiments on the fish counting dataset collected from a deep-sea cage demonstrate the effectiveness of LGSDNet. It achieves a mean absolute error (MAE) of 5.68, a root-mean-squared error (RMSE) of 7.38, and a mean absolute percentage error (MAPE) of 3.27%, outperforming the Baseline with a reduction in the aforementioned metrics by 6.75, 7.92, and 3.42%, respectively. In addition, LGSDNet outperforms state-of-the-art fish and crowd counting methods on the dataset while having only 13.03 M parameters. Generalization experiments further demonstrate the adaptability of LGSDNet to diverse aquaculture environments.
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice