Mengtian Li , Xiaorun Li , Shuhan Chen , Hui Huang
{"title":"基于改进的yolox -纳米算法的高精度轻量化水下鱼类检测与识别方法","authors":"Mengtian Li , Xiaorun Li , Shuhan Chen , Hui Huang","doi":"10.1016/j.aquaeng.2025.102533","DOIUrl":null,"url":null,"abstract":"<div><div>In the past few years, the aquaculture industry has experienced a growing demand for in-situ fish detection using live videos from underwater observatory platforms to autonomously monitor cultivated fish and assess their health and growth conditions. To achieve the ideal balance between model size and detection precision, this paper offers a lightweight YOLOX-based fish detection and recognition approach termed Foc_YOLOXn_ASFF. In this study, we introduce Focal loss as a substitute for the binary cross-entropy (BCE) loss utilized by the original YOLOX to train the objectiveness branch, aiming to alleviate the class imbalance between the background and foreground classes. Meanwhile, we focus on moderating the inconsistency across various feature scales by combining YOLOX-nano with the adaptively spatial feature fusion (ASFF) structure to better fuse multi-scale features. Foc_YOLOXn_ASFF remarkably boosts the AP by over 3.6 % from 0.846 to 0.877 compared with the original YOLOX-nano through the implementation of Focal loss and ASFF structure. Foc_YOLOXn_ASFF achieves superior performance with 0.877 AP but only 2.27 M parameters, outperforming the counterparts YOLOv5n and SE_YOLOv5s_DGhost by 1.2 % AP and 7.9 % AP respectively. Extensive experimental findings indicate that Foc_YOLOXn_ASFF for both model size and detection precision is capable of meeting the requirements of practical applications on embedded devices with limited computational resources.</div></div>","PeriodicalId":8120,"journal":{"name":"Aquacultural Engineering","volume":"110 ","pages":"Article 102533"},"PeriodicalIF":3.6000,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A high-precision and lightweight underwater fish detection and recognition approach based on the improved YOLOX-nano algorithm\",\"authors\":\"Mengtian Li , Xiaorun Li , Shuhan Chen , Hui Huang\",\"doi\":\"10.1016/j.aquaeng.2025.102533\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In the past few years, the aquaculture industry has experienced a growing demand for in-situ fish detection using live videos from underwater observatory platforms to autonomously monitor cultivated fish and assess their health and growth conditions. To achieve the ideal balance between model size and detection precision, this paper offers a lightweight YOLOX-based fish detection and recognition approach termed Foc_YOLOXn_ASFF. In this study, we introduce Focal loss as a substitute for the binary cross-entropy (BCE) loss utilized by the original YOLOX to train the objectiveness branch, aiming to alleviate the class imbalance between the background and foreground classes. Meanwhile, we focus on moderating the inconsistency across various feature scales by combining YOLOX-nano with the adaptively spatial feature fusion (ASFF) structure to better fuse multi-scale features. Foc_YOLOXn_ASFF remarkably boosts the AP by over 3.6 % from 0.846 to 0.877 compared with the original YOLOX-nano through the implementation of Focal loss and ASFF structure. Foc_YOLOXn_ASFF achieves superior performance with 0.877 AP but only 2.27 M parameters, outperforming the counterparts YOLOv5n and SE_YOLOv5s_DGhost by 1.2 % AP and 7.9 % AP respectively. Extensive experimental findings indicate that Foc_YOLOXn_ASFF for both model size and detection precision is capable of meeting the requirements of practical applications on embedded devices with limited computational resources.</div></div>\",\"PeriodicalId\":8120,\"journal\":{\"name\":\"Aquacultural Engineering\",\"volume\":\"110 \",\"pages\":\"Article 102533\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2025-03-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Aquacultural Engineering\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0144860925000226\",\"RegionNum\":2,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AGRICULTURAL ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aquacultural Engineering","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0144860925000226","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
A high-precision and lightweight underwater fish detection and recognition approach based on the improved YOLOX-nano algorithm
In the past few years, the aquaculture industry has experienced a growing demand for in-situ fish detection using live videos from underwater observatory platforms to autonomously monitor cultivated fish and assess their health and growth conditions. To achieve the ideal balance between model size and detection precision, this paper offers a lightweight YOLOX-based fish detection and recognition approach termed Foc_YOLOXn_ASFF. In this study, we introduce Focal loss as a substitute for the binary cross-entropy (BCE) loss utilized by the original YOLOX to train the objectiveness branch, aiming to alleviate the class imbalance between the background and foreground classes. Meanwhile, we focus on moderating the inconsistency across various feature scales by combining YOLOX-nano with the adaptively spatial feature fusion (ASFF) structure to better fuse multi-scale features. Foc_YOLOXn_ASFF remarkably boosts the AP by over 3.6 % from 0.846 to 0.877 compared with the original YOLOX-nano through the implementation of Focal loss and ASFF structure. Foc_YOLOXn_ASFF achieves superior performance with 0.877 AP but only 2.27 M parameters, outperforming the counterparts YOLOv5n and SE_YOLOv5s_DGhost by 1.2 % AP and 7.9 % AP respectively. Extensive experimental findings indicate that Foc_YOLOXn_ASFF for both model size and detection precision is capable of meeting the requirements of practical applications on embedded devices with limited computational resources.
期刊介绍:
Aquacultural Engineering is concerned with the design and development of effective aquacultural systems for marine and freshwater facilities. The journal aims to apply the knowledge gained from basic research which potentially can be translated into commercial operations.
Problems of scale-up and application of research data involve many parameters, both physical and biological, making it difficult to anticipate the interaction between the unit processes and the cultured animals. Aquacultural Engineering aims to develop this bioengineering interface for aquaculture and welcomes contributions in the following areas:
– Engineering and design of aquaculture facilities
– Engineering-based research studies
– Construction experience and techniques
– In-service experience, commissioning, operation
– Materials selection and their uses
– Quantification of biological data and constraints