Yuxiang Li , Hequn Tan , Yuxuan Deng , Dianzhuo Zhou , Ming Zhu
{"title":"A two-stage 3D multi-fish tracking model using patch-based underwater stereo matching","authors":"Yuxiang Li , Hequn Tan , Yuxuan Deng , Dianzhuo Zhou , Ming Zhu","doi":"10.1016/j.biosystemseng.2024.12.011","DOIUrl":null,"url":null,"abstract":"<div><div>Indoor recirculating aquaculture systems necessitate the extensive application of automatic monitoring technologies due to the inherent complexity and precision involved. Underwater three-dimensional (3D) tracking is essential for monitoring fish. However, underwater imaging in aquaculture environments frequently encounters challenges such as image blurring, low contrast, and excessive noise. These problems reduce the effectiveness of stereo matching algorithms, subsequently affecting the precision of binocular-based underwater 3D tracking. As a result, current applications often rely on expensive hardware, such as Time-of-Flight (TOF) cameras or structured light cameras. To achieve high-precision underwater 3D tracking economically with binocular cameras, a two-stage 3D multi-fish tracking (TMT) model was proposed. The TMT model concentrates on processing fish-containing patches, eliminating complex and extensive background regions in underwater images. This improves contour and spatial position accuracy in stereo matching, reduces computational demands, and simplifies point cloud filtering. Specifically, in stage 1, a detector and tracker identify fish-containing patches within stereo images. Stage 2 uses a novel patch-based underwater stereo matching method to match patches of the same fish in the left and right images, then calculates and filters the fish point cloud. Evaluations of positioning accuracy showed that the Mean Absolute Error (MAE) between model results and the true values is less than 2 cm in both horizontal and vertical directions, and less than 5 cm in the depth direction. Furthermore, the TMT model demonstrates stable performance during a 7-h hypoxia experiment on fish, effectively identifying various fish behavioural changes.</div></div>","PeriodicalId":9173,"journal":{"name":"Biosystems Engineering","volume":"250 ","pages":"Pages 144-157"},"PeriodicalIF":4.4000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biosystems Engineering","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1537511024002848","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Indoor recirculating aquaculture systems necessitate the extensive application of automatic monitoring technologies due to the inherent complexity and precision involved. Underwater three-dimensional (3D) tracking is essential for monitoring fish. However, underwater imaging in aquaculture environments frequently encounters challenges such as image blurring, low contrast, and excessive noise. These problems reduce the effectiveness of stereo matching algorithms, subsequently affecting the precision of binocular-based underwater 3D tracking. As a result, current applications often rely on expensive hardware, such as Time-of-Flight (TOF) cameras or structured light cameras. To achieve high-precision underwater 3D tracking economically with binocular cameras, a two-stage 3D multi-fish tracking (TMT) model was proposed. The TMT model concentrates on processing fish-containing patches, eliminating complex and extensive background regions in underwater images. This improves contour and spatial position accuracy in stereo matching, reduces computational demands, and simplifies point cloud filtering. Specifically, in stage 1, a detector and tracker identify fish-containing patches within stereo images. Stage 2 uses a novel patch-based underwater stereo matching method to match patches of the same fish in the left and right images, then calculates and filters the fish point cloud. Evaluations of positioning accuracy showed that the Mean Absolute Error (MAE) between model results and the true values is less than 2 cm in both horizontal and vertical directions, and less than 5 cm in the depth direction. Furthermore, the TMT model demonstrates stable performance during a 7-h hypoxia experiment on fish, effectively identifying various fish behavioural changes.
期刊介绍:
Biosystems Engineering publishes research in engineering and the physical sciences that represent advances in understanding or modelling of the performance of biological systems for sustainable developments in land use and the environment, agriculture and amenity, bioproduction processes and the food chain. The subject matter of the journal reflects the wide range and interdisciplinary nature of research in engineering for biological systems.