Chao Zhang , Lars Christian Gansel , Marc Bracke , Ricardo da Silva Torres
{"title":"复杂海水条件下鲑鱼虱幼虫增强检测的图像合成框架","authors":"Chao Zhang , Lars Christian Gansel , Marc Bracke , Ricardo da Silva Torres","doi":"10.1016/j.compag.2025.110985","DOIUrl":null,"url":null,"abstract":"<div><div>Salmon lice (<em>Lepeophtheirus salmonis</em>) infections pose significant threats to farmed and wild salmon populations, making early detection during pre-infective and infective stages essential for sustainable management. Traditional methods, such as PCR, are costly and unsuitable for large-scale deployment, while existing machine-learning approaches are limited by the lack of annotated data from complex seawater environments and are largely restricted to simplified laboratory conditions. To address these challenges, this study presents an automated synthetic data generation method based on the Segment Anything Model (SAM), specifically designed to improve the detection of multi-stage salmon louse larvae in real-world and complex seawater environments. A total of 54 orthogonal experiments were conducted to optimize the factor configuration for synthetic data generation, resulting in a high-quality dataset comprising 120,864 images. To maximize the utility of synthetic data and enhance model performance, we evaluated YOLO series models and the transformer-based RT-DETR-L model. Our experiments revealed that pretraining on synthetic data before fine-tuning (or hybrid training) consistently improved model performance across multiple evaluation metrics. The YOLOv8n’s F1-score increased from 70.6% to 87.7%, a notable relative improvement of 24.22%. In real-world seawater tests, models trained with synthetic data improved recall by 11.0% to 87.0% compared to models trained exclusively with the original data and outperformed well-trained biologists, further validating the effectiveness and practicality of the synthetic approach. This study provides a scalable solution for monitoring salmon louse larvae in large-scale seawater environments, improving the welfare of farmed and wild salmon. The source code is publicly available at <span><span>https://github.com/jay-zc/synthetic-dataset</span><svg><path></path></svg></span> (as of July 2025).</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"239 ","pages":"Article 110985"},"PeriodicalIF":8.9000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An image synthesis framework for enhanced salmon louse larvae (Lepeophtheirus Salmonis) detection in complex seawater conditions\",\"authors\":\"Chao Zhang , Lars Christian Gansel , Marc Bracke , Ricardo da Silva Torres\",\"doi\":\"10.1016/j.compag.2025.110985\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Salmon lice (<em>Lepeophtheirus salmonis</em>) infections pose significant threats to farmed and wild salmon populations, making early detection during pre-infective and infective stages essential for sustainable management. Traditional methods, such as PCR, are costly and unsuitable for large-scale deployment, while existing machine-learning approaches are limited by the lack of annotated data from complex seawater environments and are largely restricted to simplified laboratory conditions. To address these challenges, this study presents an automated synthetic data generation method based on the Segment Anything Model (SAM), specifically designed to improve the detection of multi-stage salmon louse larvae in real-world and complex seawater environments. A total of 54 orthogonal experiments were conducted to optimize the factor configuration for synthetic data generation, resulting in a high-quality dataset comprising 120,864 images. To maximize the utility of synthetic data and enhance model performance, we evaluated YOLO series models and the transformer-based RT-DETR-L model. Our experiments revealed that pretraining on synthetic data before fine-tuning (or hybrid training) consistently improved model performance across multiple evaluation metrics. The YOLOv8n’s F1-score increased from 70.6% to 87.7%, a notable relative improvement of 24.22%. In real-world seawater tests, models trained with synthetic data improved recall by 11.0% to 87.0% compared to models trained exclusively with the original data and outperformed well-trained biologists, further validating the effectiveness and practicality of the synthetic approach. This study provides a scalable solution for monitoring salmon louse larvae in large-scale seawater environments, improving the welfare of farmed and wild salmon. The source code is publicly available at <span><span>https://github.com/jay-zc/synthetic-dataset</span><svg><path></path></svg></span> (as of July 2025).</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"239 \",\"pages\":\"Article 110985\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-09-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169925010919\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925010919","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
An image synthesis framework for enhanced salmon louse larvae (Lepeophtheirus Salmonis) detection in complex seawater conditions
Salmon lice (Lepeophtheirus salmonis) infections pose significant threats to farmed and wild salmon populations, making early detection during pre-infective and infective stages essential for sustainable management. Traditional methods, such as PCR, are costly and unsuitable for large-scale deployment, while existing machine-learning approaches are limited by the lack of annotated data from complex seawater environments and are largely restricted to simplified laboratory conditions. To address these challenges, this study presents an automated synthetic data generation method based on the Segment Anything Model (SAM), specifically designed to improve the detection of multi-stage salmon louse larvae in real-world and complex seawater environments. A total of 54 orthogonal experiments were conducted to optimize the factor configuration for synthetic data generation, resulting in a high-quality dataset comprising 120,864 images. To maximize the utility of synthetic data and enhance model performance, we evaluated YOLO series models and the transformer-based RT-DETR-L model. Our experiments revealed that pretraining on synthetic data before fine-tuning (or hybrid training) consistently improved model performance across multiple evaluation metrics. The YOLOv8n’s F1-score increased from 70.6% to 87.7%, a notable relative improvement of 24.22%. In real-world seawater tests, models trained with synthetic data improved recall by 11.0% to 87.0% compared to models trained exclusively with the original data and outperformed well-trained biologists, further validating the effectiveness and practicality of the synthetic approach. This study provides a scalable solution for monitoring salmon louse larvae in large-scale seawater environments, improving the welfare of farmed and wild salmon. The source code is publicly available at https://github.com/jay-zc/synthetic-dataset (as of July 2025).
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.