基于生成式人工智能的管道架构，提高智能杂草控制系统的训练效率

IF 3.7 2区计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Journal of Systems Architecture Pub Date : 2025-06-02 DOI:10.1016/j.sysarc.2025.103464

Sourav Modak, Anthony Stein

{"title":"基于生成式人工智能的管道架构，提高智能杂草控制系统的训练效率","authors":"Sourav Modak, Anthony Stein","doi":"10.1016/j.sysarc.2025.103464","DOIUrl":null,"url":null,"abstract":"<div><div>In automated crop protection tasks, deep learning has demonstrated significant potential. However, these advanced models rely heavily on high-quality, diverse datasets, which are often scarce and costly to obtain in agricultural settings. Traditional data augmentation techniques, while useful for increasing the volume of the dataset, often fail to capture the real-world variability needed for robust model training. In this paper, we present a novel method for generating synthetic images to enhance the training of deep learning-based object detection models for intelligent weed control, aiming to improve data efficiency. The architecture of our GenAI-based image generation pipeline integrates the Segment Anything Model (SAM) for zero-shot domain adaptation with a text-to-image Stable Diffusion Model, enabling the creation of synthetic images that can accurately reflect the idiosyncratic properties and appearances of a variety of real-world conditions. We further assess the application of these synthetic datasets on edge devices by evaluating state-of-the-art lightweight YOLO models, measuring data efficiency by comparing mAP50 and mAP50-95 scores among different proportions of real and synthetic training data. Incorporating these synthetic datasets into the training process has been found to result in notable improvements in terms of data efficiency. For instance, most YOLO models that are trained on a dataset consisting of 10% synthetic images and 90% real-world images typically demonstrate superior scores on mAP50 and mAP50-95 metrics compared to those trained solely on real-world images. The integration of this approach opens opportunities for achieving continual self-improvement of perception modules in intelligent technical systems.</div></div>","PeriodicalId":50027,"journal":{"name":"Journal of Systems Architecture","volume":"167 ","pages":"Article 103464"},"PeriodicalIF":3.7000,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Generative AI-based pipeline architecture for increasing training efficiency in intelligent weed control systems\",\"authors\":\"Sourav Modak, Anthony Stein\",\"doi\":\"10.1016/j.sysarc.2025.103464\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In automated crop protection tasks, deep learning has demonstrated significant potential. However, these advanced models rely heavily on high-quality, diverse datasets, which are often scarce and costly to obtain in agricultural settings. Traditional data augmentation techniques, while useful for increasing the volume of the dataset, often fail to capture the real-world variability needed for robust model training. In this paper, we present a novel method for generating synthetic images to enhance the training of deep learning-based object detection models for intelligent weed control, aiming to improve data efficiency. The architecture of our GenAI-based image generation pipeline integrates the Segment Anything Model (SAM) for zero-shot domain adaptation with a text-to-image Stable Diffusion Model, enabling the creation of synthetic images that can accurately reflect the idiosyncratic properties and appearances of a variety of real-world conditions. We further assess the application of these synthetic datasets on edge devices by evaluating state-of-the-art lightweight YOLO models, measuring data efficiency by comparing mAP50 and mAP50-95 scores among different proportions of real and synthetic training data. Incorporating these synthetic datasets into the training process has been found to result in notable improvements in terms of data efficiency. For instance, most YOLO models that are trained on a dataset consisting of 10% synthetic images and 90% real-world images typically demonstrate superior scores on mAP50 and mAP50-95 metrics compared to those trained solely on real-world images. The integration of this approach opens opportunities for achieving continual self-improvement of perception modules in intelligent technical systems.</div></div>\",\"PeriodicalId\":50027,\"journal\":{\"name\":\"Journal of Systems Architecture\",\"volume\":\"167 \",\"pages\":\"Article 103464\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-06-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Systems Architecture\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1383762125001365\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Systems Architecture","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1383762125001365","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}

引用次数: 0

摘要

在自动化作物保护任务中，深度学习已经显示出巨大的潜力。然而，这些先进的模型严重依赖于高质量、多样化的数据集，而这些数据集在农业环境中往往稀缺且成本高昂。传统的数据增强技术虽然有助于增加数据集的容量，但往往无法捕获健壮模型训练所需的真实世界的可变性。在本文中，我们提出了一种新的生成合成图像的方法，以增强基于深度学习的智能杂草控制目标检测模型的训练，旨在提高数据效率。我们基于genai的图像生成管道的架构集成了分段任意模型（SAM），用于零射击域适应和文本到图像的稳定扩散模型，使合成图像能够准确地反映各种现实世界条件的特殊属性和外观。我们进一步评估了这些合成数据集在边缘设备上的应用，通过评估最先进的轻量级YOLO模型，通过比较mAP50和mAP50-95在不同比例的真实和合成训练数据中的得分来衡量数据效率。将这些合成数据集整合到训练过程中可以显著提高数据效率。例如，在由10%合成图像和90%真实图像组成的数据集上训练的大多数YOLO模型在mAP50和mAP50-95指标上的得分通常高于仅在真实图像上训练的模型。这种方法的集成为智能技术系统中感知模块的持续自我完善提供了机会。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Generative AI-based pipeline architecture for increasing training efficiency in intelligent weed control systems

In automated crop protection tasks, deep learning has demonstrated significant potential. However, these advanced models rely heavily on high-quality, diverse datasets, which are often scarce and costly to obtain in agricultural settings. Traditional data augmentation techniques, while useful for increasing the volume of the dataset, often fail to capture the real-world variability needed for robust model training. In this paper, we present a novel method for generating synthetic images to enhance the training of deep learning-based object detection models for intelligent weed control, aiming to improve data efficiency. The architecture of our GenAI-based image generation pipeline integrates the Segment Anything Model (SAM) for zero-shot domain adaptation with a text-to-image Stable Diffusion Model, enabling the creation of synthetic images that can accurately reflect the idiosyncratic properties and appearances of a variety of real-world conditions. We further assess the application of these synthetic datasets on edge devices by evaluating state-of-the-art lightweight YOLO models, measuring data efficiency by comparing mAP50 and mAP50-95 scores among different proportions of real and synthetic training data. Incorporating these synthetic datasets into the training process has been found to result in notable improvements in terms of data efficiency. For instance, most YOLO models that are trained on a dataset consisting of 10% synthetic images and 90% real-world images typically demonstrate superior scores on mAP50 and mAP50-95 metrics compared to those trained solely on real-world images. The integration of this approach opens opportunities for achieving continual self-improvement of perception modules in intelligent technical systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Systems Architecture 工程技术-计算机：硬件

CiteScore

8.70

自引率

15.60%

发文量

226

审稿时长

46 days

期刊介绍： The Journal of Systems Architecture: Embedded Software Design (JSA) is a journal covering all design and architectural aspects related to embedded systems and software. It ranges from the microarchitecture level via the system software level up to the application-specific architecture level. Aspects such as real-time systems, operating systems, FPGA programming, programming languages, communications (limited to analysis and the software stack), mobile systems, parallel and distributed architectures as well as additional subjects in the computer and system architecture area will fall within the scope of this journal. Technology will not be a main focus, but its use and relevance to particular designs will be. Case studies are welcome but must contribute more than just a design for a particular piece of software. Design automation of such systems including methodologies, techniques and tools for their design as well as novel designs of software components fall within the scope of this journal. Novel applications that use embedded systems are also central in this journal. While hardware is not a part of this journal hardware/software co-design methods that consider interplay between software and hardware components with and emphasis on software are also relevant here.