Automated generation of ground truth images of greenhouse-grown plant shoots using a GAN approach

Sajid Ullah, Narendra Narisetti, Kerstin Neumann, Thomas Altmann, Jan Hejatko, Evgeny Gladilin

Plant Methods, vol. 21, no. 1, p. 126 (published 2025-10-04)
DOI: 10.1186/s13007-025-01441-1
Open access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12495773/pdf/
Citations: 0
Abstract
The generation of large amounts of ground truth data is a major bottleneck for the application of deep learning-based approaches to plant image analysis. In particular, generating accurately labeled images of various plant types at different developmental stages from multiple renderings is a laborious task that substantially extends the time required for AI model development and adaptation to new data. Here, generative adversarial networks (GANs) can potentially offer a solution by enabling largely automated synthesis of realistic images of plant and background structures. In this study, we present a two-stage GAN-based approach to the generation of pairs of RGB and binary-segmented images of greenhouse-grown plant shoots. In the first stage, FastGAN is applied to augment original RGB images of greenhouse-grown plants using intensity and texture transformations. The augmented data were then employed as additional test sets for a Pix2Pix model trained on a limited set of 2D RGB images and their corresponding binary ground truth segmentations. This two-stage approach was evaluated on unseen images of different greenhouse-grown plants. Our experimental results show that the accuracy of GAN-predicted binary segmentation ranges between 0.88 and 0.95 in terms of the Dice coefficient. Among several loss functions tested, Sigmoid Loss enabled the most efficient model convergence during training, achieving the highest average Dice coefficient scores of 0.94 and 0.95 for Arabidopsis and maize images, respectively. This underscores the advantages of employing tailored loss functions for optimizing model performance.
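The abstract reports segmentation accuracy as the Dice coefficient, the standard overlap metric Dice(A, B) = 2|A ∩ B| / (|A| + |B|) for a predicted binary mask A and a ground-truth mask B. Below is a minimal NumPy sketch of this metric; the function name, the epsilon smoothing term, and the toy masks are illustrative assumptions, not the authors' evaluation code.

```python
import numpy as np

def dice_coefficient(pred: np.ndarray, truth: np.ndarray, eps: float = 1e-7) -> float:
    """Dice overlap between two binary masks: 2*|A ∩ B| / (|A| + |B|)."""
    pred = pred.astype(bool)
    truth = truth.astype(bool)
    intersection = np.logical_and(pred, truth).sum()
    # eps guards against division by zero when both masks are empty
    return (2.0 * intersection + eps) / (pred.sum() + truth.sum() + eps)

# Toy example: a slightly shifted prediction against a square ground-truth region
truth = np.zeros((256, 256), dtype=np.uint8)
truth[64:192, 64:192] = 1          # ground-truth shoot region
pred = np.zeros_like(truth)
pred[70:192, 64:192] = 1           # prediction misses a 6-pixel strip at the top
print(f"Dice: {dice_coefficient(pred, truth):.3f}")
```

A score of 1.0 means perfect overlap and 0.0 means no overlap, so the reported range of 0.88-0.95 indicates that the GAN-predicted masks recover most of the ground-truth shoot pixels.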
Journal Introduction
Plant Methods is an open access, peer-reviewed, online journal for the plant research community that encompasses all aspects of technological innovation in the plant sciences.
There is no doubt that we have entered an exciting new era in plant biology. The completion of the Arabidopsis genome sequence and the rapid progress being made in other plant genomics projects are providing unparalleled opportunities for progress in all areas of plant science. Nevertheless, enormous challenges lie ahead if we are to understand the function of every gene in the genome, and how the individual parts work together to make the whole organism. Achieving these goals will require an unprecedented collaborative effort, combining high-throughput, system-wide technologies with more focused approaches that integrate traditional disciplines such as cell biology, biochemistry and molecular genetics.
Technological innovation is probably the most important catalyst for progress in any scientific discipline. Plant Methods’ goal is to stimulate the development and adoption of new and improved techniques and research tools and, where appropriate, to promote consistency of methodologies for better integration of data from different laboratories.