{"title":"When synthetic plants get sick: Disease graded image datasets by novel regression-conditional diffusion models","authors":"Itziar Egusquiza , Leire Benito-Del-Valle , Artzai Picón , Arantza Bereciartua-Pérez , Laura Gómez-Zamanillo , Andoni Elola , Elisabete Aramendi , Rocío Espejo , Till Eggers , Christian Klukas , Ramón Navarra-Mestre","doi":"10.1016/j.compag.2024.109690","DOIUrl":null,"url":null,"abstract":"<div><div>This paper introduces DiffusionPix2Pix, an innovative extension of diffusion models (DMs) that revolutionizes synthetic image generation by seamlessly integrating image priors, surpassing existing state-of-the-art models. Key contributions include regression (graded) conditioning and an arbitrary binary mask, enabling regression-conditional image-to-image translation. DiffusionPix2Pix is compared with Pix2Pix-G and Pix2Pix-GD, two alternative models that rely on image-conditioned GANs adapted for an additional regression conditional task. The model is applied to generate a graded plant disease dataset focusing on <em>Puccinia striiformis</em> symptoms, using disease degree as an additional conditioning input to control the level of disease in generated images. Experiments demonstrate that DiffusionPix2Pix outperforms GAN-based approaches in both sample fidelity and diversity, achieving an Improved Precision (fidelity) of 0.81 (versus 0.45 and 0.47) and an Improved Recall (diversity) of 0.58 (versus 0.31 and 0.31). Furthermore, DiffusionPix2Pix obtained the best Fréchet Inception Distance (FID), with a score of 31.61 compared to 57.38 and 54.34 for GAN-based models. Additionally, perception-based tests with field technicians showed 71.3% of images generated by DiffusionPix2Pix were classified as authentic, significantly outperforming the 20.19% and 22.22% rates for GAN-based models. These findings substantiate the performance of the proposed DiffusionPix2Pix model, both quantitatively and through subjective assessments by domain experts, highlighting its potential in applications requiring precise regression conditioning.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"229 ","pages":"Article 109690"},"PeriodicalIF":7.7000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169924010810","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
This paper introduces DiffusionPix2Pix, an innovative extension of diffusion models (DMs) that revolutionizes synthetic image generation by seamlessly integrating image priors, surpassing existing state-of-the-art models. Key contributions include regression (graded) conditioning and an arbitrary binary mask, enabling regression-conditional image-to-image translation. DiffusionPix2Pix is compared with Pix2Pix-G and Pix2Pix-GD, two alternative models that rely on image-conditioned GANs adapted for an additional regression conditional task. The model is applied to generate a graded plant disease dataset focusing on Puccinia striiformis symptoms, using disease degree as an additional conditioning input to control the level of disease in generated images. Experiments demonstrate that DiffusionPix2Pix outperforms GAN-based approaches in both sample fidelity and diversity, achieving an Improved Precision (fidelity) of 0.81 (versus 0.45 and 0.47) and an Improved Recall (diversity) of 0.58 (versus 0.31 and 0.31). Furthermore, DiffusionPix2Pix obtained the best Fréchet Inception Distance (FID), with a score of 31.61 compared to 57.38 and 54.34 for GAN-based models. Additionally, perception-based tests with field technicians showed 71.3% of images generated by DiffusionPix2Pix were classified as authentic, significantly outperforming the 20.19% and 22.22% rates for GAN-based models. These findings substantiate the performance of the proposed DiffusionPix2Pix model, both quantitatively and through subjective assessments by domain experts, highlighting its potential in applications requiring precise regression conditioning.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.