利用生成模型综合训练数据集提高植物病害分类准确率

Research in Biotechnology Pub Date : 2023-02-13 DOI:10.25081/rib.2023.v14.8214

Enow Albert, N. Bille, N. Leonard

{"title":"利用生成模型综合训练数据集提高植物病害分类准确率","authors":"Enow Albert, N. Bille, N. Leonard","doi":"10.25081/rib.2023.v14.8214","DOIUrl":null,"url":null,"abstract":"Digitalization in agriculture requires critical research into applications of artificial intelligence to various specialization domains. This work aimed at investigating the application of image synthesis technology to the mitigation of the data volume constraint to digital plant disease phenotyping accuracy. We designed an experiment involving the use of a deep convolutional generative adversarial network (DC-GAN) to synthesize photorealistic data for healthy and bacterial spot disease-infected tomato leaves. The training dataset contained 1,272 instances per class. We further employed a 3-block visual geometry group (VGG) convolutional neural network (CNN) model with dropout regularization and 1 epoch to compare classification accuracies of the original dataset and various synthetic datasets. Our results showed that the third DC-GAN synthesized training dataset containing 3,816 synthetic examples of both healthy and bacterial spot infected tomato leaf classes outperformed the original training dataset containing 1,272 real examples of both tomato leaf classes (77.088% accuracy with the former dataset on a 3-block VGG CNN model with dropout regularization and 1 epoch, as compared to 76.447% accuracy with the latter dataset on the same classifier).","PeriodicalId":21082,"journal":{"name":"Research in Biotechnology","volume":"37 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improvement of plant disease classification accuracy with generative model-synthesized training datasets\",\"authors\":\"Enow Albert, N. Bille, N. Leonard\",\"doi\":\"10.25081/rib.2023.v14.8214\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Digitalization in agriculture requires critical research into applications of artificial intelligence to various specialization domains. This work aimed at investigating the application of image synthesis technology to the mitigation of the data volume constraint to digital plant disease phenotyping accuracy. We designed an experiment involving the use of a deep convolutional generative adversarial network (DC-GAN) to synthesize photorealistic data for healthy and bacterial spot disease-infected tomato leaves. The training dataset contained 1,272 instances per class. We further employed a 3-block visual geometry group (VGG) convolutional neural network (CNN) model with dropout regularization and 1 epoch to compare classification accuracies of the original dataset and various synthetic datasets. Our results showed that the third DC-GAN synthesized training dataset containing 3,816 synthetic examples of both healthy and bacterial spot infected tomato leaf classes outperformed the original training dataset containing 1,272 real examples of both tomato leaf classes (77.088% accuracy with the former dataset on a 3-block VGG CNN model with dropout regularization and 1 epoch, as compared to 76.447% accuracy with the latter dataset on the same classifier).\",\"PeriodicalId\":21082,\"journal\":{\"name\":\"Research in Biotechnology\",\"volume\":\"37 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-02-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Research in Biotechnology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.25081/rib.2023.v14.8214\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research in Biotechnology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25081/rib.2023.v14.8214","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

农业数字化需要对人工智能在各个专业领域的应用进行批判性研究。这项工作旨在研究图像合成技术的应用，以减轻数据量对数字植物病害表型准确性的限制。我们设计了一个实验，涉及使用深度卷积生成对抗网络(DC-GAN)来合成健康和细菌性斑点病感染番茄叶片的逼真数据。训练数据集包含每个类1,272个实例。我们进一步采用了一个3块视觉几何组(VGG)卷积神经网络(CNN)模型，该模型具有dropout正则化和1 epoch，比较了原始数据集和各种合成数据集的分类精度。我们的结果表明，第三个DC-GAN合成训练数据集包含3816个健康和细菌斑点感染番茄叶片类的合成样本，优于包含1272个番茄叶片类真实样本的原始训练数据集(前者数据集在3块VGG CNN模型上具有dropout规范化和1 epoch的准确率为77.088%，后者数据集在相同分类器上的准确率为76.447%)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Improvement of plant disease classification accuracy with generative model-synthesized training datasets

Digitalization in agriculture requires critical research into applications of artificial intelligence to various specialization domains. This work aimed at investigating the application of image synthesis technology to the mitigation of the data volume constraint to digital plant disease phenotyping accuracy. We designed an experiment involving the use of a deep convolutional generative adversarial network (DC-GAN) to synthesize photorealistic data for healthy and bacterial spot disease-infected tomato leaves. The training dataset contained 1,272 instances per class. We further employed a 3-block visual geometry group (VGG) convolutional neural network (CNN) model with dropout regularization and 1 epoch to compare classification accuracies of the original dataset and various synthetic datasets. Our results showed that the third DC-GAN synthesized training dataset containing 3,816 synthetic examples of both healthy and bacterial spot infected tomato leaf classes outperformed the original training dataset containing 1,272 real examples of both tomato leaf classes (77.088% accuracy with the former dataset on a 3-block VGG CNN model with dropout regularization and 1 epoch, as compared to 76.447% accuracy with the latter dataset on the same classifier).

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Research in Biotechnology

自引率

0.00%

发文量