基于生成对抗网络的合成高光谱反射率数据增强，增强葡萄成熟度测定

IF 7.7 1区农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY

Computers and Electronics in Agriculture Pub Date : 2025-04-03 DOI:10.1016/j.compag.2025.110341

Hongyi Lyu , Miles Grafton , Thiagarajah Ramilan , Matthew Irwin , Eduardo Sandoval

{"title":"基于生成对抗网络的合成高光谱反射率数据增强，增强葡萄成熟度测定","authors":"Hongyi Lyu , Miles Grafton , Thiagarajah Ramilan , Matthew Irwin , Eduardo Sandoval","doi":"10.1016/j.compag.2025.110341","DOIUrl":null,"url":null,"abstract":"<div><div>Non-destructive and rapid grape maturity detection is important for the wine industry. The ongoing development of hyperspectral imaging techniques and deep learning methods has greatly helped in non-destructive assessing of grape quality and maturity, but the performance of deep learning methods depends on the volume and the quality of labeled data for training. Building non-destructive grape quality or maturity testing datasets requires damaging grapes for chemical analysis to produce labels which are time consuming and resource intensive. To solve this problem, this study proposed a conditional Wasserstain Generative Adversarial Network (WGAN) with the gradient penalty data augmentation technique to generate synthetic hyperspectral reflectance data of two grape maturity categories (ripe and unripe) and different Total Soluble Solids (TSS) values. The conditional WGAN with the gradient penalty was trained for a range of epochs: 500, 1000, 2000, 8000, 10,000, and 20,000. After training of 10,000 epochs, synthetic hyperspectral reflectance data were very similar to real spectra for each maturity category and different TSS values. Thereafter, contextual deep three-dimensional CNN (3D-CNN), Spatial Residual Network (SSRN) and Support Vector Machine (SVM) are trained on original training and synthetic + original training datasets to classify grape maturity. The synthetic hyperspectral reflectance data, incrementally added to the original training set in steps of 250, 500, 1000, 1500, and 2000 samples, consistently resulted in higher model performance compared to training solely on the original dataset. The best results were achieved by augmenting the training dataset with 2000 synthetic samples and training with a 3D-CNN, yielding a classification accuracy of 91 % on the testing set. To better assess the effectiveness of GAN-based data augmentation methods, two widely used regression models: Partial Least Squares Regression (PLSR) and one-dimensional CNN (1D-CNN) were used based on same data augmentation method. The best result was achieved by adding 250 synthetic samples to the original training set when training 1D-CNN model, yielding an R<sup>2</sup> of 0.78, RMSE of 0.63 °Brix, and RPIQ of 3.36 on the testing set. This study indicated that deep learning models combined with conditional WGAN with the gradient penalty data augmentation technique had a good application prospect in the grape maturity assessment.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"235 ","pages":"Article 110341"},"PeriodicalIF":7.7000,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Synthetic hyperspectral reflectance data augmentation by generative adversarial network to enhance grape maturity determination\",\"authors\":\"Hongyi Lyu , Miles Grafton , Thiagarajah Ramilan , Matthew Irwin , Eduardo Sandoval\",\"doi\":\"10.1016/j.compag.2025.110341\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Non-destructive and rapid grape maturity detection is important for the wine industry. The ongoing development of hyperspectral imaging techniques and deep learning methods has greatly helped in non-destructive assessing of grape quality and maturity, but the performance of deep learning methods depends on the volume and the quality of labeled data for training. Building non-destructive grape quality or maturity testing datasets requires damaging grapes for chemical analysis to produce labels which are time consuming and resource intensive. To solve this problem, this study proposed a conditional Wasserstain Generative Adversarial Network (WGAN) with the gradient penalty data augmentation technique to generate synthetic hyperspectral reflectance data of two grape maturity categories (ripe and unripe) and different Total Soluble Solids (TSS) values. The conditional WGAN with the gradient penalty was trained for a range of epochs: 500, 1000, 2000, 8000, 10,000, and 20,000. After training of 10,000 epochs, synthetic hyperspectral reflectance data were very similar to real spectra for each maturity category and different TSS values. Thereafter, contextual deep three-dimensional CNN (3D-CNN), Spatial Residual Network (SSRN) and Support Vector Machine (SVM) are trained on original training and synthetic + original training datasets to classify grape maturity. The synthetic hyperspectral reflectance data, incrementally added to the original training set in steps of 250, 500, 1000, 1500, and 2000 samples, consistently resulted in higher model performance compared to training solely on the original dataset. The best results were achieved by augmenting the training dataset with 2000 synthetic samples and training with a 3D-CNN, yielding a classification accuracy of 91 % on the testing set. To better assess the effectiveness of GAN-based data augmentation methods, two widely used regression models: Partial Least Squares Regression (PLSR) and one-dimensional CNN (1D-CNN) were used based on same data augmentation method. The best result was achieved by adding 250 synthetic samples to the original training set when training 1D-CNN model, yielding an R<sup>2</sup> of 0.78, RMSE of 0.63 °Brix, and RPIQ of 3.36 on the testing set. This study indicated that deep learning models combined with conditional WGAN with the gradient penalty data augmentation technique had a good application prospect in the grape maturity assessment.</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"235 \",\"pages\":\"Article 110341\"},\"PeriodicalIF\":7.7000,\"publicationDate\":\"2025-04-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169925004478\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925004478","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

无损和快速的葡萄成熟度检测对于葡萄酒行业来说是非常重要的。高光谱成像技术和深度学习方法的不断发展极大地帮助了葡萄质量和成熟度的无损评估，但深度学习方法的性能取决于用于训练的标记数据的数量和质量。建立非破坏性的葡萄质量或成熟度测试数据集需要损坏的葡萄进行化学分析，以产生耗时和资源密集的标签。为了解决这一问题，本研究提出了一种条件Wasserstain生成对抗网络（WGAN），结合梯度惩罚数据增强技术，生成两种葡萄成熟度类别（成熟和未成熟）和不同总可溶性固溶物（TSS）值的合成高光谱反射率数据。对具有梯度惩罚的条件WGAN进行了一系列epoch的训练：500、1000、2000、8000、10,000和20,000。经过1万次历元的训练，各成熟度类别和不同TSS值的合成高光谱反射率数据与真实光谱非常接近。然后在原始训练数据集和合成+原始训练数据集上训练上下文深度三维CNN （3D-CNN）、空间残差网络（SSRN）和支持向量机（SVM）进行葡萄成熟度分类。合成的高光谱反射率数据，以250、500、1000、1500和2000个样本的步骤逐步添加到原始训练集中，与单独在原始数据集上训练相比，始终产生更高的模型性能。通过使用2000个合成样本增强训练数据集并使用3D-CNN进行训练，获得了最好的结果，在测试集上产生了91%的分类准确率。为了更好地评估基于gan的数据增强方法的有效性，基于相同的数据增强方法，使用了两种广泛使用的回归模型：偏最小二乘回归（PLSR）和一维CNN （1D-CNN）。当训练1D-CNN模型时，在原始训练集上添加250个合成样本获得了最好的结果，测试集的R2为0.78，RMSE为0.63°Brix， RPIQ为3.36。该研究表明，深度学习模型结合条件WGAN和梯度惩罚数据增强技术在葡萄成熟度评估中具有良好的应用前景。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Synthetic hyperspectral reflectance data augmentation by generative adversarial network to enhance grape maturity determination

Non-destructive and rapid grape maturity detection is important for the wine industry. The ongoing development of hyperspectral imaging techniques and deep learning methods has greatly helped in non-destructive assessing of grape quality and maturity, but the performance of deep learning methods depends on the volume and the quality of labeled data for training. Building non-destructive grape quality or maturity testing datasets requires damaging grapes for chemical analysis to produce labels which are time consuming and resource intensive. To solve this problem, this study proposed a conditional Wasserstain Generative Adversarial Network (WGAN) with the gradient penalty data augmentation technique to generate synthetic hyperspectral reflectance data of two grape maturity categories (ripe and unripe) and different Total Soluble Solids (TSS) values. The conditional WGAN with the gradient penalty was trained for a range of epochs: 500, 1000, 2000, 8000, 10,000, and 20,000. After training of 10,000 epochs, synthetic hyperspectral reflectance data were very similar to real spectra for each maturity category and different TSS values. Thereafter, contextual deep three-dimensional CNN (3D-CNN), Spatial Residual Network (SSRN) and Support Vector Machine (SVM) are trained on original training and synthetic + original training datasets to classify grape maturity. The synthetic hyperspectral reflectance data, incrementally added to the original training set in steps of 250, 500, 1000, 1500, and 2000 samples, consistently resulted in higher model performance compared to training solely on the original dataset. The best results were achieved by augmenting the training dataset with 2000 synthetic samples and training with a 3D-CNN, yielding a classification accuracy of 91 % on the testing set. To better assess the effectiveness of GAN-based data augmentation methods, two widely used regression models: Partial Least Squares Regression (PLSR) and one-dimensional CNN (1D-CNN) were used based on same data augmentation method. The best result was achieved by adding 250 synthetic samples to the original training set when training 1D-CNN model, yielding an R² of 0.78, RMSE of 0.63 °Brix, and RPIQ of 3.36 on the testing set. This study indicated that deep learning models combined with conditional WGAN with the gradient penalty data augmentation technique had a good application prospect in the grape maturity assessment.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computers and Electronics in Agriculture 工程技术-计算机：跨学科应用

CiteScore

15.30

自引率

14.50%

发文量

800

审稿时长

62 days

期刊介绍： Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.