Comparative Evaluation of Modified Wasserstein GAN-GP and State-of-the-Art GAN Models for Synthesizing Agricultural Weed Images in RGB and Infrared Domain
{"title":"Comparative Evaluation of Modified Wasserstein GAN-GP and State-of-the-Art GAN Models for Synthesizing Agricultural Weed Images in RGB and Infrared Domain","authors":"Shubham Rana, Matteo Gatti","doi":"10.1016/j.mex.2025.103309","DOIUrl":null,"url":null,"abstract":"<div><div>This study investigates the application of modified Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to generate synthetic RGB and infrared (IR) datasets to meet the annotation requirements for wild radish (Raphanus raphanistrum). The RafanoSet dataset was used for evaluation. Traditional WGAN models struggle with vanishing gradients and poor convergence, affecting data quality. Customizations in WGAN-GP improved synthetic image quality, especially in maintaining SSIM for RGB datasets. However, generating high-quality IR images remains challenging due to spectral complexities, with lower SSIM scores. Architectural enhancements including transposed convolutions, dropout, and selective batch normalization improved SSIM scores from 0.5364 to 0.6615 for RGB and from 0.3306 to 0.4154 for IR images. This study highlights the customized model's key features:<ul><li><span>•</span><span><div>Produces a 128 × 7 × 7 tensor, optimizes feature map size for subsequent layers, with two layers using 4 × 4 kernels and 128 and 64 filters for upsampling.</div></span></li><li><span>•</span><span><div>Uses 3 × 3 kernels in all convolutional layers to capture fine-grained spatial features, incorporates batch normalization for training stability, and applies dropout to reduce overfitting and improve generalization.</div></span></li></ul></div></div>","PeriodicalId":18446,"journal":{"name":"MethodsX","volume":"14 ","pages":"Article 103309"},"PeriodicalIF":1.6000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MethodsX","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2215016125001554","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
This study investigates the application of modified Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to generate synthetic RGB and infrared (IR) datasets to meet the annotation requirements for wild radish (Raphanus raphanistrum). The RafanoSet dataset was used for evaluation. Traditional WGAN models struggle with vanishing gradients and poor convergence, affecting data quality. Customizations in WGAN-GP improved synthetic image quality, especially in maintaining SSIM for RGB datasets. However, generating high-quality IR images remains challenging due to spectral complexities, with lower SSIM scores. Architectural enhancements including transposed convolutions, dropout, and selective batch normalization improved SSIM scores from 0.5364 to 0.6615 for RGB and from 0.3306 to 0.4154 for IR images. This study highlights the customized model's key features:
•
Produces a 128 × 7 × 7 tensor, optimizes feature map size for subsequent layers, with two layers using 4 × 4 kernels and 128 and 64 filters for upsampling.
•
Uses 3 × 3 kernels in all convolutional layers to capture fine-grained spatial features, incorporates batch normalization for training stability, and applies dropout to reduce overfitting and improve generalization.