{"title":"改进的Wasserstein GAN- gp模型与最先进的GAN模型在RGB和红外域合成农业杂草图像的比较评价","authors":"Shubham Rana, Matteo Gatti","doi":"10.1016/j.mex.2025.103309","DOIUrl":null,"url":null,"abstract":"<div><div>This study investigates the application of modified Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to generate synthetic RGB and infrared (IR) datasets to meet the annotation requirements for wild radish (Raphanus raphanistrum). The RafanoSet dataset was used for evaluation. Traditional WGAN models struggle with vanishing gradients and poor convergence, affecting data quality. Customizations in WGAN-GP improved synthetic image quality, especially in maintaining SSIM for RGB datasets. However, generating high-quality IR images remains challenging due to spectral complexities, with lower SSIM scores. Architectural enhancements including transposed convolutions, dropout, and selective batch normalization improved SSIM scores from 0.5364 to 0.6615 for RGB and from 0.3306 to 0.4154 for IR images. This study highlights the customized model's key features:<ul><li><span>•</span><span><div>Produces a 128 × 7 × 7 tensor, optimizes feature map size for subsequent layers, with two layers using 4 × 4 kernels and 128 and 64 filters for upsampling.</div></span></li><li><span>•</span><span><div>Uses 3 × 3 kernels in all convolutional layers to capture fine-grained spatial features, incorporates batch normalization for training stability, and applies dropout to reduce overfitting and improve generalization.</div></span></li></ul></div></div>","PeriodicalId":18446,"journal":{"name":"MethodsX","volume":"14 ","pages":"Article 103309"},"PeriodicalIF":1.6000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparative Evaluation of Modified Wasserstein GAN-GP and State-of-the-Art GAN Models for Synthesizing Agricultural Weed Images in RGB and Infrared Domain\",\"authors\":\"Shubham Rana, Matteo Gatti\",\"doi\":\"10.1016/j.mex.2025.103309\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This study investigates the application of modified Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to generate synthetic RGB and infrared (IR) datasets to meet the annotation requirements for wild radish (Raphanus raphanistrum). The RafanoSet dataset was used for evaluation. Traditional WGAN models struggle with vanishing gradients and poor convergence, affecting data quality. Customizations in WGAN-GP improved synthetic image quality, especially in maintaining SSIM for RGB datasets. However, generating high-quality IR images remains challenging due to spectral complexities, with lower SSIM scores. Architectural enhancements including transposed convolutions, dropout, and selective batch normalization improved SSIM scores from 0.5364 to 0.6615 for RGB and from 0.3306 to 0.4154 for IR images. This study highlights the customized model's key features:<ul><li><span>•</span><span><div>Produces a 128 × 7 × 7 tensor, optimizes feature map size for subsequent layers, with two layers using 4 × 4 kernels and 128 and 64 filters for upsampling.</div></span></li><li><span>•</span><span><div>Uses 3 × 3 kernels in all convolutional layers to capture fine-grained spatial features, incorporates batch normalization for training stability, and applies dropout to reduce overfitting and improve generalization.</div></span></li></ul></div></div>\",\"PeriodicalId\":18446,\"journal\":{\"name\":\"MethodsX\",\"volume\":\"14 \",\"pages\":\"Article 103309\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2025-04-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"MethodsX\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2215016125001554\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"MethodsX","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2215016125001554","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Comparative Evaluation of Modified Wasserstein GAN-GP and State-of-the-Art GAN Models for Synthesizing Agricultural Weed Images in RGB and Infrared Domain
This study investigates the application of modified Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to generate synthetic RGB and infrared (IR) datasets to meet the annotation requirements for wild radish (Raphanus raphanistrum). The RafanoSet dataset was used for evaluation. Traditional WGAN models struggle with vanishing gradients and poor convergence, affecting data quality. Customizations in WGAN-GP improved synthetic image quality, especially in maintaining SSIM for RGB datasets. However, generating high-quality IR images remains challenging due to spectral complexities, with lower SSIM scores. Architectural enhancements including transposed convolutions, dropout, and selective batch normalization improved SSIM scores from 0.5364 to 0.6615 for RGB and from 0.3306 to 0.4154 for IR images. This study highlights the customized model's key features:
•
Produces a 128 × 7 × 7 tensor, optimizes feature map size for subsequent layers, with two layers using 4 × 4 kernels and 128 and 64 filters for upsampling.
•
Uses 3 × 3 kernels in all convolutional layers to capture fine-grained spatial features, incorporates batch normalization for training stability, and applies dropout to reduce overfitting and improve generalization.