Generating Brain MRI with StyleGAN2-ADA: The Effect of the Training Set Size on the Quality of Synthetic Images.

Journal of imaging informatics in medicine Pub Date : 2025-09-23 DOI:10.1007/s10278-025-01536-0

Matteo Lai, Mario Mascalchi, Carlo Tessa, Stefano Diciotti

{"title":"Generating Brain MRI with StyleGAN2-ADA: The Effect of the Training Set Size on the Quality of Synthetic Images.","authors":"Matteo Lai, Mario Mascalchi, Carlo Tessa, Stefano Diciotti","doi":"10.1007/s10278-025-01536-0","DOIUrl":null,"url":null,"abstract":"<p><p>The potential of deep learning for medical imaging is often constrained by limited data availability. Generative models can unlock this potential by generating synthetic data that reproduces the statistical properties of real data while being more accessible for sharing. In this study, we investigated the influence of training set size on the performance of a state-of-the-art generative adversarial network, the StyleGAN2-ADA, trained on a cohort of 3,227 subjects from the OpenBHB dataset to generate 2D slices of brain MR images from healthy subjects. The quality of the synthetic images was assessed through qualitative evaluations and state-of-the-art quantitative metrics, which are provided in a publicly accessible repository. Our results demonstrate that StyleGAN2-ADA generates realistic and high-quality images, deceiving even expert radiologists while preserving privacy, as it did not memorize training images. Notably, increasing the training set size led to slight improvements in fidelity metrics. However, training set size had no noticeable impact on diversity metrics, highlighting the persistent limitation of mode collapse. Furthermore, we observed that diversity metrics, such as coverage and β-recall, are highly sensitive to the number of synthetic images used in their computation, leading to inflated values when synthetic data significantly outnumber real ones. These findings underscore the need to carefully interpret diversity metrics and the importance of employing complementary evaluation strategies for robust assessment. Overall, while StyleGAN2-ADA shows promise as a tool for generating privacy-preserving synthetic medical images, overcoming diversity limitations will require exploring alternative generative architectures or incorporating additional regularization techniques.</p>","PeriodicalId":516858,"journal":{"name":"Journal of imaging informatics in medicine","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of imaging informatics in medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s10278-025-01536-0","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The potential of deep learning for medical imaging is often constrained by limited data availability. Generative models can unlock this potential by generating synthetic data that reproduces the statistical properties of real data while being more accessible for sharing. In this study, we investigated the influence of training set size on the performance of a state-of-the-art generative adversarial network, the StyleGAN2-ADA, trained on a cohort of 3,227 subjects from the OpenBHB dataset to generate 2D slices of brain MR images from healthy subjects. The quality of the synthetic images was assessed through qualitative evaluations and state-of-the-art quantitative metrics, which are provided in a publicly accessible repository. Our results demonstrate that StyleGAN2-ADA generates realistic and high-quality images, deceiving even expert radiologists while preserving privacy, as it did not memorize training images. Notably, increasing the training set size led to slight improvements in fidelity metrics. However, training set size had no noticeable impact on diversity metrics, highlighting the persistent limitation of mode collapse. Furthermore, we observed that diversity metrics, such as coverage and β-recall, are highly sensitive to the number of synthetic images used in their computation, leading to inflated values when synthetic data significantly outnumber real ones. These findings underscore the need to carefully interpret diversity metrics and the importance of employing complementary evaluation strategies for robust assessment. Overall, while StyleGAN2-ADA shows promise as a tool for generating privacy-preserving synthetic medical images, overcoming diversity limitations will require exploring alternative generative architectures or incorporating additional regularization techniques.

查看原文本刊更多论文

用StyleGAN2-ADA生成脑MRI：训练集大小对合成图像质量的影响。

深度学习在医学成像方面的潜力往往受到有限的数据可用性的限制。生成模型可以通过生成合成数据来释放这一潜力，这些合成数据再现了真实数据的统计属性，同时更易于共享。在这项研究中，我们研究了训练集大小对最先进的生成对抗网络StyleGAN2-ADA性能的影响，StyleGAN2-ADA在来自OpenBHB数据集的3,227名受试者的队列上进行训练，以生成健康受试者的二维脑磁共振图像切片。合成图像的质量通过定性评价和最先进的定量指标进行评估，这些指标在一个公开访问的存储库中提供。我们的研究结果表明，StyleGAN2-ADA生成了逼真的高质量图像，甚至欺骗了放射科专家，同时保护了隐私，因为它没有记住训练图像。值得注意的是，增加训练集大小导致保真度指标略有改善。然而，训练集大小对多样性指标没有显著影响，突出了模式崩溃的持续局限性。此外，我们观察到多样性指标，如覆盖率和β-召回率，对其计算中使用的合成图像的数量高度敏感，当合成数据的数量明显超过真实数据时，会导致值膨胀。这些发现强调了仔细解释多样性指标的必要性，以及采用互补评估策略进行稳健评估的重要性。总的来说，虽然StyleGAN2-ADA显示了作为生成保护隐私的合成医学图像的工具的希望，但克服多样性限制将需要探索替代生成架构或结合额外的正则化技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of imaging informatics in medicine

自引率

0.00%

发文量