基于扩散模型的医学图像生成作为人工智能应用的潜在数据增强策略。

IF 1.1 4区医学 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Current Medical Imaging Reviews Pub Date : 2025-09-01 DOI:10.2174/0115734056401610250827114351

Zijian Cao, Jueye Zhang, Chen Lin, Tian Li, Hao Wu, Yibao Zhang

{"title":"基于扩散模型的医学图像生成作为人工智能应用的潜在数据增强策略。","authors":"Zijian Cao, Jueye Zhang, Chen Lin, Tian Li, Hao Wu, Yibao Zhang","doi":"10.2174/0115734056401610250827114351","DOIUrl":null,"url":null,"abstract":"Introduction: This study explored a generative image synthesis method based on diffusion models, potentially providing a low-cost and high-efficiency training data augmentation strategy for medical artificial intelligence (AI) applications.Methods: The MedMNIST v2 dataset was utilized as a small-volume training dataset under low-performance computing conditions. Based on the characteristics of existing samples, new medical images were synthesized using the proposed annotated diffusion model. In addition to observational assessment, quantitative evaluation was performed based on the gradient descent of the loss function during the generation process and the Fréchet Inception Distance (FID), using various loss functions and feature vector dimensions.Results: Compared to the original data, the proposed diffusion model successfully generated medical images of similar styles but with dramatically varied anatomic details. The model trained with the Huber loss function achieved a higher FID of 15.2 at a feature vector dimension of 2048, compared with the model trained with the L2 loss function, which achieved the best FID of 0.85 at a feature vector dimension of 64.Discussion: The use of the Huber loss enhanced model robustness, while FID values indicated acceptable similarity between generated and real images. Future work should explore the application of these models to more complex datasets and clinical scenarios.Conclusion: This study demonstrated that diffusion model-based medical image synthesis is potentially applicable as an augmentation strategy for AI, particularly in situations where access to real clinical data is limited. Optimal training parameters were also proposed by evaluating the dimensionality of feature vectors in FID calculations and the complexity of loss functions.","PeriodicalId":54215,"journal":{"name":"Current Medical Imaging Reviews","volume":" ","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Diffusion Model-based Medical Image Generation as a Potential Data Augmentation Strategy for AI Applications.\",\"authors\":\"Zijian Cao, Jueye Zhang, Chen Lin, Tian Li, Hao Wu, Yibao Zhang\",\"doi\":\"10.2174/0115734056401610250827114351\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: This study explored a generative image synthesis method based on diffusion models, potentially providing a low-cost and high-efficiency training data augmentation strategy for medical artificial intelligence (AI) applications.Methods: The MedMNIST v2 dataset was utilized as a small-volume training dataset under low-performance computing conditions. Based on the characteristics of existing samples, new medical images were synthesized using the proposed annotated diffusion model. In addition to observational assessment, quantitative evaluation was performed based on the gradient descent of the loss function during the generation process and the Fréchet Inception Distance (FID), using various loss functions and feature vector dimensions.Results: Compared to the original data, the proposed diffusion model successfully generated medical images of similar styles but with dramatically varied anatomic details. The model trained with the Huber loss function achieved a higher FID of 15.2 at a feature vector dimension of 2048, compared with the model trained with the L2 loss function, which achieved the best FID of 0.85 at a feature vector dimension of 64.Discussion: The use of the Huber loss enhanced model robustness, while FID values indicated acceptable similarity between generated and real images. Future work should explore the application of these models to more complex datasets and clinical scenarios.Conclusion: This study demonstrated that diffusion model-based medical image synthesis is potentially applicable as an augmentation strategy for AI, particularly in situations where access to real clinical data is limited. Optimal training parameters were also proposed by evaluating the dimensionality of feature vectors in FID calculations and the complexity of loss functions.\",\"PeriodicalId\":54215,\"journal\":{\"name\":\"Current Medical Imaging Reviews\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Current Medical Imaging Reviews\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.2174/0115734056401610250827114351\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Medical Imaging Reviews","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2174/0115734056401610250827114351","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

摘要

本研究探索了一种基于扩散模型的生成式图像合成方法，有望为医疗人工智能（AI）应用提供一种低成本、高效率的训练数据增强策略。方法：利用MedMNIST v2数据集作为低性能计算条件下的小体积训练数据集。基于现有样本的特征，利用所提出的带注释扩散模型合成新的医学图像。除了观测评价外，还利用各种损失函数和特征向量维数，基于生成过程中损失函数的梯度下降和fr起始距离（FID）进行定量评价。结果：与原始数据相比，所提出的扩散模型成功地生成了风格相似但解剖细节差异很大的医学图像。与使用L2损失函数训练的模型相比，使用Huber损失函数训练的模型在特征向量维数为2048时获得了更高的FID为15.2，而使用L2损失函数训练的模型在特征向量维数为64时获得了0.85的最佳FID。讨论：Huber损失的使用增强了模型的鲁棒性，而FID值表明生成图像和真实图像之间的相似性是可以接受的。未来的工作应该探索这些模型在更复杂的数据集和临床场景中的应用。结论：本研究表明，基于扩散模型的医学图像合成可能适用于人工智能的增强策略，特别是在获取真实临床数据有限的情况下。通过评估FID计算中特征向量的维数和损失函数的复杂度，提出了最优训练参数。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Diffusion Model-based Medical Image Generation as a Potential Data Augmentation Strategy for AI Applications.

Introduction: This study explored a generative image synthesis method based on diffusion models, potentially providing a low-cost and high-efficiency training data augmentation strategy for medical artificial intelligence (AI) applications.

Methods: The MedMNIST v2 dataset was utilized as a small-volume training dataset under low-performance computing conditions. Based on the characteristics of existing samples, new medical images were synthesized using the proposed annotated diffusion model. In addition to observational assessment, quantitative evaluation was performed based on the gradient descent of the loss function during the generation process and the Fréchet Inception Distance (FID), using various loss functions and feature vector dimensions.

Results: Compared to the original data, the proposed diffusion model successfully generated medical images of similar styles but with dramatically varied anatomic details. The model trained with the Huber loss function achieved a higher FID of 15.2 at a feature vector dimension of 2048, compared with the model trained with the L2 loss function, which achieved the best FID of 0.85 at a feature vector dimension of 64.

Discussion: The use of the Huber loss enhanced model robustness, while FID values indicated acceptable similarity between generated and real images. Future work should explore the application of these models to more complex datasets and clinical scenarios.

Conclusion: This study demonstrated that diffusion model-based medical image synthesis is potentially applicable as an augmentation strategy for AI, particularly in situations where access to real clinical data is limited. Optimal training parameters were also proposed by evaluating the dimensionality of feature vectors in FID calculations and the complexity of loss functions.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Current Medical Imaging Reviews 医学-核医学

CiteScore

2.60

自引率

0.00%

发文量

246

审稿时长

1 months

期刊介绍： Current Medical Imaging Reviews publishes frontier review articles, original research articles, drug clinical trial studies and guest edited thematic issues on all the latest advances on medical imaging dedicated to clinical research. All relevant areas are covered by the journal, including advances in the diagnosis, instrumentation and therapeutic applications related to all modern medical imaging techniques. The journal is essential reading for all clinicians and researchers involved in medical imaging and diagnosis.