Deep Generative Model of the Distal Tibial Classic Metaphyseal Lesion in Infants: Assessment of Synthetic Images

Radiology advances Pub Date : 2024-07-04 DOI:10.1093/radadv/umae018

Shaoju Wu, Sila Kurugol, Paul K Kleinman, Kirsten Ecklund, Michele Walters, Susan A Connolly, Patrick Johnston, Andy Tsai

{"title":"Deep Generative Model of the Distal Tibial Classic Metaphyseal Lesion in Infants: Assessment of Synthetic Images","authors":"Shaoju Wu, Sila Kurugol, Paul K Kleinman, Kirsten Ecklund, Michele Walters, Susan A Connolly, Patrick Johnston, Andy Tsai","doi":"10.1093/radadv/umae018","DOIUrl":null,"url":null,"abstract":"\n \n \n The classic metaphyseal lesion (CML) is a distinctive fracture highly specific to infant abuse. To increase the size and diversity of the training CML database for automated deep-learning detection of this fracture, we developed a mask conditional diffusion model (MaC-DM) to generate synthetic images with and without CMLs.\n \n \n \n To objectively and subjectively assess the synthetic radiographic images with and without CMLs generated by MaC-DM.\n \n \n \n For retrospective testing, we randomly chose 100 real images (50 normals and 50 with CMLs; 39 infants, male = 22, female = 17; mean age = 4.1 months; SD = 3.1 months) from an existing distal tibia dataset (177 normal, 73 with CMLs), and generated 100 synthetic distal tibia images via MaC-DM (50 normals and 50 with CMLs). These test images were shown to three blinded radiologists. In the 1st session, radiologists determined if the images were normal or had CMLs. In the 2nd session, they determined if the images were real or synthetic. We analyzed the radiologists’ interpretations, and employed t-distributed stochastic neighbor embedding (t-SNE) technique to analyze the data distribution of the test images.\n \n \n \n When presented with the 200 images (100 synthetic, 100 with CMLs), radiologists reliably and accurately diagnosed CMLs (kappa = 0.90, 95% CI = [0.88, 0.92]; accuracy = 92%, 95% CI = [89%, 97%]). However, they were inaccurate in differentiating between real and synthetic images (kappa = 0.05, 95% CI = [0.03, 0.07]; accuracy = 53%, 95% CI = [49%, 59%]). The t-SNE analysis showed substantial differences in the data distribution between normal images and those with CMLs (AUC = 0.996, 95% CI = [0.992, 1.000], P < 0.01), but minor differences between real and synthetic images (AUC = 0.566, 95% CI = [0.486, 0.647], P = 0.11).\n \n \n \n Radiologists accurately diagnosed images with distal tibial CMLs but were unable to distinguish real from synthetically generated ones, indicating that our generative model could synthesize realistic images. Thus, MaC-DM holds promise as an effective strategy for data augmentation in training machine-learning models for diagnosis of distal tibial CMLs.\n","PeriodicalId":519940,"journal":{"name":"Radiology advances","volume":" 12","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiology advances","FirstCategoryId":"0","ListUrlMain":"https://doi.org/10.1093/radadv/umae018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The classic metaphyseal lesion (CML) is a distinctive fracture highly specific to infant abuse. To increase the size and diversity of the training CML database for automated deep-learning detection of this fracture, we developed a mask conditional diffusion model (MaC-DM) to generate synthetic images with and without CMLs. To objectively and subjectively assess the synthetic radiographic images with and without CMLs generated by MaC-DM. For retrospective testing, we randomly chose 100 real images (50 normals and 50 with CMLs; 39 infants, male = 22, female = 17; mean age = 4.1 months; SD = 3.1 months) from an existing distal tibia dataset (177 normal, 73 with CMLs), and generated 100 synthetic distal tibia images via MaC-DM (50 normals and 50 with CMLs). These test images were shown to three blinded radiologists. In the 1st session, radiologists determined if the images were normal or had CMLs. In the 2nd session, they determined if the images were real or synthetic. We analyzed the radiologists’ interpretations, and employed t-distributed stochastic neighbor embedding (t-SNE) technique to analyze the data distribution of the test images. When presented with the 200 images (100 synthetic, 100 with CMLs), radiologists reliably and accurately diagnosed CMLs (kappa = 0.90, 95% CI = [0.88, 0.92]; accuracy = 92%, 95% CI = [89%, 97%]). However, they were inaccurate in differentiating between real and synthetic images (kappa = 0.05, 95% CI = [0.03, 0.07]; accuracy = 53%, 95% CI = [49%, 59%]). The t-SNE analysis showed substantial differences in the data distribution between normal images and those with CMLs (AUC = 0.996, 95% CI = [0.992, 1.000], P < 0.01), but minor differences between real and synthetic images (AUC = 0.566, 95% CI = [0.486, 0.647], P = 0.11). Radiologists accurately diagnosed images with distal tibial CMLs but were unable to distinguish real from synthetically generated ones, indicating that our generative model could synthesize realistic images. Thus, MaC-DM holds promise as an effective strategy for data augmentation in training machine-learning models for diagnosis of distal tibial CMLs.

查看原文本刊更多论文

婴儿胫骨远端经典骺损伤的深度生成模型：合成图像评估

典型的骺端病变（CML）是婴儿受虐时的一种特殊骨折。为了增加训练 CML 数据库的规模和多样性，以便对这种骨折进行自动深度学习检测，我们开发了一种掩模条件扩散模型（MaC-DM），用于生成有 CML 和无 CML 的合成图像。客观和主观地评估由 MaC-DM 生成的有 CML 和无 CML 的合成放射影像。为了进行回顾性测试，我们从现有的胫骨远端数据集（正常 177 例，CMLs 73 例）中随机选择了 100 幅真实图像（正常 50 例，CMLs 50 例；39 名婴儿，男 22 名，女 17 名；平均年龄 4.1 个月；SD = 3.1 个月），并通过 MaC-DM 生成了 100 幅合成胫骨远端图像（正常 50 例，CMLs 50 例）。这些测试图像展示给三位盲放射科医生。在第一个环节中，放射科医生判断图像是正常的还是有 CMLs。在第二个环节中，他们判断图像是真实的还是合成的。我们分析了放射科医生的解释，并采用 t 分布随机邻域嵌入（t-SNE）技术分析了测试图像的数据分布。当放射科医生看到 200 张图像（100 张合成图像，100 张有 CMLs 的图像）时，他们能可靠、准确地诊断出 CMLs（kappa = 0.90，95% CI = [0.88，0.92]；准确率 = 92%，95% CI = [89%，97%]）。然而，它们在区分真实图像和合成图像方面并不准确（kappa = 0.05，95% CI = [0.03，0.07]；准确率 = 53%，95% CI = [49%，59%]）。t-SNE 分析表明，正常图像和有 CMLs 的图像之间的数据分布差异很大（AUC = 0.996，95% CI = [0.992，1.000]，P < 0.01），但真实图像和合成图像之间的差异很小（AUC = 0.566，95% CI = [0.486，0.647]，P = 0.11）。放射科医生能准确诊断出胫骨远端 CML 图像，但无法区分真实图像和合成图像，这表明我们的生成模型能合成逼真的图像。因此，MaC-DM有望成为一种有效的数据增强策略，用于训练诊断胫骨远端CML的机器学习模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Radiology advances

自引率

0.00%

发文量