Shaoju Wu, Sila Kurugol, Paul K Kleinman, Kirsten Ecklund, Michele Walters, Susan A Connolly, Patrick Johnston, Andy Tsai
{"title":"Deep Generative Model of the Distal Tibial Classic Metaphyseal Lesion in Infants: Assessment of Synthetic Images","authors":"Shaoju Wu, Sila Kurugol, Paul K Kleinman, Kirsten Ecklund, Michele Walters, Susan A Connolly, Patrick Johnston, Andy Tsai","doi":"10.1093/radadv/umae018","DOIUrl":null,"url":null,"abstract":"\n \n \n The classic metaphyseal lesion (CML) is a distinctive fracture highly specific to infant abuse. To increase the size and diversity of the training CML database for automated deep-learning detection of this fracture, we developed a mask conditional diffusion model (MaC-DM) to generate synthetic images with and without CMLs.\n \n \n \n To objectively and subjectively assess the synthetic radiographic images with and without CMLs generated by MaC-DM.\n \n \n \n For retrospective testing, we randomly chose 100 real images (50 normals and 50 with CMLs; 39 infants, male = 22, female = 17; mean age = 4.1 months; SD = 3.1 months) from an existing distal tibia dataset (177 normal, 73 with CMLs), and generated 100 synthetic distal tibia images via MaC-DM (50 normals and 50 with CMLs). These test images were shown to three blinded radiologists. In the 1st session, radiologists determined if the images were normal or had CMLs. In the 2nd session, they determined if the images were real or synthetic. We analyzed the radiologists’ interpretations, and employed t-distributed stochastic neighbor embedding (t-SNE) technique to analyze the data distribution of the test images.\n \n \n \n When presented with the 200 images (100 synthetic, 100 with CMLs), radiologists reliably and accurately diagnosed CMLs (kappa = 0.90, 95% CI = [0.88, 0.92]; accuracy = 92%, 95% CI = [89%, 97%]). However, they were inaccurate in differentiating between real and synthetic images (kappa = 0.05, 95% CI = [0.03, 0.07]; accuracy = 53%, 95% CI = [49%, 59%]). The t-SNE analysis showed substantial differences in the data distribution between normal images and those with CMLs (AUC = 0.996, 95% CI = [0.992, 1.000], P < 0.01), but minor differences between real and synthetic images (AUC = 0.566, 95% CI = [0.486, 0.647], P = 0.11).\n \n \n \n Radiologists accurately diagnosed images with distal tibial CMLs but were unable to distinguish real from synthetically generated ones, indicating that our generative model could synthesize realistic images. Thus, MaC-DM holds promise as an effective strategy for data augmentation in training machine-learning models for diagnosis of distal tibial CMLs.\n","PeriodicalId":519940,"journal":{"name":"Radiology advances","volume":" 12","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiology advances","FirstCategoryId":"0","ListUrlMain":"https://doi.org/10.1093/radadv/umae018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The classic metaphyseal lesion (CML) is a distinctive fracture highly specific to infant abuse. To increase the size and diversity of the training CML database for automated deep-learning detection of this fracture, we developed a mask conditional diffusion model (MaC-DM) to generate synthetic images with and without CMLs.
To objectively and subjectively assess the synthetic radiographic images with and without CMLs generated by MaC-DM.
For retrospective testing, we randomly chose 100 real images (50 normals and 50 with CMLs; 39 infants, male = 22, female = 17; mean age = 4.1 months; SD = 3.1 months) from an existing distal tibia dataset (177 normal, 73 with CMLs), and generated 100 synthetic distal tibia images via MaC-DM (50 normals and 50 with CMLs). These test images were shown to three blinded radiologists. In the 1st session, radiologists determined if the images were normal or had CMLs. In the 2nd session, they determined if the images were real or synthetic. We analyzed the radiologists’ interpretations, and employed t-distributed stochastic neighbor embedding (t-SNE) technique to analyze the data distribution of the test images.
When presented with the 200 images (100 synthetic, 100 with CMLs), radiologists reliably and accurately diagnosed CMLs (kappa = 0.90, 95% CI = [0.88, 0.92]; accuracy = 92%, 95% CI = [89%, 97%]). However, they were inaccurate in differentiating between real and synthetic images (kappa = 0.05, 95% CI = [0.03, 0.07]; accuracy = 53%, 95% CI = [49%, 59%]). The t-SNE analysis showed substantial differences in the data distribution between normal images and those with CMLs (AUC = 0.996, 95% CI = [0.992, 1.000], P < 0.01), but minor differences between real and synthetic images (AUC = 0.566, 95% CI = [0.486, 0.647], P = 0.11).
Radiologists accurately diagnosed images with distal tibial CMLs but were unable to distinguish real from synthetically generated ones, indicating that our generative model could synthesize realistic images. Thus, MaC-DM holds promise as an effective strategy for data augmentation in training machine-learning models for diagnosis of distal tibial CMLs.