Xuetong Tao, Ziba Gandomkar, Tong Li, Patrick C Brennan, Warren M Reed
{"title":"Radiomic Analysis of Cohort-Specific Diagnostic Errors in Reading Dense Mammograms Using Artificial Intelligence.","authors":"Xuetong Tao, Ziba Gandomkar, Tong Li, Patrick C Brennan, Warren M Reed","doi":"10.1093/bjr/tqae195","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>This study aims to investigate radiologists' interpretation errors when reading dense screening mammograms using a radiomics-based artificial intelligence approach.</p><p><strong>Methods: </strong>Thirty-six radiologists from China and Australia read 60 dense mammograms. For each cohort, we identified normal areas that looked suspicious of cancer and the malignant areas containing cancers. Then radiomic features were extracted from these identified areas and random forest models were trained to recognize the areas that were most frequently linked to diagnostic errors within each cohort. The performance of the model and discriminatory power of significant radiomic features were assessed.</p><p><strong>Results: </strong>We found that in the Chinese cohort, the AUC values for predicting false positives were 0.864 (CC) and 0.829 (MLO), while in the Australian cohort, they were 0.652 (CC) and 0.747 (MLO). For false negatives, the AUC values in the Chinese cohort were 0.677 (CC) and 0.673 (MLO), and in the Australian cohort, they were 0.600 (CC) and 0.505 (MLO). In both cohorts, regions with higher Gabor and maximum response filter outputs were more prone to false positives, while areas with significant intensity changes and coarse textures were more likely to yield false negatives.</p><p><strong>Conclusions: </strong>This cohort-based pipeline proves effective in identifying common errors for specific reader cohorts based on image-derived radiomic features.</p><p><strong>Advances in knowledge: </strong>This study demonstrates that radiomics-based AI can effectively identify and predict radiologists' interpretation errors in dense mammograms, with distinct radiomic features linked to false positives and false negatives in Chinese and Australian cohorts.</p>","PeriodicalId":9306,"journal":{"name":"British Journal of Radiology","volume":" ","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"British Journal of Radiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/bjr/tqae195","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: This study aims to investigate radiologists' interpretation errors when reading dense screening mammograms using a radiomics-based artificial intelligence approach.
Methods: Thirty-six radiologists from China and Australia read 60 dense mammograms. For each cohort, we identified normal areas that looked suspicious of cancer and the malignant areas containing cancers. Then radiomic features were extracted from these identified areas and random forest models were trained to recognize the areas that were most frequently linked to diagnostic errors within each cohort. The performance of the model and discriminatory power of significant radiomic features were assessed.
Results: We found that in the Chinese cohort, the AUC values for predicting false positives were 0.864 (CC) and 0.829 (MLO), while in the Australian cohort, they were 0.652 (CC) and 0.747 (MLO). For false negatives, the AUC values in the Chinese cohort were 0.677 (CC) and 0.673 (MLO), and in the Australian cohort, they were 0.600 (CC) and 0.505 (MLO). In both cohorts, regions with higher Gabor and maximum response filter outputs were more prone to false positives, while areas with significant intensity changes and coarse textures were more likely to yield false negatives.
Conclusions: This cohort-based pipeline proves effective in identifying common errors for specific reader cohorts based on image-derived radiomic features.
Advances in knowledge: This study demonstrates that radiomics-based AI can effectively identify and predict radiologists' interpretation errors in dense mammograms, with distinct radiomic features linked to false positives and false negatives in Chinese and Australian cohorts.
期刊介绍:
BJR is the international research journal of the British Institute of Radiology and is the oldest scientific journal in the field of radiology and related sciences.
Dating back to 1896, BJR’s history is radiology’s history, and the journal has featured some landmark papers such as the first description of Computed Tomography "Computerized transverse axial tomography" by Godfrey Hounsfield in 1973. A valuable historical resource, the complete BJR archive has been digitized from 1896.
Quick Facts:
- 2015 Impact Factor – 1.840
- Receipt to first decision – average of 6 weeks
- Acceptance to online publication – average of 3 weeks
- ISSN: 0007-1285
- eISSN: 1748-880X
Open Access option