{"title":"Impact of artificial intelligence assistance on bone scintigraphy diagnosis.","authors":"Yosita Uchuwat, Natthanan Ruengchaijatuporn, Chanan Sukprakun, Sira Vachatimanont, Maythinee Chantadisai, Kanaungnit Kingpetch, Tawatchai Chaiwatanarat, Supatporn Tepmongkol, Chanittha Buakhao, Kitwiwat Phuangmali, Sira Sriswasdi, Yothin Rakvongthai","doi":"10.1007/s13246-025-01621-2","DOIUrl":null,"url":null,"abstract":"<p><p>Bone scintigraphy is an important tool for detecting bone lesions. This study aimed to improve and evaluate the performance of our previously-developed deep learning-based model called MaligNet in helping nuclear medicine (NM) physicians interpret bone scan. Bone scintigraphy of 553 patients with imaging data from six-month follow-up records were split into training, validation, and test sets in a ratio of 353:100:100 to re-train MaligNet. Seven nuclear medicine physicians, including two junior and five senior physicians, were asked to segment and classify lesions in the test set images without and with AI assistance, which was the prediction of MaligNet. The improved performance of MaligNet was evaluated using the precision-recall (PR) and receiver operating characteristic (ROC) curves for lesion-based and patient-based classifications, respectively. The impact of AI assistance on physician reading was evaluated using reading time per case and malignancy diagnostic performance metrics. The re-trained MaligNet yielded considerably higher area under the PR curve (0.334 vs. 0.225) and higher area under the ROC curve (0.881 vs. 0.789) than the original model. For patient-based classification, AI assistance improved the average accuracy, sensitivity, specificity, and precision of the physician by 2.14%, 0.89%, 2.38%, and 1.97%, respectively, while reducing the average reading time by 31.14%. For lesion-based classification, it improved physicians' average precision by 2.95%, but did not improve sensitivity. With AI assistance, junior physicians achieved diagnostic performances comparable to those of senior physicians. AI assistance with MaligNet improved bone scintigraphy diagnostic performance and showed promise in clinical practice.</p>","PeriodicalId":48490,"journal":{"name":"Physical and Engineering Sciences in Medicine","volume":" ","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physical and Engineering Sciences in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s13246-025-01621-2","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Bone scintigraphy is an important tool for detecting bone lesions. This study aimed to improve and evaluate the performance of our previously-developed deep learning-based model called MaligNet in helping nuclear medicine (NM) physicians interpret bone scan. Bone scintigraphy of 553 patients with imaging data from six-month follow-up records were split into training, validation, and test sets in a ratio of 353:100:100 to re-train MaligNet. Seven nuclear medicine physicians, including two junior and five senior physicians, were asked to segment and classify lesions in the test set images without and with AI assistance, which was the prediction of MaligNet. The improved performance of MaligNet was evaluated using the precision-recall (PR) and receiver operating characteristic (ROC) curves for lesion-based and patient-based classifications, respectively. The impact of AI assistance on physician reading was evaluated using reading time per case and malignancy diagnostic performance metrics. The re-trained MaligNet yielded considerably higher area under the PR curve (0.334 vs. 0.225) and higher area under the ROC curve (0.881 vs. 0.789) than the original model. For patient-based classification, AI assistance improved the average accuracy, sensitivity, specificity, and precision of the physician by 2.14%, 0.89%, 2.38%, and 1.97%, respectively, while reducing the average reading time by 31.14%. For lesion-based classification, it improved physicians' average precision by 2.95%, but did not improve sensitivity. With AI assistance, junior physicians achieved diagnostic performances comparable to those of senior physicians. AI assistance with MaligNet improved bone scintigraphy diagnostic performance and showed promise in clinical practice.
骨显像是检测骨病变的重要工具。本研究旨在改进和评估我们之前开发的基于深度学习的模型MaligNet在帮助核医学(NM)医生解释骨扫描方面的表现。对553例患者的骨显像数据进行为期6个月的随访记录,按353:100:100的比例分成训练组、验证组和测试组,重新训练MaligNet。7名核医学医生,包括2名初级医生和5名高级医生,被要求在没有人工智能帮助和有人工智能帮助的情况下对测试集图像中的病变进行分割和分类,这是MaligNet的预测。使用基于病变和基于患者的分类的精确召回率(PR)和受试者工作特征(ROC)曲线分别评估MaligNet的改进性能。使用每个病例的阅读时间和恶性肿瘤诊断性能指标来评估人工智能辅助对医生阅读的影响。与原始模型相比,重新训练的MaligNet产生了更高的PR曲线下面积(0.334 vs. 0.225)和更高的ROC曲线下面积(0.881 vs. 0.789)。对于基于患者的分类,AI辅助将医生的平均准确率、灵敏度、特异性和精度分别提高了2.14%、0.89%、2.38%和1.97%,平均阅读时间减少了31.14%。对于基于病变的分类,它使医生的平均准确率提高了2.95%,但没有提高灵敏度。在人工智能的帮助下,初级医生的诊断表现与高级医生相当。人工智能辅助MaligNet提高了骨显像诊断性能,并在临床实践中显示出前景。