Artificial intelligence and pelvic fracture diagnosis on X-rays: a preliminary study on performance, workflow integration and radiologists' feedback assessment in a spoke emergency hospital

IF 1.8 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

European Journal of Radiology Open Pub Date : 2023-07-06 DOI:10.1016/j.ejro.2023.100504

Francesca Rosa , Duccio Buccicardi , Adolfo Romano , Fabio Borda , Maria Chiara D’Auria , Alessandro Gastaldo

{"title":"Artificial intelligence and pelvic fracture diagnosis on X-rays: a preliminary study on performance, workflow integration and radiologists' feedback assessment in a spoke emergency hospital","authors":"Francesca Rosa , Duccio Buccicardi , Adolfo Romano , Fabio Borda , Maria Chiara D’Auria , Alessandro Gastaldo","doi":"10.1016/j.ejro.2023.100504","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><p>The aim of our study is to evaluate artificial intelligence (AI) support in pelvic fracture diagnosis on X-rays, focusing on performance, workflow integration and radiologists’ feedback in a spoke emergency hospital.</p></div><div><h3>Materials and methods</h3><p>Between August and November 2021, a total of 235 sites of fracture or suspected fracture were evaluated and enrolled in the prospective study. Radiologist’s specificity, sensibility accuracy, positive and negative predictive values were compared to AI. Cohen's kappa was used to calculate the agreement between AI and radiologist. We also reviewed the AI workflow integration process, focusing on potential issues and assessed radiologists’ opinion on AI via a survey.</p></div><div><h3>Results</h3><p>The radiologist performance in accuracy, sensitivity and specificity was better than AI but McNemar test demonstrated no statistically significant difference between AI and radiologist’s performance (<em>p</em> = 0.32). Calculated Cohen’s K of 0.64.</p></div><div><h3>Conclusion</h3><p>Contrary to expectations, our preliminary results did not prove a real improvement of patient outcome nor in reporting time but demonstrated AI high NPV (94,62%) and non-inferiority to radiologist performance. Moreover, the commercially available AI algorithm used in our study automatically learn from data and so we expect a progressive performance improvement. AI could be considered as a promising tool to rule-out fractures (especially when used as a “second reader”) and to prioritize positive cases, especially in increasing workload scenarios (ED, nightshifts) but further research is needed to evaluate the real impact on the clinical practice.</p></div>","PeriodicalId":38076,"journal":{"name":"European Journal of Radiology Open","volume":null,"pages":null},"PeriodicalIF":1.8000,"publicationDate":"2023-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10359726/pdf/","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Radiology Open","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352047723000308","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 1

Abstract

Purpose

The aim of our study is to evaluate artificial intelligence (AI) support in pelvic fracture diagnosis on X-rays, focusing on performance, workflow integration and radiologists’ feedback in a spoke emergency hospital.

Materials and methods

Between August and November 2021, a total of 235 sites of fracture or suspected fracture were evaluated and enrolled in the prospective study. Radiologist’s specificity, sensibility accuracy, positive and negative predictive values were compared to AI. Cohen's kappa was used to calculate the agreement between AI and radiologist. We also reviewed the AI workflow integration process, focusing on potential issues and assessed radiologists’ opinion on AI via a survey.

Results

The radiologist performance in accuracy, sensitivity and specificity was better than AI but McNemar test demonstrated no statistically significant difference between AI and radiologist’s performance (p = 0.32). Calculated Cohen’s K of 0.64.

Conclusion

Contrary to expectations, our preliminary results did not prove a real improvement of patient outcome nor in reporting time but demonstrated AI high NPV (94,62%) and non-inferiority to radiologist performance. Moreover, the commercially available AI algorithm used in our study automatically learn from data and so we expect a progressive performance improvement. AI could be considered as a promising tool to rule-out fractures (especially when used as a “second reader”) and to prioritize positive cases, especially in increasing workload scenarios (ED, nightshifts) but further research is needed to evaluate the real impact on the clinical practice.

Abstract Image

查看原文本刊更多论文

人工智能与骨盆骨折X光诊断——辐射急救医院绩效、工作流程集成和放射科医生反馈评估的初步研究

目的本研究的目的是评估人工智能（AI）在X光骨盆骨折诊断中的支持，重点关注辐射急救医院的绩效、工作流程集成和放射科医生的反馈。材料和方法在2021年8月至11月期间，共评估了235个骨折或疑似骨折部位，并将其纳入前瞻性研究。将放射科医生的特异性、敏感性、准确性、阳性和阴性预测值与AI进行比较。Cohen’s kappa用于计算AI与放射科医生之间的一致性。我们还回顾了人工智能工作流程集成过程，重点关注潜在问题，并通过调查评估了放射科医生对人工智能的看法。结果放射科医生在准确性、敏感性和特异性方面的表现优于AI，但McNemar检验显示AI和放射科医生的表现之间没有统计学上的显著差异（p=0.32）。计算Cohen’s K为0.64。结论与预期相反，我们的初步结果并没有证明患者的预后和报告时间有真正的改善，但证明了AI高NPV（94,62%），并且与放射科医生的表现相比没有劣势。此外，我们研究中使用的商用人工智能算法会自动从数据中学习，因此我们预计性能会逐步提高。人工智能可以被认为是一种很有前途的工具，可以排除骨折（尤其是当用作“第二读者”时），并优先考虑阳性病例，特别是在工作量增加的情况下（ED、夜班），但还需要进一步的研究来评估对临床实践的真正影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

European Journal of Radiology Open Medicine-Radiology, Nuclear Medicine and Imaging

CiteScore

4.10

自引率

5.00%

发文量

审稿时长

51 days