Praveen M Yogendra, Adriel Guang Wei Goh, Sze Ying Yee, Freda Jawan, Kelvin Kay Nguan Koh, Timothy Shao Ern Tan, Tian Kai Woon, Phey Ming Yeap, Min On Tan
{"title":"使用和不使用人工智能的放射科医师和住院医师检测小儿阑尾骨折的准确性。","authors":"Praveen M Yogendra, Adriel Guang Wei Goh, Sze Ying Yee, Freda Jawan, Kelvin Kay Nguan Koh, Timothy Shao Ern Tan, Tian Kai Woon, Phey Ming Yeap, Min On Tan","doi":"10.1136/bmjhci-2024-101091","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>We aim to evaluate the accuracy of radiologists and radiology residents in the detection of paediatric appendicular fractures with and without the help of a commercially available fracture detection artificial intelligence (AI) solution in the hopes of showing potential clinical benefits in a general hospital setting.</p><p><strong>Methods: </strong>This was a retrospective study involving three associate consultants (AC) and three senior residents (SR) in radiology, who acted as readers. One reader from each human group interpreted the radiographs with the aid of AI. Cases were categorised into concordant and discordant cases between each interpreting group. Discordant cases were further evaluated by three independent subspecialty radiology consultants to determine the final diagnosis. A total of 500 anonymised paediatric patient cases (aged 2-15 years) who presented to a tertiary general hospital with a Children's emergency were retrospectively collected. Main outcome measures include the presence of fracture, accuracy of readers with and without AI, and total time taken to interpret the radiographs.</p><p><strong>Results: </strong>The AI solution alone showed the highest accuracy (area under the receiver operating characteristic curve 0.97; AC: 95% CI -0.055 to 0.320, p=0; SR: 95% CI 0.244 to 0.598, p=0). The two readers aided with AI had higher area under curves compared with readers without AI support (AC: 95% CI -0.303 to 0.465, p=0; SR: 95% CI -0.154 to 0.331, p=0). These differences were statistically significant.</p><p><strong>Conclusion: </strong>Our study demonstrates excellent results in the detection of paediatric appendicular fractures using a commercially available AI solution. There is potential for the AI solution to function autonomously.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"31 1","pages":""},"PeriodicalIF":4.1000,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11624698/pdf/","citationCount":"0","resultStr":"{\"title\":\"Accuracy of radiologists and radiology residents in detection of paediatric appendicular fractures with and without artificial intelligence.\",\"authors\":\"Praveen M Yogendra, Adriel Guang Wei Goh, Sze Ying Yee, Freda Jawan, Kelvin Kay Nguan Koh, Timothy Shao Ern Tan, Tian Kai Woon, Phey Ming Yeap, Min On Tan\",\"doi\":\"10.1136/bmjhci-2024-101091\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>We aim to evaluate the accuracy of radiologists and radiology residents in the detection of paediatric appendicular fractures with and without the help of a commercially available fracture detection artificial intelligence (AI) solution in the hopes of showing potential clinical benefits in a general hospital setting.</p><p><strong>Methods: </strong>This was a retrospective study involving three associate consultants (AC) and three senior residents (SR) in radiology, who acted as readers. One reader from each human group interpreted the radiographs with the aid of AI. Cases were categorised into concordant and discordant cases between each interpreting group. Discordant cases were further evaluated by three independent subspecialty radiology consultants to determine the final diagnosis. A total of 500 anonymised paediatric patient cases (aged 2-15 years) who presented to a tertiary general hospital with a Children's emergency were retrospectively collected. Main outcome measures include the presence of fracture, accuracy of readers with and without AI, and total time taken to interpret the radiographs.</p><p><strong>Results: </strong>The AI solution alone showed the highest accuracy (area under the receiver operating characteristic curve 0.97; AC: 95% CI -0.055 to 0.320, p=0; SR: 95% CI 0.244 to 0.598, p=0). The two readers aided with AI had higher area under curves compared with readers without AI support (AC: 95% CI -0.303 to 0.465, p=0; SR: 95% CI -0.154 to 0.331, p=0). These differences were statistically significant.</p><p><strong>Conclusion: </strong>Our study demonstrates excellent results in the detection of paediatric appendicular fractures using a commercially available AI solution. There is potential for the AI solution to function autonomously.</p>\",\"PeriodicalId\":9050,\"journal\":{\"name\":\"BMJ Health & Care Informatics\",\"volume\":\"31 1\",\"pages\":\"\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2024-12-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11624698/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMJ Health & Care Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1136/bmjhci-2024-101091\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMJ Health & Care Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1136/bmjhci-2024-101091","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
摘要
目的:我们旨在评估放射科医生和放射科住院医师在使用和不使用市售骨折检测人工智能(AI)解决方案的情况下检测儿科阑尾骨折的准确性,以期在综合医院环境中显示潜在的临床效益。方法:本研究是一项回顾性研究,涉及三名放射学副顾问(AC)和三名高级住院医师(SR),他们作为读者。每组一名读者在人工智能的帮助下解读x光片。在每个口译组之间将案例分为和谐案例和不和谐案例。不一致的病例由三位独立的亚专科放射学顾问进一步评估,以确定最终诊断。回顾性收集了一家三级综合医院因儿童急诊就诊的500例匿名儿童患者(2-15岁)。主要的结果测量包括骨折的存在,使用和不使用人工智能阅读器的准确性,以及解释x线片所花费的总时间。结果:单独使用人工智能溶液准确度最高(受试者工作特征曲线下面积0.97;AC: 95% CI -0.055 ~ 0.320, p=0;SR: 95% CI 0.244 ~ 0.598, p=0)。与没有人工智能支持的读者相比,有人工智能辅助的两名读者的曲线下面积更高(AC: 95% CI -0.303至0.465,p=0;SR: 95% CI -0.154 ~ 0.331, p=0)。这些差异具有统计学意义。结论:我们的研究表明,使用市售的人工智能解决方案在检测儿科阑尾骨折方面取得了良好的效果。人工智能解决方案有可能实现自主功能。
Accuracy of radiologists and radiology residents in detection of paediatric appendicular fractures with and without artificial intelligence.
Objectives: We aim to evaluate the accuracy of radiologists and radiology residents in the detection of paediatric appendicular fractures with and without the help of a commercially available fracture detection artificial intelligence (AI) solution in the hopes of showing potential clinical benefits in a general hospital setting.
Methods: This was a retrospective study involving three associate consultants (AC) and three senior residents (SR) in radiology, who acted as readers. One reader from each human group interpreted the radiographs with the aid of AI. Cases were categorised into concordant and discordant cases between each interpreting group. Discordant cases were further evaluated by three independent subspecialty radiology consultants to determine the final diagnosis. A total of 500 anonymised paediatric patient cases (aged 2-15 years) who presented to a tertiary general hospital with a Children's emergency were retrospectively collected. Main outcome measures include the presence of fracture, accuracy of readers with and without AI, and total time taken to interpret the radiographs.
Results: The AI solution alone showed the highest accuracy (area under the receiver operating characteristic curve 0.97; AC: 95% CI -0.055 to 0.320, p=0; SR: 95% CI 0.244 to 0.598, p=0). The two readers aided with AI had higher area under curves compared with readers without AI support (AC: 95% CI -0.303 to 0.465, p=0; SR: 95% CI -0.154 to 0.331, p=0). These differences were statistically significant.
Conclusion: Our study demonstrates excellent results in the detection of paediatric appendicular fractures using a commercially available AI solution. There is potential for the AI solution to function autonomously.