急诊科急性腹痛患者阑尾炎的基于机器学习的预测

IF 6 1区 医学 Q1 EMERGENCY MEDICINE
Anoeska Schipper, Peter Belgers, Rory O’Connor, Kim Ellis Jie, Robin Dooijes, Joeran Sander Bosma, Steef Kurstjens, Ron Kusters, Bram van Ginneken, Matthieu Rutten
{"title":"急诊科急性腹痛患者阑尾炎的基于机器学习的预测","authors":"Anoeska Schipper, Peter Belgers, Rory O’Connor, Kim Ellis Jie, Robin Dooijes, Joeran Sander Bosma, Steef Kurstjens, Ron Kusters, Bram van Ginneken, Matthieu Rutten","doi":"10.1186/s13017-024-00570-7","DOIUrl":null,"url":null,"abstract":"Acute abdominal pain (AAP) constitutes 5–10% of all emergency department (ED) visits, with appendicitis being a prevalent AAP etiology often necessitating surgical intervention. The variability in AAP symptoms and causes, combined with the challenge of identifying appendicitis, complicate timely intervention. To estimate the risk of appendicitis, scoring systems such as the Alvarado score have been developed. However, diagnostic errors and delays remain common. Although various machine learning (ML) models have been proposed to enhance appendicitis detection, none have been seamlessly integrated into the ED workflows for AAP or are specifically designed to diagnose appendicitis as early as possible within the clinical decision-making process. To mimic daily clinical practice, this proof-of-concept study aims to develop ML models that support decision-making using comprehensive clinical data up to key decision points in the ED workflow to detect appendicitis in patients presenting with AAP. Data from the Dutch triage system at the ED, vital signs, complete medical history and physical examination findings and routine laboratory test results were retrospectively extracted from 350 AAP patients presenting to the ED of a Dutch teaching hospital from 2016 to 2023. Two eXtreme Gradient Boosting ML models were developed to differentiate cases with appendicitis from other AAP causes: one model used all data up to and including physical examination, and the other was extended with routine laboratory test results. The performance of both models was evaluated on a validation set (n = 68) and compared to the Alvarado scoring system as well as three ED physicians in a reader study. The ML models achieved AUROCs of 0.919 without laboratory test results and 0.923 with the addition of laboratory test results. The Alvarado scoring system attained an AUROC of 0.824. ED physicians achieved AUROCs of 0.894, 0.826, and 0.791 without laboratory test results, increasing to AUROCs of 0.923, 0.892, and 0.859 with laboratory test results. Both ML models demonstrated comparable high accuracy in predicting appendicitis in patients with AAP, outperforming the Alvarado scoring system. The ML models matched or surpassed ED physician performance in detecting appendicitis, with the largest potential performance gain observed in absence of laboratory test results. Integration could assist ED physicians in early and accurate diagnosis of appendicitis. ","PeriodicalId":48867,"journal":{"name":"World Journal of Emergency Surgery","volume":"148 1","pages":""},"PeriodicalIF":6.0000,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine-learning based prediction of appendicitis for patients presenting with acute abdominal pain at the emergency department\",\"authors\":\"Anoeska Schipper, Peter Belgers, Rory O’Connor, Kim Ellis Jie, Robin Dooijes, Joeran Sander Bosma, Steef Kurstjens, Ron Kusters, Bram van Ginneken, Matthieu Rutten\",\"doi\":\"10.1186/s13017-024-00570-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Acute abdominal pain (AAP) constitutes 5–10% of all emergency department (ED) visits, with appendicitis being a prevalent AAP etiology often necessitating surgical intervention. The variability in AAP symptoms and causes, combined with the challenge of identifying appendicitis, complicate timely intervention. To estimate the risk of appendicitis, scoring systems such as the Alvarado score have been developed. However, diagnostic errors and delays remain common. Although various machine learning (ML) models have been proposed to enhance appendicitis detection, none have been seamlessly integrated into the ED workflows for AAP or are specifically designed to diagnose appendicitis as early as possible within the clinical decision-making process. To mimic daily clinical practice, this proof-of-concept study aims to develop ML models that support decision-making using comprehensive clinical data up to key decision points in the ED workflow to detect appendicitis in patients presenting with AAP. Data from the Dutch triage system at the ED, vital signs, complete medical history and physical examination findings and routine laboratory test results were retrospectively extracted from 350 AAP patients presenting to the ED of a Dutch teaching hospital from 2016 to 2023. Two eXtreme Gradient Boosting ML models were developed to differentiate cases with appendicitis from other AAP causes: one model used all data up to and including physical examination, and the other was extended with routine laboratory test results. The performance of both models was evaluated on a validation set (n = 68) and compared to the Alvarado scoring system as well as three ED physicians in a reader study. The ML models achieved AUROCs of 0.919 without laboratory test results and 0.923 with the addition of laboratory test results. The Alvarado scoring system attained an AUROC of 0.824. ED physicians achieved AUROCs of 0.894, 0.826, and 0.791 without laboratory test results, increasing to AUROCs of 0.923, 0.892, and 0.859 with laboratory test results. Both ML models demonstrated comparable high accuracy in predicting appendicitis in patients with AAP, outperforming the Alvarado scoring system. The ML models matched or surpassed ED physician performance in detecting appendicitis, with the largest potential performance gain observed in absence of laboratory test results. Integration could assist ED physicians in early and accurate diagnosis of appendicitis. \",\"PeriodicalId\":48867,\"journal\":{\"name\":\"World Journal of Emergency Surgery\",\"volume\":\"148 1\",\"pages\":\"\"},\"PeriodicalIF\":6.0000,\"publicationDate\":\"2024-12-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"World Journal of Emergency Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s13017-024-00570-7\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"EMERGENCY MEDICINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Journal of Emergency Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13017-024-00570-7","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EMERGENCY MEDICINE","Score":null,"Total":0}
引用次数: 0

摘要

急性腹痛(AAP)占所有急诊科(ED)就诊的5-10%,阑尾炎是常见的AAP病因,通常需要手术干预。AAP症状和病因的可变性,加上阑尾炎的识别挑战,使及时干预复杂化。为了估计阑尾炎的风险,已经开发了评分系统,如阿尔瓦拉多评分。然而,诊断错误和延误仍然很常见。尽管已经提出了各种机器学习(ML)模型来增强阑尾炎的检测,但没有一种模型能够无缝集成到AAP的ED工作流程中,或者专门设计用于在临床决策过程中尽早诊断阑尾炎。为了模拟日常临床实践,这项概念验证研究旨在开发ML模型,利用综合临床数据支持决策,直至ED工作流程中的关键决策点,以检测AAP患者的阑尾炎。从2016年至2023年在荷兰一家教学医院急诊科就诊的350名AAP患者中,回顾性地提取了荷兰急诊科分诊系统的数据、生命体征、完整的病史、体检结果和常规实验室检查结果。开发了两种极端梯度增强ML模型来区分阑尾炎与其他AAP原因:一种模型使用了包括体检在内的所有数据,另一种模型扩展了常规实验室检查结果。在验证集(n = 68)上对两种模型的性能进行了评估,并与Alvarado评分系统以及读者研究中的三位ED医生进行了比较。ML模型未加实验室检测结果的auroc为0.919,加实验室检测结果的auroc为0.923。Alvarado评分系统的AUROC为0.824。无实验室检测结果的急诊科医师auroc分别为0.894、0.826、0.791,有实验室检测结果的急诊科医师auroc分别为0.923、0.892、0.859。两种ML模型在预测AAP患者阑尾炎方面均表现出相当高的准确性,优于Alvarado评分系统。ML模型在检测阑尾炎方面匹配或超过了ED医生的表现,在没有实验室测试结果的情况下观察到最大的潜在性能增益。整合可以帮助急诊科医师早期准确诊断阑尾炎。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine-learning based prediction of appendicitis for patients presenting with acute abdominal pain at the emergency department
Acute abdominal pain (AAP) constitutes 5–10% of all emergency department (ED) visits, with appendicitis being a prevalent AAP etiology often necessitating surgical intervention. The variability in AAP symptoms and causes, combined with the challenge of identifying appendicitis, complicate timely intervention. To estimate the risk of appendicitis, scoring systems such as the Alvarado score have been developed. However, diagnostic errors and delays remain common. Although various machine learning (ML) models have been proposed to enhance appendicitis detection, none have been seamlessly integrated into the ED workflows for AAP or are specifically designed to diagnose appendicitis as early as possible within the clinical decision-making process. To mimic daily clinical practice, this proof-of-concept study aims to develop ML models that support decision-making using comprehensive clinical data up to key decision points in the ED workflow to detect appendicitis in patients presenting with AAP. Data from the Dutch triage system at the ED, vital signs, complete medical history and physical examination findings and routine laboratory test results were retrospectively extracted from 350 AAP patients presenting to the ED of a Dutch teaching hospital from 2016 to 2023. Two eXtreme Gradient Boosting ML models were developed to differentiate cases with appendicitis from other AAP causes: one model used all data up to and including physical examination, and the other was extended with routine laboratory test results. The performance of both models was evaluated on a validation set (n = 68) and compared to the Alvarado scoring system as well as three ED physicians in a reader study. The ML models achieved AUROCs of 0.919 without laboratory test results and 0.923 with the addition of laboratory test results. The Alvarado scoring system attained an AUROC of 0.824. ED physicians achieved AUROCs of 0.894, 0.826, and 0.791 without laboratory test results, increasing to AUROCs of 0.923, 0.892, and 0.859 with laboratory test results. Both ML models demonstrated comparable high accuracy in predicting appendicitis in patients with AAP, outperforming the Alvarado scoring system. The ML models matched or surpassed ED physician performance in detecting appendicitis, with the largest potential performance gain observed in absence of laboratory test results. Integration could assist ED physicians in early and accurate diagnosis of appendicitis.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
World Journal of Emergency Surgery
World Journal of Emergency Surgery EMERGENCY MEDICINE-SURGERY
CiteScore
14.50
自引率
5.00%
发文量
60
审稿时长
10 weeks
期刊介绍: The World Journal of Emergency Surgery is an open access, peer-reviewed journal covering all facets of clinical and basic research in traumatic and non-traumatic emergency surgery and related fields. Topics include emergency surgery, acute care surgery, trauma surgery, intensive care, trauma management, and resuscitation, among others.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信