Zero-shot large language model application for surgical site infection auditing.

Shrirajh Satheakeerthy, Brandon Stretton, James Tsimiklis, Andrew Ec Booth, Sarah Howson, Shaun Evans, Christina Guo, Joshua Kovoor, Aashray Gupta, Christina Gao, Weng Onn Chan, Tim French, Amelia Demopoulos, Alyssa Pradhan, Samuel Gluck, Toby Gilbert, Matthew Blake Roberts, Camille Kotton, Stephen Bacchi
{"title":"Zero-shot large language model application for surgical site infection auditing.","authors":"Shrirajh Satheakeerthy, Brandon Stretton, James Tsimiklis, Andrew Ec Booth, Sarah Howson, Shaun Evans, Christina Guo, Joshua Kovoor, Aashray Gupta, Christina Gao, Weng Onn Chan, Tim French, Amelia Demopoulos, Alyssa Pradhan, Samuel Gluck, Toby Gilbert, Matthew Blake Roberts, Camille Kotton, Stephen Bacchi","doi":"10.1016/j.idh.2025.05.001","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Artificial intelligence, in particular large language models (LLM), may be able to assist with monitoring for surgical site infections (SSI).</p><p><strong>Method: </strong>This retrospective study involved the application of the Llama 3.0 70-billion parameter model to the identification of SSI in a group of all SSI in two metropolitan hospitals from a 4-month period. Randomly selected control patients were chosen as comparators. Clinical inpatient and outpatient progress notes were provided to the LLM individually and classified as indicating an SSI or not. These classifications were then analysed to determine binary performance characteristics and to determine the timing of positive case classification.</p><p><strong>Results: </strong>There was a total of 28 cases in the study, 14 in the case (SSI) group and 14 in the control group. The operations involved in the SSI cases were caesarean section (12/14, 85.7 %) and arthroplasty (2/14, 14.2 %). The LLM had an overall accuracy at the patient-level of 26/28 (93 %). There was a sensitivity of 100 % and specificity of 86%. At the note-level, for the first note flagged by the LLM for each case, 13/14 (92.3 %) were on the same day as, or before, the date noted as the onset of infection as identified by infection control clinicians.</p><p><strong>Conclusions: </strong>The use of LLM for the screening of medical notes for SSI is feasible. Further studies may seek to evaluate the outcomes of LLM when deployed as part of a clinical workflow.</p>","PeriodicalId":94040,"journal":{"name":"Infection, disease & health","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infection, disease & health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.idh.2025.05.001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Artificial intelligence, in particular large language models (LLM), may be able to assist with monitoring for surgical site infections (SSI).

Method: This retrospective study involved the application of the Llama 3.0 70-billion parameter model to the identification of SSI in a group of all SSI in two metropolitan hospitals from a 4-month period. Randomly selected control patients were chosen as comparators. Clinical inpatient and outpatient progress notes were provided to the LLM individually and classified as indicating an SSI or not. These classifications were then analysed to determine binary performance characteristics and to determine the timing of positive case classification.

Results: There was a total of 28 cases in the study, 14 in the case (SSI) group and 14 in the control group. The operations involved in the SSI cases were caesarean section (12/14, 85.7 %) and arthroplasty (2/14, 14.2 %). The LLM had an overall accuracy at the patient-level of 26/28 (93 %). There was a sensitivity of 100 % and specificity of 86%. At the note-level, for the first note flagged by the LLM for each case, 13/14 (92.3 %) were on the same day as, or before, the date noted as the onset of infection as identified by infection control clinicians.

Conclusions: The use of LLM for the screening of medical notes for SSI is feasible. Further studies may seek to evaluate the outcomes of LLM when deployed as part of a clinical workflow.

零射击大语言模型在手术部位感染审计中的应用。
人工智能,特别是大型语言模型(LLM),可能能够协助监测手术部位感染(SSI)。方法:本回顾性研究采用Llama 3.0 700亿参数模型对两家城市医院4个月的SSI患者进行SSI识别。随机选取对照患者作为对照。临床住院和门诊进展记录分别提供给LLM,并分类为是否指示SSI。然后对这些分类进行分析,以确定二进制性能特征并确定阳性病例分类的时间。结果:本组共28例,病例(SSI)组14例,对照组14例。SSI病例的手术包括剖宫产(12/14,85.7%)和关节置换术(2/14,14.2%)。LLM在患者水平上的总体准确率为26/28(93%)。敏感性为100%,特异性为86%。在记录水平上,对于每个病例LLM标记的第一个记录,13/14(92.3%)与感染控制临床医生确定的感染发病日期相同或之前。结论:利用LLM对SSI病历进行筛选是可行的。进一步的研究可能会寻求评估LLM作为临床工作流程的一部分时的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信