Large language model-based identification of venous thromboembolism diagnostic delays

IF 2.3 4区 医学 Q1 MEDICINE, GENERAL & INTERNAL
Journal of hospital medicine Pub Date : 2026-04-09 Epub Date: 2025-10-07 DOI:10.1002/jhm.70194
Verity Schaye MD, MHPE, Daniel J. Sartori MD, Lexi Signoriello PhD, Kiran Malhotra MD, Benedict Guzman MS, Bijal Rajput MD, Ilan Reinstein MS, Jesse Burk-Rafel MD, MRes
{"title":"Large language model-based identification of venous thromboembolism diagnostic delays","authors":"Verity Schaye MD, MHPE,&nbsp;Daniel J. Sartori MD,&nbsp;Lexi Signoriello PhD,&nbsp;Kiran Malhotra MD,&nbsp;Benedict Guzman MS,&nbsp;Bijal Rajput MD,&nbsp;Ilan Reinstein MS,&nbsp;Jesse Burk-Rafel MD, MRes","doi":"10.1002/jhm.70194","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>Delayed diagnosis of venous thromboembolism (VTE) is prevalent among hospitalized patients, yet case identification is challenging and feedback limited.</p>\n </section>\n \n <section>\n \n <h3> Objective</h3>\n \n <p>To develop a large language model (LLM)-based electronic-trigger to identify VTE diagnostic delays.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>All admissions to internal medicine (IM) residents at NYU Langone Health between January 2022 and December 2023 (<i>n</i> = 20,843) were included. Using an open-source LLM, prompts were validated to detect (1) residents considering VTE in admission notes and (2) VTE confirmation in five types of imaging reports (<i>n</i> = 100 for each prompt validation set). The validated prompts were applied to determine discordance between admission note differential omitting VTE and imaging report confirming VTE. Two hospitalists reviewed discordant cases using a validated tool to identify diagnostic delays. Hospitalizations were labeled as diagnostic delays, in-hospital complication, or false-positive. Based on in-hospital complication and false-positive patterns, exclusion criteria were implemented. Positive predictive value (PPV) and negative predictive value (NPV) were calculated.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The LLM prompts correctly classified admission notes and VTE imaging studies with high accuracy (range 98%–100%, <i>n</i> = 699 VTE cases identified). Of the 137 diagnostic delays the LLM-based electronic-trigger identified, 31 were true-positives, 60 in-hospital complications, and 46 false-positives. 4.4% of all VTE hospitalizations had a diagnostic delay. With the exclusion criteria, the PPV was 48% (95% confidence interval [CI], 35%–62%) and NPV was 95% (95% CI, 87%–98%).</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>We developed the first LLM-based electronic-trigger to identify VTE diagnostic delays, with higher performance than existing non-LLM electronic-triggers. LLM-based approaches can facilitate diagnostic performance feedback and are scalable to other conditions and institutions.</p>\n </section>\n </div>","PeriodicalId":15883,"journal":{"name":"Journal of hospital medicine","volume":"21 4","pages":"391-401"},"PeriodicalIF":2.3000,"publicationDate":"2026-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of hospital medicine","FirstCategoryId":"3","ListUrlMain":"https://shmpublications.onlinelibrary.wiley.com/doi/10.1002/jhm.70194","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/10/7 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Delayed diagnosis of venous thromboembolism (VTE) is prevalent among hospitalized patients, yet case identification is challenging and feedback limited.

Objective

To develop a large language model (LLM)-based electronic-trigger to identify VTE diagnostic delays.

Methods

All admissions to internal medicine (IM) residents at NYU Langone Health between January 2022 and December 2023 (n = 20,843) were included. Using an open-source LLM, prompts were validated to detect (1) residents considering VTE in admission notes and (2) VTE confirmation in five types of imaging reports (n = 100 for each prompt validation set). The validated prompts were applied to determine discordance between admission note differential omitting VTE and imaging report confirming VTE. Two hospitalists reviewed discordant cases using a validated tool to identify diagnostic delays. Hospitalizations were labeled as diagnostic delays, in-hospital complication, or false-positive. Based on in-hospital complication and false-positive patterns, exclusion criteria were implemented. Positive predictive value (PPV) and negative predictive value (NPV) were calculated.

Results

The LLM prompts correctly classified admission notes and VTE imaging studies with high accuracy (range 98%–100%, n = 699 VTE cases identified). Of the 137 diagnostic delays the LLM-based electronic-trigger identified, 31 were true-positives, 60 in-hospital complications, and 46 false-positives. 4.4% of all VTE hospitalizations had a diagnostic delay. With the exclusion criteria, the PPV was 48% (95% confidence interval [CI], 35%–62%) and NPV was 95% (95% CI, 87%–98%).

Conclusions

We developed the first LLM-based electronic-trigger to identify VTE diagnostic delays, with higher performance than existing non-LLM electronic-triggers. LLM-based approaches can facilitate diagnostic performance feedback and are scalable to other conditions and institutions.

基于大语言模型的静脉血栓栓塞诊断延迟识别。
背景:静脉血栓栓塞(VTE)的延迟诊断在住院患者中很普遍,但病例识别具有挑战性,反馈有限。目的:开发一种基于大语言模型(LLM)的VTE诊断延迟的电子触发器。方法:纳入2022年1月至2023年12月期间NYU Langone Health所有住院内科(IM)住院医师(n = 20,843)。使用开源LLM,对提示进行验证,以检测(1)住院医师在入院记录中考虑VTE,(2)在五种类型的成像报告中确认VTE(每个提示验证集n = 100)。应用经过验证的提示来确定忽略VTE的入院记录差异与确认VTE的影像学报告之间的不一致。两名医院医生使用经过验证的工具审查了不一致的病例,以确定诊断延误。住院治疗被标记为诊断延误、院内并发症或假阳性。根据院内并发症和假阳性模式,实施排除标准。计算阳性预测值(PPV)和阴性预测值(NPV)。结果:LLM提示正确分类的入院记录和VTE成像研究具有很高的准确性(范围98%-100%,n = 699例VTE确诊病例)。在基于llm的电子触发器识别的137例诊断延迟中,31例为真阳性,60例为院内并发症,46例为假阳性。所有静脉血栓栓塞住院患者中有4.4%的诊断延迟。根据排除标准,PPV为48%(95%可信区间[CI], 35%-62%), NPV为95% (95% CI, 87%-98%)。结论:我们开发了第一个基于llm的电子触发器来识别静脉血栓栓塞诊断延迟,其性能比现有的非llm电子触发器更高。基于法学硕士的方法可以促进诊断性能反馈,并可扩展到其他条件和机构。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of hospital medicine
Journal of hospital medicine 医学-医学:内科
CiteScore
4.40
自引率
11.50%
发文量
233
审稿时长
4-8 weeks
期刊介绍: JHM is a peer-reviewed publication of the Society of Hospital Medicine and is published 12 times per year. JHM publishes manuscripts that address the care of hospitalized adults or children. Broad areas of interest include (1) Treatments for common inpatient conditions; (2) Approaches to improving perioperative care; (3) Improving care for hospitalized patients with geriatric or pediatric vulnerabilities (such as mobility problems, or those with complex longitudinal care); (4) Evaluation of innovative healthcare delivery or educational models; (5) Approaches to improving the quality, safety, and value of healthcare across the acute- and postacute-continuum of care; and (6) Evaluation of policy and payment changes that affect hospital and postacute care.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书