AUTOMATED IDENTIFICATION OF HOSPITAL-ACQUIRED VENOUS THROMBOEMBOLISM

P. Beeler
{"title":"AUTOMATED IDENTIFICATION OF HOSPITAL-ACQUIRED VENOUS THROMBOEMBOLISM","authors":"P. Beeler","doi":"10.4414/SMI.32.00354","DOIUrl":null,"url":null,"abstract":"Introduction Venous thromboembolism (VTE) as a hospital-acquired condition (HAC) – i.e. not ‘present on admission’ (POA) – is a potentially preventable complication. A decrease of HAC VTE events indicates success of efforts to prevent VTE in hospitalized patients. However, so far, costly chart reviews were needed to identify patients with HAC VTE. We investigated whether electronic health record data such as medication orders and their temporal relations allow for differentiating between HAC and POA. Therefore, we modeled a tree and two random forests and evaluated the automated classification of HAC VTE. Methods All inpatients with a length of stay of ≥24 hours (h), discharged from the Brigham and Women’s Hospital, a large tertiary care hospital in Boston, MA, between January 2009 and April 2014 were searched for ICD-9 diagnosis codes of acute venous thrombosis or pulmonary embolism. Patients were included who had VTE in the admitting diagnosis field – defined as POA VTE – or in one of up to 50 discharge diagnoses. Of those, only patients who received heparin, dalteparin, enoxaparin, alteplase, rivaroxaban or fondaparinux were considered, and the time from admission to the first order was calculated for each drug. Additionally included predictors: dose information, demographics (age, gender, race, language), length of stay, admission service, discharge service, transfer destination of the patient after discharge, and whether the patient was alive or died during the hospitalization or within 30 days after discharge. A single tree and two random forests (each with 5,000 trees) were generated to analyze the predictors and to assess the predictive power of the chosen approach. Since medication orders are electronically available in real time, such prospective predictors may have implications for clinical decision support – therefore, prospective predictors (i.e. demographics, admission service, time to order a drug, route and dose information for each drug) were separately analyzed in the first random forest. Half of the data served as calibration set, half as validation set. Statistical computing was performed using the software R version 3.1.0 (R Foundation for Statistical Computing, Vienna, Austria). Results A total of 5,374 patient stays featured a VTE diagnosis with a defined drug order. If VTE was POA (n=1,262; 23.5%), the median time to order one of the aforementioned drugs was 2.5h (IQR 1.3-5.0h). Among HAC VTE cases without an admitting diagnosis of VTE (n=4,112; 76.5%), the median time to order the drug was 4.2h (IQR 1.7-18.2h). Unsurprisingly, a single tree – after cross-validation and pruning – identified the time from admission to the ordering of intravenous (IV) heparin as the most significant predictor (Fig. 1). This tree’s validation resulted in an accuracy of 78.8% and a positive predictive value (PPV) of 83.3% for the classification of HAC VTE. The first validated random forest used predictors which are available in real time: the forest had an accuracy of 79.7% and a PPV of 85.3% for the classification of HAC VTE. The second validated random forest considered all variables and resulted in an accuracy of 81.7% and a PPV of 87.8% (variables’ importance is shown in Fig. 2). Discussion We modeled a tree and two random forests using structured data predictors to differentiate between HAC and POA VTE. Our validated tree (Fig. 1), considering the first order for IV heparin and the length of stay, could immediately be implemented as a first step to identifying HAC VTE patients. However, the random forests performed better, even when exclusively prospective predictors were used – and such real time models may have implications for clinical decision support tools. In conclusion, our random forests could help to evaluate interventions to improve thromboprophylaxis regimens for inpatients, where costly chart reviews are needed to differentiate between POA VTE and potentially preventable complications.","PeriodicalId":156842,"journal":{"name":"Swiss medical informatics","volume":"102 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Swiss medical informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4414/SMI.32.00354","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction Venous thromboembolism (VTE) as a hospital-acquired condition (HAC) – i.e. not ‘present on admission’ (POA) – is a potentially preventable complication. A decrease of HAC VTE events indicates success of efforts to prevent VTE in hospitalized patients. However, so far, costly chart reviews were needed to identify patients with HAC VTE. We investigated whether electronic health record data such as medication orders and their temporal relations allow for differentiating between HAC and POA. Therefore, we modeled a tree and two random forests and evaluated the automated classification of HAC VTE. Methods All inpatients with a length of stay of ≥24 hours (h), discharged from the Brigham and Women’s Hospital, a large tertiary care hospital in Boston, MA, between January 2009 and April 2014 were searched for ICD-9 diagnosis codes of acute venous thrombosis or pulmonary embolism. Patients were included who had VTE in the admitting diagnosis field – defined as POA VTE – or in one of up to 50 discharge diagnoses. Of those, only patients who received heparin, dalteparin, enoxaparin, alteplase, rivaroxaban or fondaparinux were considered, and the time from admission to the first order was calculated for each drug. Additionally included predictors: dose information, demographics (age, gender, race, language), length of stay, admission service, discharge service, transfer destination of the patient after discharge, and whether the patient was alive or died during the hospitalization or within 30 days after discharge. A single tree and two random forests (each with 5,000 trees) were generated to analyze the predictors and to assess the predictive power of the chosen approach. Since medication orders are electronically available in real time, such prospective predictors may have implications for clinical decision support – therefore, prospective predictors (i.e. demographics, admission service, time to order a drug, route and dose information for each drug) were separately analyzed in the first random forest. Half of the data served as calibration set, half as validation set. Statistical computing was performed using the software R version 3.1.0 (R Foundation for Statistical Computing, Vienna, Austria). Results A total of 5,374 patient stays featured a VTE diagnosis with a defined drug order. If VTE was POA (n=1,262; 23.5%), the median time to order one of the aforementioned drugs was 2.5h (IQR 1.3-5.0h). Among HAC VTE cases without an admitting diagnosis of VTE (n=4,112; 76.5%), the median time to order the drug was 4.2h (IQR 1.7-18.2h). Unsurprisingly, a single tree – after cross-validation and pruning – identified the time from admission to the ordering of intravenous (IV) heparin as the most significant predictor (Fig. 1). This tree’s validation resulted in an accuracy of 78.8% and a positive predictive value (PPV) of 83.3% for the classification of HAC VTE. The first validated random forest used predictors which are available in real time: the forest had an accuracy of 79.7% and a PPV of 85.3% for the classification of HAC VTE. The second validated random forest considered all variables and resulted in an accuracy of 81.7% and a PPV of 87.8% (variables’ importance is shown in Fig. 2). Discussion We modeled a tree and two random forests using structured data predictors to differentiate between HAC and POA VTE. Our validated tree (Fig. 1), considering the first order for IV heparin and the length of stay, could immediately be implemented as a first step to identifying HAC VTE patients. However, the random forests performed better, even when exclusively prospective predictors were used – and such real time models may have implications for clinical decision support tools. In conclusion, our random forests could help to evaluate interventions to improve thromboprophylaxis regimens for inpatients, where costly chart reviews are needed to differentiate between POA VTE and potentially preventable complications.
医院获得性静脉血栓栓塞的自动识别
静脉血栓栓塞(VTE)作为一种医院获得性疾病(HAC)——即入院时不存在(POA)——是一种潜在的可预防的并发症。HAC静脉血栓栓塞事件的减少表明预防住院患者静脉血栓栓塞的努力取得了成功。然而,到目前为止,需要昂贵的图表审查来确定HAC静脉血栓栓塞患者。我们调查了电子健康记录数据,如药物订单及其时间关系是否允许区分HAC和POA。因此,我们建立了一个树和两个随机森林模型,并对HAC VTE的自动分类进行了评估。方法检索2009年1月至2014年4月美国马萨诸塞州波士顿布里格姆妇女医院(Brigham and Women’s Hospital)出院的住院时间≥24小时(h)的患者,检索急性静脉血栓形成或肺栓塞的ICD-9诊断代码。患者包括在入院诊断领域有静脉血栓栓塞(定义为POA静脉血栓栓塞)或在多达50个出院诊断之一的患者。其中,仅考虑接受肝素、达特帕林、依诺肝素、阿替普酶、利伐沙班或氟达肝素治疗的患者,并计算每种药物从入院到第一次下单的时间。另外纳入的预测因素包括:剂量信息、人口统计学(年龄、性别、种族、语言)、住院时间、入院服务、出院服务、出院后患者的转院目的地,以及患者在住院期间或出院后30天内是否存活或死亡。生成了一棵树和两个随机森林(每个森林有5000棵树)来分析预测因子并评估所选方法的预测能力。由于药物订单是实时电子可获得的,这些前瞻性预测因素可能对临床决策支持有影响——因此,前瞻性预测因素(即人口统计学、入院服务、订购药物的时间、每种药物的路线和剂量信息)在第一个随机森林中分别进行了分析。一半的数据作为校准集,一半作为验证集。统计计算采用R 3.1.0版本软件(R Foundation for Statistical computing, Vienna, Austria)。结果共有5374例患者被诊断为静脉血栓栓塞,并有明确的药物处方。如果VTE为POA (n=1,262;23.5%),订购上述药物的中位时间为2.5h (IQR为1.3 ~ 5.0h)。未确诊为静脉血栓栓塞的HAC静脉血栓栓塞病例(n=4,112;76.5%),下单时间中位数为4.2h (IQR为1.7 ~ 18.2h)。不出所料,经过交叉验证和修剪后,单一树确定了从入院到静脉注射肝素的时间是最重要的预测因子(图1)。该树的验证导致HAC VTE分类的准确率为78.8%,阳性预测值(PPV)为83.3%。第一个经过验证的随机森林使用了实时可用的预测因子:该森林对HAC VTE分类的准确率为79.7%,PPV为85.3%。第二个经过验证的随机森林考虑了所有变量,结果准确率为81.7%,PPV为87.8%(变量的重要性如图2所示)。我们使用结构化数据预测器对树和两个随机森林进行建模,以区分HAC和POA VTE。我们验证的树(图1),考虑到静脉注射肝素的第一次订单和住院时间,可以立即实施作为识别HAC VTE患者的第一步。然而,随机森林的表现更好,即使只使用前瞻性预测因子——这种实时模型可能对临床决策支持工具有影响。总之,我们的随机森林可以帮助评估干预措施,以改善住院患者的血栓预防方案,其中需要昂贵的图表审查来区分POA静脉血栓栓塞和潜在可预防的并发症。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信