预测住院成人静脉血栓栓塞:开发和验证可实现的实时预后模型的协议。

IF 2.6
Henry J Domenico, Benjamin F Tillman, Shari L Just, Yeji Ko, Amanda S Mixon, Asli Weitkamp, Jonathan S Schildcrout, Colin Walsh, Thomas Ortel, Benjamin French
{"title":"预测住院成人静脉血栓栓塞:开发和验证可实现的实时预后模型的协议。","authors":"Henry J Domenico, Benjamin F Tillman, Shari L Just, Yeji Ko, Amanda S Mixon, Asli Weitkamp, Jonathan S Schildcrout, Colin Walsh, Thomas Ortel, Benjamin French","doi":"10.1186/s41512-025-00205-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Hospital-acquired venous thromboembolism (HA-VTE) is a leading cause of morbidity and mortality among hospitalized adults. Numerous prognostic models have been developed to identify those patients with elevated risk of HA-VTE. None, however, has met the necessary criteria to guide clinical decision-making. This study outlines a protocol for refining and validating a general-purpose prognostic model for HA-VTE, designed for real-time automation within the electronic health record (EHR) system.</p><p><strong>Methods: </strong>A retrospective cohort of 132,561 inpatient encounters (89,586 individual patients) at a large academic medical center will be collected, along with clinical and demographic data available as part of routine care. Data for temporal, geographic, and domain external validation cohorts will also be collected. Logistic regression will be used to predict occurrence of HA-VTE during an inpatient encounter. Variables considered for model inclusion will be based on prior demonstrated association with HA-VTE and their availability in both retrospective EHR data and routine clinical care. Least absolute shrinkage and selection operator (LASSO) with tenfold cross-validation will be used for initial variable selection. Variables selected by the LASSO procedure, along with those deemed necessary by clinicians, will be used in an unpenalized multivariable logistic regression model. Discrimination and calibration will be reported for the derivation and validation cohorts. Discrimination will be measured using Harrell's C statistic. Calibration will be measured using calibration intercept, calibration slope, Brier score, integrated calibration index, and visual examination of non-linear calibration curve. Model reporting will adhere to the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis guidelines for clinical prediction models using machine learning methods (TRIPOD + AI).</p><p><strong>Discussion: </strong>We describe methods for developing, evaluating, and validating a prognostic model for HA-VTE using routinely collected EHR data. By combining best practices in statistical development and validation, knowledge engineering, and clinical domain knowledge, the resulting model should be well suited for real-time clinical implementation. Although this protocol describes our development of a model for HA-VTE, the general approach can be applied to other clinical outcomes.</p>","PeriodicalId":72800,"journal":{"name":"Diagnostic and prognostic research","volume":"9 1","pages":"19"},"PeriodicalIF":2.6000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12416065/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predicting venous thromboembolism among hospitalized adults: a protocol for development and validation of an implementable real-time prognostic model.\",\"authors\":\"Henry J Domenico, Benjamin F Tillman, Shari L Just, Yeji Ko, Amanda S Mixon, Asli Weitkamp, Jonathan S Schildcrout, Colin Walsh, Thomas Ortel, Benjamin French\",\"doi\":\"10.1186/s41512-025-00205-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Hospital-acquired venous thromboembolism (HA-VTE) is a leading cause of morbidity and mortality among hospitalized adults. Numerous prognostic models have been developed to identify those patients with elevated risk of HA-VTE. None, however, has met the necessary criteria to guide clinical decision-making. This study outlines a protocol for refining and validating a general-purpose prognostic model for HA-VTE, designed for real-time automation within the electronic health record (EHR) system.</p><p><strong>Methods: </strong>A retrospective cohort of 132,561 inpatient encounters (89,586 individual patients) at a large academic medical center will be collected, along with clinical and demographic data available as part of routine care. Data for temporal, geographic, and domain external validation cohorts will also be collected. Logistic regression will be used to predict occurrence of HA-VTE during an inpatient encounter. Variables considered for model inclusion will be based on prior demonstrated association with HA-VTE and their availability in both retrospective EHR data and routine clinical care. Least absolute shrinkage and selection operator (LASSO) with tenfold cross-validation will be used for initial variable selection. Variables selected by the LASSO procedure, along with those deemed necessary by clinicians, will be used in an unpenalized multivariable logistic regression model. Discrimination and calibration will be reported for the derivation and validation cohorts. Discrimination will be measured using Harrell's C statistic. Calibration will be measured using calibration intercept, calibration slope, Brier score, integrated calibration index, and visual examination of non-linear calibration curve. Model reporting will adhere to the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis guidelines for clinical prediction models using machine learning methods (TRIPOD + AI).</p><p><strong>Discussion: </strong>We describe methods for developing, evaluating, and validating a prognostic model for HA-VTE using routinely collected EHR data. By combining best practices in statistical development and validation, knowledge engineering, and clinical domain knowledge, the resulting model should be well suited for real-time clinical implementation. Although this protocol describes our development of a model for HA-VTE, the general approach can be applied to other clinical outcomes.</p>\",\"PeriodicalId\":72800,\"journal\":{\"name\":\"Diagnostic and prognostic research\",\"volume\":\"9 1\",\"pages\":\"19\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12416065/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Diagnostic and prognostic research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/s41512-025-00205-8\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diagnostic and prognostic research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s41512-025-00205-8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

背景:医院获得性静脉血栓栓塞(HA-VTE)是住院成人发病率和死亡率的主要原因。已经开发了许多预后模型来识别HA-VTE风险升高的患者。然而,没有一个达到指导临床决策的必要标准。本研究概述了一种改进和验证HA-VTE通用预后模型的方案,该模型设计用于电子健康记录(EHR)系统中的实时自动化。方法:在一个大型学术医疗中心收集132561例住院患者(89586例个体患者)的回顾性队列,以及作为常规护理一部分的临床和人口统计数据。还将收集时间、地理和领域外部验证队列的数据。逻辑回归将用于预测住院期间HA-VTE的发生。考虑纳入模型的变量将基于先前证明的与HA-VTE的关联,以及它们在回顾性电子病历数据和常规临床护理中的可用性。最小绝对收缩和选择算子(LASSO)与十倍交叉验证将用于初始变量选择。LASSO程序选择的变量,以及临床医生认为必要的变量,将用于无惩罚的多变量逻辑回归模型。将报告衍生和验证队列的鉴别和校准。歧视将使用Harrell的C统计量来衡量。校准将使用校准截距、校准斜率、Brier评分、综合校准指数和非线性校准曲线的目视检查来测量。模型报告将遵循使用机器学习方法(TRIPOD + AI)的临床预测模型的透明报告(Transparent reporting of a multivariable prediction Model for Individual Prognosis Or Diagnosis)指南。讨论:我们描述了利用常规收集的电子病历数据开发、评估和验证HA-VTE预后模型的方法。通过结合统计开发和验证、知识工程和临床领域知识的最佳实践,得到的模型应该非常适合于实时临床实现。虽然该方案描述了我们对HA-VTE模型的开发,但一般方法可以应用于其他临床结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predicting venous thromboembolism among hospitalized adults: a protocol for development and validation of an implementable real-time prognostic model.

Background: Hospital-acquired venous thromboembolism (HA-VTE) is a leading cause of morbidity and mortality among hospitalized adults. Numerous prognostic models have been developed to identify those patients with elevated risk of HA-VTE. None, however, has met the necessary criteria to guide clinical decision-making. This study outlines a protocol for refining and validating a general-purpose prognostic model for HA-VTE, designed for real-time automation within the electronic health record (EHR) system.

Methods: A retrospective cohort of 132,561 inpatient encounters (89,586 individual patients) at a large academic medical center will be collected, along with clinical and demographic data available as part of routine care. Data for temporal, geographic, and domain external validation cohorts will also be collected. Logistic regression will be used to predict occurrence of HA-VTE during an inpatient encounter. Variables considered for model inclusion will be based on prior demonstrated association with HA-VTE and their availability in both retrospective EHR data and routine clinical care. Least absolute shrinkage and selection operator (LASSO) with tenfold cross-validation will be used for initial variable selection. Variables selected by the LASSO procedure, along with those deemed necessary by clinicians, will be used in an unpenalized multivariable logistic regression model. Discrimination and calibration will be reported for the derivation and validation cohorts. Discrimination will be measured using Harrell's C statistic. Calibration will be measured using calibration intercept, calibration slope, Brier score, integrated calibration index, and visual examination of non-linear calibration curve. Model reporting will adhere to the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis guidelines for clinical prediction models using machine learning methods (TRIPOD + AI).

Discussion: We describe methods for developing, evaluating, and validating a prognostic model for HA-VTE using routinely collected EHR data. By combining best practices in statistical development and validation, knowledge engineering, and clinical domain knowledge, the resulting model should be well suited for real-time clinical implementation. Although this protocol describes our development of a model for HA-VTE, the general approach can be applied to other clinical outcomes.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
审稿时长
18 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信