Development and validation of predictive models for COVID-19 outcomes in a safety-net hospital population

Journal of the American Medical Informatics Association : JAMIA Pub Date : 2022-04-20 DOI:10.1093/jamia/ocac062

Boran Hao, Yang Hu, Shahabeddin Sotudian, Zahra Zad, W. Adams, S. Assoumou, Heather E. Hsu, Rebecca G Mishuris, I. Paschalidis

{"title":"Development and validation of predictive models for COVID-19 outcomes in a safety-net hospital population","authors":"Boran Hao, Yang Hu, Shahabeddin Sotudian, Zahra Zad, W. Adams, S. Assoumou, Heather E. Hsu, Rebecca G Mishuris, I. Paschalidis","doi":"10.1093/jamia/ocac062","DOIUrl":null,"url":null,"abstract":"Abstract Objective To develop predictive models of coronavirus disease 2019 (COVID-19) outcomes, elucidate the influence of socioeconomic factors, and assess algorithmic racial fairness using a racially diverse patient population with high social needs. Materials and Methods Data included 7,102 patients with positive (RT-PCR) severe acute respiratory syndrome coronavirus 2 test at a safety-net system in Massachusetts. Linear and nonlinear classification methods were applied. A score based on a recurrent neural network and a transformer architecture was developed to capture the dynamic evolution of vital signs. Combined with patient characteristics, clinical variables, and hospital occupancy measures, this dynamic vital score was used to train predictive models. Results Hospitalizations can be predicted with an area under the receiver-operating characteristic curve (AUC) of 92% using symptoms, hospital occupancy, and patient characteristics, including social determinants of health. Parsimonious models to predict intensive care, mechanical ventilation, and mortality that used the most recent labs and vitals exhibited AUCs of 92.7%, 91.2%, and 94%, respectively. Early predictive models, using labs and vital signs closer to admission had AUCs of 81.1%, 84.9%, and 92%, respectively. Discussion The most accurate models exhibit racial bias, being more likely to falsely predict that Black patients will be hospitalized. Models that are only based on the dynamic vital score exhibited accuracies close to the best parsimonious models, although the latter also used laboratories. Conclusions This large study demonstrates that COVID-19 severity may accurately be predicted using a score that accounts for the dynamic evolution of vital signs. Further, race, social determinants of health, and hospital occupancy play an important role.","PeriodicalId":236137,"journal":{"name":"Journal of the American Medical Informatics Association : JAMIA","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Medical Informatics Association : JAMIA","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jamia/ocac062","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Abstract Objective To develop predictive models of coronavirus disease 2019 (COVID-19) outcomes, elucidate the influence of socioeconomic factors, and assess algorithmic racial fairness using a racially diverse patient population with high social needs. Materials and Methods Data included 7,102 patients with positive (RT-PCR) severe acute respiratory syndrome coronavirus 2 test at a safety-net system in Massachusetts. Linear and nonlinear classification methods were applied. A score based on a recurrent neural network and a transformer architecture was developed to capture the dynamic evolution of vital signs. Combined with patient characteristics, clinical variables, and hospital occupancy measures, this dynamic vital score was used to train predictive models. Results Hospitalizations can be predicted with an area under the receiver-operating characteristic curve (AUC) of 92% using symptoms, hospital occupancy, and patient characteristics, including social determinants of health. Parsimonious models to predict intensive care, mechanical ventilation, and mortality that used the most recent labs and vitals exhibited AUCs of 92.7%, 91.2%, and 94%, respectively. Early predictive models, using labs and vital signs closer to admission had AUCs of 81.1%, 84.9%, and 92%, respectively. Discussion The most accurate models exhibit racial bias, being more likely to falsely predict that Black patients will be hospitalized. Models that are only based on the dynamic vital score exhibited accuracies close to the best parsimonious models, although the latter also used laboratories. Conclusions This large study demonstrates that COVID-19 severity may accurately be predicted using a score that accounts for the dynamic evolution of vital signs. Further, race, social determinants of health, and hospital occupancy play an important role.

查看原文本刊更多论文

在安全网医院人群中开发和验证COVID-19预后预测模型

摘要目的建立2019冠状病毒病(COVID-19)结局预测模型，阐明社会经济因素的影响，并利用高社会需求的种族多样化患者群体评估算法的种族公平性。资料与方法:在美国马萨诸塞州的一个安全网系统中，7102例严重急性呼吸综合征冠状病毒2型检测阳性(RT-PCR)患者。采用了线性和非线性分类方法。基于递归神经网络和变压器架构的评分被开发来捕捉生命体征的动态演变。结合患者特征、临床变量和医院占用率措施，该动态生命评分用于训练预测模型。结果使用症状、医院占用率和患者特征(包括健康的社会决定因素)，可以以92%的接受者-操作特征曲线(AUC)下面积预测住院。使用最新的实验室和生命体征预测重症监护、机械通气和死亡率的简约模型的auc分别为92.7%、91.2%和94%。早期的预测模型，使用接近入院的实验室和生命体征，auc分别为81.1%，84.9%和92%。最准确的模型表现出种族偏见，更有可能错误地预测黑人患者将住院。仅基于动态生命分数的模型显示出接近最佳简约模型的准确性，尽管后者也使用实验室。这项大型研究表明，使用考虑生命体征动态演变的评分可以准确预测COVID-19严重程度。此外，种族、健康的社会决定因素和医院占用也起着重要作用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of the American Medical Informatics Association : JAMIA

自引率

0.00%

发文量