A Machine Learning Study of 534,023 Medicare Beneficiaries with COVID-19: Implications for Personalized Risk Prediction

C. Dun, Christi M. Walsh, S. Bae, S. Bae, A. Adalja, Eric S Toner, Timothy A. Lash, Farah Hashim, J. Paturzo, D. Segev, D. Segev, M. Makary, M. Makary
{"title":"A Machine Learning Study of 534,023 Medicare Beneficiaries with COVID-19: Implications for Personalized Risk Prediction","authors":"C. Dun, Christi M. Walsh, S. Bae, S. Bae, A. Adalja, Eric S Toner, Timothy A. Lash, Farah Hashim, J. Paturzo, D. Segev, D. Segev, M. Makary, M. Makary","doi":"10.1101/2020.10.27.20220970","DOIUrl":null,"url":null,"abstract":"Background: Global demand for a COVID-19 vaccine will exceed the initial limited supply. Identifying individuals at highest risk of COVID-19 death may help allocation prioritization efforts. Personalized risk prediction that uses a broad range of comorbidities requires a cohort size larger than that reported in prior studies. Methods: Medicare claims data was used to identify patients age 65 years or older with diagnosis of COVID-19 between April 1, 2020 and August 31, 2020. Demographic characteristics, chronic medical conditions, and other patient risk factors that existed before the advent of COVID-19 were identified. A random forest model was used to empirically explore factors associated with COVID-19 death. The independent impact of factors identified were quantified using multivariate logistic regression with random effects. Results: We identified 534,023 COVID-19 patients of whom 38,066 had an inpatient death. Demographic characteristics associated with COVID-19 death included advanced age (85 years or older: aOR: 2.07; 95% CI, 1.99-2.16), male sex (aOR, 1.88; 95% CI, 1.82-1.94), and non-white race (Hispanic: aOR, 1.74; 95% CI, 1.66-1.83). Leading comorbidities associated with COVID-19 mortality included sickle cell disease (aOR, 1.73; 95% CI, 1.21-2.47), chronic kidney disease (aOR, 1.32; 95% CI, 1.29-1.36), leukemias and lymphomas (aOR, 1.22; 95% CI, 1.14-1.30), heart failure (aOR, 1.19; 95% CI, 1.16-1.22), and diabetes (aOR, 1.18; 95% CI, 1.15-1.22). Conclusions: We created a personalized risk prediction calculator to identify candidates for early vaccine and therapeutics allocation (www.predictcovidrisk.com). These findings may be used to protect those at greatest risk of death from COVID-19.","PeriodicalId":73706,"journal":{"name":"Journal of diabetes and clinical research","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of diabetes and clinical research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2020.10.27.20220970","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13

Abstract

Background: Global demand for a COVID-19 vaccine will exceed the initial limited supply. Identifying individuals at highest risk of COVID-19 death may help allocation prioritization efforts. Personalized risk prediction that uses a broad range of comorbidities requires a cohort size larger than that reported in prior studies. Methods: Medicare claims data was used to identify patients age 65 years or older with diagnosis of COVID-19 between April 1, 2020 and August 31, 2020. Demographic characteristics, chronic medical conditions, and other patient risk factors that existed before the advent of COVID-19 were identified. A random forest model was used to empirically explore factors associated with COVID-19 death. The independent impact of factors identified were quantified using multivariate logistic regression with random effects. Results: We identified 534,023 COVID-19 patients of whom 38,066 had an inpatient death. Demographic characteristics associated with COVID-19 death included advanced age (85 years or older: aOR: 2.07; 95% CI, 1.99-2.16), male sex (aOR, 1.88; 95% CI, 1.82-1.94), and non-white race (Hispanic: aOR, 1.74; 95% CI, 1.66-1.83). Leading comorbidities associated with COVID-19 mortality included sickle cell disease (aOR, 1.73; 95% CI, 1.21-2.47), chronic kidney disease (aOR, 1.32; 95% CI, 1.29-1.36), leukemias and lymphomas (aOR, 1.22; 95% CI, 1.14-1.30), heart failure (aOR, 1.19; 95% CI, 1.16-1.22), and diabetes (aOR, 1.18; 95% CI, 1.15-1.22). Conclusions: We created a personalized risk prediction calculator to identify candidates for early vaccine and therapeutics allocation (www.predictcovidrisk.com). These findings may be used to protect those at greatest risk of death from COVID-19.
一项针对534,023名COVID-19医疗保险受益人的机器学习研究:对个性化风险预测的影响
背景:全球对COVID-19疫苗的需求将超过最初有限的供应。确定COVID-19死亡风险最高的个体可能有助于分配优先工作。使用广泛的合并症的个性化风险预测需要比先前研究报道的更大的队列规模。方法:使用医疗保险索赔数据识别2020年4月1日至2020年8月31日期间诊断为COVID-19的65岁及以上患者。确定了COVID-19出现之前存在的人口统计学特征、慢性疾病和其他患者风险因素。随机森林模型用于实证探索与COVID-19死亡相关的因素。采用随机效应的多变量logistic回归对所确定因素的独立影响进行量化。结果:我们确定了534,023例COVID-19患者,其中38,066例住院死亡。与COVID-19死亡相关的人口统计学特征包括高龄(85岁或以上:aOR: 2.07;95% CI, 1.99-2.16),男性(aOR, 1.88;95% CI, 1.82-1.94)和非白种人(西班牙裔:aOR, 1.74;95% ci, 1.66-1.83)。与COVID-19死亡率相关的主要合并症包括镰状细胞病(aOR, 1.73;95% CI, 1.21-2.47),慢性肾脏疾病(aOR, 1.32;95% CI, 1.29-1.36),白血病和淋巴瘤(aOR, 1.22;95% CI, 1.14-1.30),心力衰竭(aOR, 1.19;95% CI, 1.16-1.22)和糖尿病(aOR, 1.18;95% ci, 1.15-1.22)。结论:我们创建了一个个性化的风险预测计算器来确定早期疫苗和治疗分配的候选人(www.predictcovidrisk.com)。这些发现可用于保护那些因COVID-19死亡风险最高的人。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信