利用多模态数据预测血液透析患者中的 SARS-CoV-2 感染。

Frontiers in nephrology Pub Date : 2023-06-02 eCollection Date: 2023-01-01 DOI:10.3389/fneph.2023.1179342
Juntao Duan, Hanmo Li, Xiaoran Ma, Hanjie Zhang, Rachel Lasky, Caitlin K Monaghan, Sheetal Chaudhuri, Len A Usvyat, Mengyang Gu, Wensheng Guo, Peter Kotanko, Yuedong Wang
{"title":"利用多模态数据预测血液透析患者中的 SARS-CoV-2 感染。","authors":"Juntao Duan, Hanmo Li, Xiaoran Ma, Hanjie Zhang, Rachel Lasky, Caitlin K Monaghan, Sheetal Chaudhuri, Len A Usvyat, Mengyang Gu, Wensheng Guo, Peter Kotanko, Yuedong Wang","doi":"10.3389/fneph.2023.1179342","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The coronavirus disease 2019 (COVID-19) pandemic has created more devastation among dialysis patients than among the general population. Patient-level prediction models for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection are crucial for the early identification of patients to prevent and mitigate outbreaks within dialysis clinics. As the COVID-19 pandemic evolves, it is unclear whether or not previously built prediction models are still sufficiently effective.</p><p><strong>Methods: </strong>We developed a machine learning (XGBoost) model to predict during the incubation period a SARS-CoV-2 infection that is subsequently diagnosed after 3 or more days. We used data from multiple sources, including demographic, clinical, treatment, laboratory, and vaccination information from a national network of hemodialysis clinics, socioeconomic information from the Census Bureau, and county-level COVID-19 infection and mortality information from state and local health agencies. We created prediction models and evaluated their performances on a rolling basis to investigate the evolution of prediction power and risk factors.</p><p><strong>Result: </strong>From April 2020 to August 2020, our machine learning model achieved an area under the receiver operating characteristic curve (AUROC) of 0.75, an improvement of over 0.07 from a previously developed machine learning model published by Kidney360 in 2021. As the pandemic evolved, the prediction performance deteriorated and fluctuated more, with the lowest AUROC of 0.6 in December 2021 and January 2022. Over the whole study period, that is, from April 2020 to February 2022, fixing the false-positive rate at 20%, our model was able to detect 40% of the positive patients. We found that features derived from local infection information reported by the Centers for Disease Control and Prevention (CDC) were the most important predictors, and vaccination status was a useful predictor as well. Whether or not a patient lives in a nursing home was an effective predictor before vaccination, but became less predictive after vaccination.</p><p><strong>Conclusion: </strong>As found in our study, the dynamics of the prediction model are frequently changing as the pandemic evolves. County-level infection information and vaccination information are crucial for the success of early COVID-19 prediction models. Our results show that the proposed model can effectively identify SARS-CoV-2 infections during the incubation period. Prospective studies are warranted to explore the application of such prediction models in daily clinical practice.</p>","PeriodicalId":73091,"journal":{"name":"Frontiers in nephrology","volume":"3 ","pages":"1179342"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10479652/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predicting SARS-CoV-2 infection among hemodialysis patients using multimodal data.\",\"authors\":\"Juntao Duan, Hanmo Li, Xiaoran Ma, Hanjie Zhang, Rachel Lasky, Caitlin K Monaghan, Sheetal Chaudhuri, Len A Usvyat, Mengyang Gu, Wensheng Guo, Peter Kotanko, Yuedong Wang\",\"doi\":\"10.3389/fneph.2023.1179342\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The coronavirus disease 2019 (COVID-19) pandemic has created more devastation among dialysis patients than among the general population. Patient-level prediction models for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection are crucial for the early identification of patients to prevent and mitigate outbreaks within dialysis clinics. As the COVID-19 pandemic evolves, it is unclear whether or not previously built prediction models are still sufficiently effective.</p><p><strong>Methods: </strong>We developed a machine learning (XGBoost) model to predict during the incubation period a SARS-CoV-2 infection that is subsequently diagnosed after 3 or more days. We used data from multiple sources, including demographic, clinical, treatment, laboratory, and vaccination information from a national network of hemodialysis clinics, socioeconomic information from the Census Bureau, and county-level COVID-19 infection and mortality information from state and local health agencies. We created prediction models and evaluated their performances on a rolling basis to investigate the evolution of prediction power and risk factors.</p><p><strong>Result: </strong>From April 2020 to August 2020, our machine learning model achieved an area under the receiver operating characteristic curve (AUROC) of 0.75, an improvement of over 0.07 from a previously developed machine learning model published by Kidney360 in 2021. As the pandemic evolved, the prediction performance deteriorated and fluctuated more, with the lowest AUROC of 0.6 in December 2021 and January 2022. Over the whole study period, that is, from April 2020 to February 2022, fixing the false-positive rate at 20%, our model was able to detect 40% of the positive patients. We found that features derived from local infection information reported by the Centers for Disease Control and Prevention (CDC) were the most important predictors, and vaccination status was a useful predictor as well. Whether or not a patient lives in a nursing home was an effective predictor before vaccination, but became less predictive after vaccination.</p><p><strong>Conclusion: </strong>As found in our study, the dynamics of the prediction model are frequently changing as the pandemic evolves. County-level infection information and vaccination information are crucial for the success of early COVID-19 prediction models. Our results show that the proposed model can effectively identify SARS-CoV-2 infections during the incubation period. Prospective studies are warranted to explore the application of such prediction models in daily clinical practice.</p>\",\"PeriodicalId\":73091,\"journal\":{\"name\":\"Frontiers in nephrology\",\"volume\":\"3 \",\"pages\":\"1179342\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10479652/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in nephrology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/fneph.2023.1179342\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in nephrology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fneph.2023.1179342","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

背景:2019 年冠状病毒病(COVID-19)大流行在透析患者中造成的破坏比在普通人群中造成的破坏更大。严重急性呼吸系统综合征冠状病毒 2(SARS-CoV-2)感染的患者水平预测模型对于早期识别患者以预防和减轻透析诊所内的疫情至关重要。随着 COVID-19 大流行的演变,目前还不清楚以前建立的预测模型是否仍然足够有效:我们开发了一个机器学习(XGBoost)模型,用于在潜伏期内预测 SARS-CoV-2 感染,并在 3 天或更长时间后确诊。我们使用了多种来源的数据,包括来自全国血液透析诊所网络的人口统计、临床、治疗、实验室和疫苗接种信息,来自人口普查局的社会经济信息,以及来自州和地方卫生机构的县级 COVID-19 感染和死亡信息。我们创建了预测模型,并对其性能进行了滚动评估,以研究预测能力和风险因素的演变:从 2020 年 4 月到 2020 年 8 月,我们的机器学习模型的接收者操作特征曲线下面积(AUROC)达到了 0.75,比 Kidney360 在 2021 年发布的之前开发的机器学习模型提高了 0.07 以上。随着疫情的发展,预测性能有所下降,波动幅度更大,2021 年 12 月和 2022 年 1 月的 AUROC 最低,仅为 0.6。在整个研究期间,即从 2020 年 4 月到 2022 年 2 月,假阳性率固定为 20%,我们的模型能够检测到 40% 的阳性患者。我们发现,从美国疾病控制和预防中心(CDC)报告的当地感染信息中得出的特征是最重要的预测因素,疫苗接种状况也是一个有用的预测因素。在接种疫苗前,患者是否住在养老院是一个有效的预测因素,但在接种疫苗后,预测性降低:正如我们的研究发现的那样,随着大流行病的发展,预测模型的动态也在不断变化。县级感染信息和疫苗接种信息对于早期 COVID-19 预测模型的成功至关重要。我们的研究结果表明,所提出的模型能有效识别潜伏期内的 SARS-CoV-2 感染。我们有必要开展前瞻性研究,探索此类预测模型在日常临床实践中的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Predicting SARS-CoV-2 infection among hemodialysis patients using multimodal data.

Predicting SARS-CoV-2 infection among hemodialysis patients using multimodal data.

Predicting SARS-CoV-2 infection among hemodialysis patients using multimodal data.

Predicting SARS-CoV-2 infection among hemodialysis patients using multimodal data.

Background: The coronavirus disease 2019 (COVID-19) pandemic has created more devastation among dialysis patients than among the general population. Patient-level prediction models for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection are crucial for the early identification of patients to prevent and mitigate outbreaks within dialysis clinics. As the COVID-19 pandemic evolves, it is unclear whether or not previously built prediction models are still sufficiently effective.

Methods: We developed a machine learning (XGBoost) model to predict during the incubation period a SARS-CoV-2 infection that is subsequently diagnosed after 3 or more days. We used data from multiple sources, including demographic, clinical, treatment, laboratory, and vaccination information from a national network of hemodialysis clinics, socioeconomic information from the Census Bureau, and county-level COVID-19 infection and mortality information from state and local health agencies. We created prediction models and evaluated their performances on a rolling basis to investigate the evolution of prediction power and risk factors.

Result: From April 2020 to August 2020, our machine learning model achieved an area under the receiver operating characteristic curve (AUROC) of 0.75, an improvement of over 0.07 from a previously developed machine learning model published by Kidney360 in 2021. As the pandemic evolved, the prediction performance deteriorated and fluctuated more, with the lowest AUROC of 0.6 in December 2021 and January 2022. Over the whole study period, that is, from April 2020 to February 2022, fixing the false-positive rate at 20%, our model was able to detect 40% of the positive patients. We found that features derived from local infection information reported by the Centers for Disease Control and Prevention (CDC) were the most important predictors, and vaccination status was a useful predictor as well. Whether or not a patient lives in a nursing home was an effective predictor before vaccination, but became less predictive after vaccination.

Conclusion: As found in our study, the dynamics of the prediction model are frequently changing as the pandemic evolves. County-level infection information and vaccination information are crucial for the success of early COVID-19 prediction models. Our results show that the proposed model can effectively identify SARS-CoV-2 infections during the incubation period. Prospective studies are warranted to explore the application of such prediction models in daily clinical practice.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信