Identification of Severe Acute Exacerbations of Chronic Obstructive Pulmonary Disease Subgroups by Machine Learning Implementation in Electronic Health Records.

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS
Huan Li, John Huston, Jana Zielonka, Shannon Kay, Maor Sauler, Jose Gomez
{"title":"Identification of Severe Acute Exacerbations of Chronic Obstructive Pulmonary Disease Subgroups by Machine Learning Implementation in Electronic Health Records.","authors":"Huan Li, John Huston, Jana Zielonka, Shannon Kay, Maor Sauler, Jose Gomez","doi":"10.15326/jcopdf.2024.0556","DOIUrl":null,"url":null,"abstract":"<p><strong>Rationale: </strong>Acute exacerbations of COPD (AECOPD) are heterogeneous. Machine learning (ML) has previously been used to dissect some of the heterogeneity in COPD. The widespread adoption of electronic health records (EHRs) has led to the rapid accumulation of large amounts of patient data as part of routine clinical care. However, it is unclear whether the implementation of ML in EHR-derived data has the potential to identify subgroups of AECOPD.</p><p><strong>Objectives: </strong>Determine whether ML implementation using EHR data from severe AECOPD requiring hospitalization identifies relevant subgroups.</p><p><strong>Methods: </strong>This study used two retrospective cohorts of patients with AECOPD (non-COVID-19 and COVID-19) treated at Yale-New Haven Hospital (YNHHS). <i>K</i>-means clustering was used to identify patient subgroups.</p><p><strong>Measurements and main results: </strong>We identified three subgroups in the non-COVID cohort (n=1,736). Each subgroup had distinct clinical characteristics. The reference subgroup was the largest (n=904), followed by cardio-renal (n = 548) and eosinophilic (n=284). The eosinophilic subgroup had milder severity of AECOPD, including a shorter hospital stay (p<0.01). The cardio-renal subgroup had the highest mortality during (5%) and in the year after hospitalization (30%). Validation of the severe AECOPD classifier in the COVID-19 cohort recapitulated the characteristics seen in the non-COVID cohort. AECOPD subgroups in the COVID-19 cohort had different IL-1 beta, IL-2R, and IL-8 levels (FDR ≤ 0.05. These specific leukocyte and cytokine profiles resulted in inflammatory differences between the AECOPD subgroups based on C-reactive protein levels.</p><p><strong>Conclusions: </strong>Incorporating ML with EHR-data allows the identification of specific clinical and biological subgroups for severe AECOPD.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.15326/jcopdf.2024.0556","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0

Abstract

Rationale: Acute exacerbations of COPD (AECOPD) are heterogeneous. Machine learning (ML) has previously been used to dissect some of the heterogeneity in COPD. The widespread adoption of electronic health records (EHRs) has led to the rapid accumulation of large amounts of patient data as part of routine clinical care. However, it is unclear whether the implementation of ML in EHR-derived data has the potential to identify subgroups of AECOPD.

Objectives: Determine whether ML implementation using EHR data from severe AECOPD requiring hospitalization identifies relevant subgroups.

Methods: This study used two retrospective cohorts of patients with AECOPD (non-COVID-19 and COVID-19) treated at Yale-New Haven Hospital (YNHHS). K-means clustering was used to identify patient subgroups.

Measurements and main results: We identified three subgroups in the non-COVID cohort (n=1,736). Each subgroup had distinct clinical characteristics. The reference subgroup was the largest (n=904), followed by cardio-renal (n = 548) and eosinophilic (n=284). The eosinophilic subgroup had milder severity of AECOPD, including a shorter hospital stay (p<0.01). The cardio-renal subgroup had the highest mortality during (5%) and in the year after hospitalization (30%). Validation of the severe AECOPD classifier in the COVID-19 cohort recapitulated the characteristics seen in the non-COVID cohort. AECOPD subgroups in the COVID-19 cohort had different IL-1 beta, IL-2R, and IL-8 levels (FDR ≤ 0.05. These specific leukocyte and cytokine profiles resulted in inflammatory differences between the AECOPD subgroups based on C-reactive protein levels.

Conclusions: Incorporating ML with EHR-data allows the identification of specific clinical and biological subgroups for severe AECOPD.

通过在电子健康记录中实施机器学习,识别慢性阻塞性肺病严重急性加重亚组。
理由:慢性阻塞性肺病急性加重(AECOPD)具有异质性。机器学习(ML)曾被用于剖析慢性阻塞性肺病的一些异质性。随着电子健康记录(EHR)的广泛应用,作为常规临床护理的一部分,大量患者数据得以迅速积累。然而,目前还不清楚在 EHR 衍生数据中实施 ML 是否有可能识别 AECOPD 亚组:确定使用需要住院治疗的严重 AECOPD 的电子病历数据实施 ML 是否能识别相关亚组:本研究使用了耶鲁-纽黑文医院(Yale-New Haven Hospital,YNHHS)收治的两个回顾性 AECOPD 患者队列(非 COVID-19 和 COVID-19)。采用 K-均值聚类法确定患者亚组:我们在非COVID队列(n=1,736)中确定了三个亚组。每个亚组都有不同的临床特征。参照亚组人数最多(904 人),其次是心肾亚组(548 人)和嗜酸性粒细胞亚组(284 人)。嗜酸性粒细胞亚组的 AECOPD 严重程度较轻,包括住院时间较短(p结论:将 ML 与电子病历数据相结合,可以确定严重 AECOPD 的特定临床和生物学亚组。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ACS Applied Bio Materials
ACS Applied Bio Materials Chemistry-Chemistry (all)
CiteScore
9.40
自引率
2.10%
发文量
464
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信