Huan Li, John Huston, Jana Zielonka, Shannon Kay, Maor Sauler, Jose Gomez
{"title":"通过在电子健康记录中实施机器学习,识别慢性阻塞性肺病严重急性加重亚组。","authors":"Huan Li, John Huston, Jana Zielonka, Shannon Kay, Maor Sauler, Jose Gomez","doi":"10.15326/jcopdf.2024.0556","DOIUrl":null,"url":null,"abstract":"<p><strong>Rationale: </strong>Acute exacerbations of COPD (AECOPD) are heterogeneous. Machine learning (ML) has previously been used to dissect some of the heterogeneity in COPD. The widespread adoption of electronic health records (EHRs) has led to the rapid accumulation of large amounts of patient data as part of routine clinical care. However, it is unclear whether the implementation of ML in EHR-derived data has the potential to identify subgroups of AECOPD.</p><p><strong>Objectives: </strong>Determine whether ML implementation using EHR data from severe AECOPD requiring hospitalization identifies relevant subgroups.</p><p><strong>Methods: </strong>This study used two retrospective cohorts of patients with AECOPD (non-COVID-19 and COVID-19) treated at Yale-New Haven Hospital (YNHHS). <i>K</i>-means clustering was used to identify patient subgroups.</p><p><strong>Measurements and main results: </strong>We identified three subgroups in the non-COVID cohort (n=1,736). Each subgroup had distinct clinical characteristics. The reference subgroup was the largest (n=904), followed by cardio-renal (n = 548) and eosinophilic (n=284). The eosinophilic subgroup had milder severity of AECOPD, including a shorter hospital stay (p<0.01). The cardio-renal subgroup had the highest mortality during (5%) and in the year after hospitalization (30%). Validation of the severe AECOPD classifier in the COVID-19 cohort recapitulated the characteristics seen in the non-COVID cohort. AECOPD subgroups in the COVID-19 cohort had different IL-1 beta, IL-2R, and IL-8 levels (FDR ≤ 0.05. These specific leukocyte and cytokine profiles resulted in inflammatory differences between the AECOPD subgroups based on C-reactive protein levels.</p><p><strong>Conclusions: </strong>Incorporating ML with EHR-data allows the identification of specific clinical and biological subgroups for severe AECOPD.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identification of Severe Acute Exacerbations of Chronic Obstructive Pulmonary Disease Subgroups by Machine Learning Implementation in Electronic Health Records.\",\"authors\":\"Huan Li, John Huston, Jana Zielonka, Shannon Kay, Maor Sauler, Jose Gomez\",\"doi\":\"10.15326/jcopdf.2024.0556\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Rationale: </strong>Acute exacerbations of COPD (AECOPD) are heterogeneous. Machine learning (ML) has previously been used to dissect some of the heterogeneity in COPD. The widespread adoption of electronic health records (EHRs) has led to the rapid accumulation of large amounts of patient data as part of routine clinical care. However, it is unclear whether the implementation of ML in EHR-derived data has the potential to identify subgroups of AECOPD.</p><p><strong>Objectives: </strong>Determine whether ML implementation using EHR data from severe AECOPD requiring hospitalization identifies relevant subgroups.</p><p><strong>Methods: </strong>This study used two retrospective cohorts of patients with AECOPD (non-COVID-19 and COVID-19) treated at Yale-New Haven Hospital (YNHHS). <i>K</i>-means clustering was used to identify patient subgroups.</p><p><strong>Measurements and main results: </strong>We identified three subgroups in the non-COVID cohort (n=1,736). Each subgroup had distinct clinical characteristics. The reference subgroup was the largest (n=904), followed by cardio-renal (n = 548) and eosinophilic (n=284). The eosinophilic subgroup had milder severity of AECOPD, including a shorter hospital stay (p<0.01). The cardio-renal subgroup had the highest mortality during (5%) and in the year after hospitalization (30%). Validation of the severe AECOPD classifier in the COVID-19 cohort recapitulated the characteristics seen in the non-COVID cohort. AECOPD subgroups in the COVID-19 cohort had different IL-1 beta, IL-2R, and IL-8 levels (FDR ≤ 0.05. These specific leukocyte and cytokine profiles resulted in inflammatory differences between the AECOPD subgroups based on C-reactive protein levels.</p><p><strong>Conclusions: </strong>Incorporating ML with EHR-data allows the identification of specific clinical and biological subgroups for severe AECOPD.</p>\",\"PeriodicalId\":2,\"journal\":{\"name\":\"ACS Applied Bio Materials\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-10-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Applied Bio Materials\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.15326/jcopdf.2024.0556\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATERIALS SCIENCE, BIOMATERIALS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.15326/jcopdf.2024.0556","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0
摘要
理由:慢性阻塞性肺病急性加重(AECOPD)具有异质性。机器学习(ML)曾被用于剖析慢性阻塞性肺病的一些异质性。随着电子健康记录(EHR)的广泛应用,作为常规临床护理的一部分,大量患者数据得以迅速积累。然而,目前还不清楚在 EHR 衍生数据中实施 ML 是否有可能识别 AECOPD 亚组:确定使用需要住院治疗的严重 AECOPD 的电子病历数据实施 ML 是否能识别相关亚组:本研究使用了耶鲁-纽黑文医院(Yale-New Haven Hospital,YNHHS)收治的两个回顾性 AECOPD 患者队列(非 COVID-19 和 COVID-19)。采用 K-均值聚类法确定患者亚组:我们在非COVID队列(n=1,736)中确定了三个亚组。每个亚组都有不同的临床特征。参照亚组人数最多(904 人),其次是心肾亚组(548 人)和嗜酸性粒细胞亚组(284 人)。嗜酸性粒细胞亚组的 AECOPD 严重程度较轻,包括住院时间较短(p结论:将 ML 与电子病历数据相结合,可以确定严重 AECOPD 的特定临床和生物学亚组。
Identification of Severe Acute Exacerbations of Chronic Obstructive Pulmonary Disease Subgroups by Machine Learning Implementation in Electronic Health Records.
Rationale: Acute exacerbations of COPD (AECOPD) are heterogeneous. Machine learning (ML) has previously been used to dissect some of the heterogeneity in COPD. The widespread adoption of electronic health records (EHRs) has led to the rapid accumulation of large amounts of patient data as part of routine clinical care. However, it is unclear whether the implementation of ML in EHR-derived data has the potential to identify subgroups of AECOPD.
Objectives: Determine whether ML implementation using EHR data from severe AECOPD requiring hospitalization identifies relevant subgroups.
Methods: This study used two retrospective cohorts of patients with AECOPD (non-COVID-19 and COVID-19) treated at Yale-New Haven Hospital (YNHHS). K-means clustering was used to identify patient subgroups.
Measurements and main results: We identified three subgroups in the non-COVID cohort (n=1,736). Each subgroup had distinct clinical characteristics. The reference subgroup was the largest (n=904), followed by cardio-renal (n = 548) and eosinophilic (n=284). The eosinophilic subgroup had milder severity of AECOPD, including a shorter hospital stay (p<0.01). The cardio-renal subgroup had the highest mortality during (5%) and in the year after hospitalization (30%). Validation of the severe AECOPD classifier in the COVID-19 cohort recapitulated the characteristics seen in the non-COVID cohort. AECOPD subgroups in the COVID-19 cohort had different IL-1 beta, IL-2R, and IL-8 levels (FDR ≤ 0.05. These specific leukocyte and cytokine profiles resulted in inflammatory differences between the AECOPD subgroups based on C-reactive protein levels.
Conclusions: Incorporating ML with EHR-data allows the identification of specific clinical and biological subgroups for severe AECOPD.