{"title":"Quantitative CT and COPD: cluster analysis reveals five distinct subtypes with varying exacerbation risks.","authors":"Chusheng Peng, Zizheng Chen, Haobin Zhou, Chaoyue Dai, Haolei Yuan, Yuan Gao, Fengyan Wang, Zhenyu Liang","doi":"10.1186/s12890-025-03562-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The heterogeneity of chronic obstructive pulmonary disease (COPD) is increasingly recognized. To characterize the heterogeneity of COPD, we aimed to identify subtypes related to quantitative CT by using principal component analysis (PCA) and cluster analysis.</p><p><strong>Methods: </strong>The data of 1879 participants in the SPIROMICS study were obtained from the NHLBI Biologic Specimen and Data Repository Information Coordinating Center. A combination of PCA and k-means clustering was used to analyze the data from these participants in the SPIROMICS study. We randomly split the samples into training and validation sets. Clusters were evaluated for their relationship with acute exacerbation risk throughout the entire follow-up period. The results of the training set were confirmed in the validation set. To avoid sampling errors, we conducted 10 random sampling cycles. Normalized mutual information (NMI) was applied in every cycle to evaluate the stability of clustering.</p><p><strong>Results: </strong>We identified five clusters related to quantitative CT characterized as follows: (1) male-dominated low disease impact cluster, (2) obesity with relatively high symptom burden cluster, (3) airway wall lesion cluster, (4) lung upper region zone-predominant emphysema cluster, (5) severe emphysema cluster. There are significant differences in acute exacerbation risk among these five clusters.</p><p><strong>Conclusions: </strong>Cluster analysis identified 5 clusters related to quantitative CT of all participants in the SPIROMICS cohort with significant differences in baseline characteristics and acute exacerbation risk. The stability of clustering results was validated through NMI in 10 sampling cycles. In addition, dimensionality reduction results showed high reproducibility in different studies.</p>","PeriodicalId":9148,"journal":{"name":"BMC Pulmonary Medicine","volume":"25 1","pages":"92"},"PeriodicalIF":2.6000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11863429/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Pulmonary Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12890-025-03562-8","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RESPIRATORY SYSTEM","Score":null,"Total":0}
引用次数: 0
Abstract
Background: The heterogeneity of chronic obstructive pulmonary disease (COPD) is increasingly recognized. To characterize the heterogeneity of COPD, we aimed to identify subtypes related to quantitative CT by using principal component analysis (PCA) and cluster analysis.
Methods: The data of 1879 participants in the SPIROMICS study were obtained from the NHLBI Biologic Specimen and Data Repository Information Coordinating Center. A combination of PCA and k-means clustering was used to analyze the data from these participants in the SPIROMICS study. We randomly split the samples into training and validation sets. Clusters were evaluated for their relationship with acute exacerbation risk throughout the entire follow-up period. The results of the training set were confirmed in the validation set. To avoid sampling errors, we conducted 10 random sampling cycles. Normalized mutual information (NMI) was applied in every cycle to evaluate the stability of clustering.
Results: We identified five clusters related to quantitative CT characterized as follows: (1) male-dominated low disease impact cluster, (2) obesity with relatively high symptom burden cluster, (3) airway wall lesion cluster, (4) lung upper region zone-predominant emphysema cluster, (5) severe emphysema cluster. There are significant differences in acute exacerbation risk among these five clusters.
Conclusions: Cluster analysis identified 5 clusters related to quantitative CT of all participants in the SPIROMICS cohort with significant differences in baseline characteristics and acute exacerbation risk. The stability of clustering results was validated through NMI in 10 sampling cycles. In addition, dimensionality reduction results showed high reproducibility in different studies.
期刊介绍:
BMC Pulmonary Medicine is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of pulmonary and associated disorders, as well as related molecular genetics, pathophysiology, and epidemiology.