Amalie D Haue, Peter C Holm, Karina Banasik, Kenny Emil Aunstrup, Christian Holm Johansen, Agnete T Lundgaard, Victorine P Muse, Timo Röder, David Westergaard, Piotr J Chmura, Alex H Christensen, Peter E Weeke, Erik Sørensen, Ole B V Pedersen, Sisse R Ostrowski, Kasper K Iversen, Lars V Køber, Henrik Ullum, Henning Bundgaard, Søren Brunak
{"title":"Subgrouping patients with ischemic heart disease by means of the Markov cluster algorithm.","authors":"Amalie D Haue, Peter C Holm, Karina Banasik, Kenny Emil Aunstrup, Christian Holm Johansen, Agnete T Lundgaard, Victorine P Muse, Timo Röder, David Westergaard, Piotr J Chmura, Alex H Christensen, Peter E Weeke, Erik Sørensen, Ole B V Pedersen, Sisse R Ostrowski, Kasper K Iversen, Lars V Køber, Henrik Ullum, Henning Bundgaard, Søren Brunak","doi":"10.1038/s43856-025-01077-1","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Ischemic heart disease (IHD) is heterogeneous with respect to onset, burden of symptoms, and disease progression. We hypothesized that unsupervised clustering analysis could facilitate identification of distinct and clinically relevant multimorbidity clusters.</p><p><strong>Methods: </strong>We included IHD patients who underwent coronary angiography (CAG) or coronary computed tomography angiography (CCTA) between 2004 and 2016 and used the earliest procedure as the index date. Patient health records were obtained from the Danish National Patient Registry, the Danish National Prescription Registry, and two in-hospital laboratory database systems. Genetic data were obtained from the Copenhagen Hospital Biobank. Using registered pre-index diagnosis codes (n = 3046), patients were clustered by application of the Markov Cluster algorithm. Multimorbidity clusters were then characterized using Cox regressions (new ischemic events, non-IHD mortality, and all-cause mortality) and enrichment analysis to explore both risks and phenotypical characteristics.</p><p><strong>Results: </strong>In a cohort of 72,249 patients with IHD (mean age 63.9 years, 63.1% males), 31 distinct clusters (C1-31, 67,136 patients) are identified. Comparing each cluster to the 30 others, seven clusters (9,590 patients) have significantly higher or lower risk of new ischemic events (five and two clusters, respectively). A total of 18 clusters (35,982 patients) have higher or lower risk of death from non-IHD causes (12 and six clusters, respectively), and 23 clusters have a statistically significant higher or lower risk for all-cause mortality. Cardiovascular or inflammatory diseases are commonly enriched in clusters (13). Distributions for 24 laboratory test results differ significantly across clusters. Polygenic risk scores are increased in a total of 15 clusters (48.4%).</p><p><strong>Conclusions: </strong>Based on prior disease profiles, unsupervised clustering robustly stratify patients with IHD in subgroups with similar clinical features and outcomes.</p>","PeriodicalId":72646,"journal":{"name":"Communications medicine","volume":"5 1","pages":"372"},"PeriodicalIF":5.4000,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12381225/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1038/s43856-025-01077-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Ischemic heart disease (IHD) is heterogeneous with respect to onset, burden of symptoms, and disease progression. We hypothesized that unsupervised clustering analysis could facilitate identification of distinct and clinically relevant multimorbidity clusters.
Methods: We included IHD patients who underwent coronary angiography (CAG) or coronary computed tomography angiography (CCTA) between 2004 and 2016 and used the earliest procedure as the index date. Patient health records were obtained from the Danish National Patient Registry, the Danish National Prescription Registry, and two in-hospital laboratory database systems. Genetic data were obtained from the Copenhagen Hospital Biobank. Using registered pre-index diagnosis codes (n = 3046), patients were clustered by application of the Markov Cluster algorithm. Multimorbidity clusters were then characterized using Cox regressions (new ischemic events, non-IHD mortality, and all-cause mortality) and enrichment analysis to explore both risks and phenotypical characteristics.
Results: In a cohort of 72,249 patients with IHD (mean age 63.9 years, 63.1% males), 31 distinct clusters (C1-31, 67,136 patients) are identified. Comparing each cluster to the 30 others, seven clusters (9,590 patients) have significantly higher or lower risk of new ischemic events (five and two clusters, respectively). A total of 18 clusters (35,982 patients) have higher or lower risk of death from non-IHD causes (12 and six clusters, respectively), and 23 clusters have a statistically significant higher or lower risk for all-cause mortality. Cardiovascular or inflammatory diseases are commonly enriched in clusters (13). Distributions for 24 laboratory test results differ significantly across clusters. Polygenic risk scores are increased in a total of 15 clusters (48.4%).
Conclusions: Based on prior disease profiles, unsupervised clustering robustly stratify patients with IHD in subgroups with similar clinical features and outcomes.