Sally Zhao, Zhan Ye, Bhavna Adhin, Matti Vuori, Jari Laukkanen, Sudeshna Fisch
{"title":"心肾器官间评估:一种新的心电动态时间扭曲聚类方法。","authors":"Sally Zhao, Zhan Ye, Bhavna Adhin, Matti Vuori, Jari Laukkanen, Sudeshna Fisch","doi":"10.2196/73353","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The heart and kidneys have vital functions in the human body that reciprocally influence each physiologically. Pathological changes in one organ can damage the other. Epidemiologic studies show that greater than 50% of patients with heart failure (HF) have preserved ejection fraction (HFpEF). Additionally, one in six patients identified as having chronic kidney disease (CKD) also has HF. Thus, it is important to be able to predict and identify the cardiorenal relationship between HFpEF and CKD.</p><p><strong>Objective: </strong>Creating an ECG-enabled model that stratifies HFpEF suspected patients would help identify CKD enriched HFpEF clusters and phenogroups. Simultaneously, a minimal set of significant ECG features derived from the stratification model would aid precision medicine and practical diagnoses due to being more accessible and widely readable than a large set of clinical inputs. Furthermore, validation of the existing cardiorenal relationship using this ECG-enabled model may lead to better biological understanding.</p><p><strong>Methods: </strong>Using unsupervised clustering on all extractable ECG features from FinnGen, patients with an indication of HFpEF (filtered by LVEF ≥ 50% and NT-proBNP > 450 pg/mL) were categorized into different phenogroups and analyzed for CKD risk. After isolating significant predictive ECG features, unsupervised clustering and risk analysis were performed again to demonstrate the efficacy of using a minimal set of features for phenogrouping. These clusters were then compared to clusters formed using Dynamic Time Warping (DTW) on raw ECG time series electrical signals. Afterwards, these clusters were analyzed for CKD enrichment.</p><p><strong>Results: </strong>PR interval and QRS duration stood out as significant features and were used as the minimal feature set. After generating and comparing clusters (K-means with all extracted ECG features, K-means with minimal feature set, and DTW with full Lead II ECG waveform), the DTW generated clusters were most stable. ANOVA analysis also showed that several HFpEF clusters exhibited a deviation of CKD risk from baseline, allowing for further trajectory analysis. Specifically, the creatinine levels (a proxy for CKD) of several DTW created clusters showed significant differences from average. Based off Jaccard score, the DTW clusters also showed the greatest alignment to baseline comparison clusters created by clustering on creatinine. In comparison, the other two sets of clusters (created by all extracted ECG features and the minimal set) performed similarly.</p><p><strong>Conclusions: </strong>This project validates both the known cardiorenal relationship between HFpEF and CKD and the importance of the PR interval and QRS duration. After exploring the use of ECG data for patient clustering and stratification, DTW clustering with Lead II waveforms resulted in the most clinically meaningful clusters in the context of HFpEF and CKD. This methodology may prove useful in exploring ECG clustering applications outside of HFpEF as well.</p><p><strong>Clinicaltrial: </strong></p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":" ","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Cardiorenal Inter-organ Assessment: A Novel Clustering Method Using Dynamic Time Warping on ECG.\",\"authors\":\"Sally Zhao, Zhan Ye, Bhavna Adhin, Matti Vuori, Jari Laukkanen, Sudeshna Fisch\",\"doi\":\"10.2196/73353\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The heart and kidneys have vital functions in the human body that reciprocally influence each physiologically. Pathological changes in one organ can damage the other. Epidemiologic studies show that greater than 50% of patients with heart failure (HF) have preserved ejection fraction (HFpEF). Additionally, one in six patients identified as having chronic kidney disease (CKD) also has HF. Thus, it is important to be able to predict and identify the cardiorenal relationship between HFpEF and CKD.</p><p><strong>Objective: </strong>Creating an ECG-enabled model that stratifies HFpEF suspected patients would help identify CKD enriched HFpEF clusters and phenogroups. Simultaneously, a minimal set of significant ECG features derived from the stratification model would aid precision medicine and practical diagnoses due to being more accessible and widely readable than a large set of clinical inputs. Furthermore, validation of the existing cardiorenal relationship using this ECG-enabled model may lead to better biological understanding.</p><p><strong>Methods: </strong>Using unsupervised clustering on all extractable ECG features from FinnGen, patients with an indication of HFpEF (filtered by LVEF ≥ 50% and NT-proBNP > 450 pg/mL) were categorized into different phenogroups and analyzed for CKD risk. After isolating significant predictive ECG features, unsupervised clustering and risk analysis were performed again to demonstrate the efficacy of using a minimal set of features for phenogrouping. These clusters were then compared to clusters formed using Dynamic Time Warping (DTW) on raw ECG time series electrical signals. Afterwards, these clusters were analyzed for CKD enrichment.</p><p><strong>Results: </strong>PR interval and QRS duration stood out as significant features and were used as the minimal feature set. After generating and comparing clusters (K-means with all extracted ECG features, K-means with minimal feature set, and DTW with full Lead II ECG waveform), the DTW generated clusters were most stable. ANOVA analysis also showed that several HFpEF clusters exhibited a deviation of CKD risk from baseline, allowing for further trajectory analysis. Specifically, the creatinine levels (a proxy for CKD) of several DTW created clusters showed significant differences from average. Based off Jaccard score, the DTW clusters also showed the greatest alignment to baseline comparison clusters created by clustering on creatinine. In comparison, the other two sets of clusters (created by all extracted ECG features and the minimal set) performed similarly.</p><p><strong>Conclusions: </strong>This project validates both the known cardiorenal relationship between HFpEF and CKD and the importance of the PR interval and QRS duration. After exploring the use of ECG data for patient clustering and stratification, DTW clustering with Lead II waveforms resulted in the most clinically meaningful clusters in the context of HFpEF and CKD. This methodology may prove useful in exploring ECG clustering applications outside of HFpEF as well.</p><p><strong>Clinicaltrial: </strong></p>\",\"PeriodicalId\":56334,\"journal\":{\"name\":\"JMIR Medical Informatics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JMIR Medical Informatics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.2196/73353\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2196/73353","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
Cardiorenal Inter-organ Assessment: A Novel Clustering Method Using Dynamic Time Warping on ECG.
Background: The heart and kidneys have vital functions in the human body that reciprocally influence each physiologically. Pathological changes in one organ can damage the other. Epidemiologic studies show that greater than 50% of patients with heart failure (HF) have preserved ejection fraction (HFpEF). Additionally, one in six patients identified as having chronic kidney disease (CKD) also has HF. Thus, it is important to be able to predict and identify the cardiorenal relationship between HFpEF and CKD.
Objective: Creating an ECG-enabled model that stratifies HFpEF suspected patients would help identify CKD enriched HFpEF clusters and phenogroups. Simultaneously, a minimal set of significant ECG features derived from the stratification model would aid precision medicine and practical diagnoses due to being more accessible and widely readable than a large set of clinical inputs. Furthermore, validation of the existing cardiorenal relationship using this ECG-enabled model may lead to better biological understanding.
Methods: Using unsupervised clustering on all extractable ECG features from FinnGen, patients with an indication of HFpEF (filtered by LVEF ≥ 50% and NT-proBNP > 450 pg/mL) were categorized into different phenogroups and analyzed for CKD risk. After isolating significant predictive ECG features, unsupervised clustering and risk analysis were performed again to demonstrate the efficacy of using a minimal set of features for phenogrouping. These clusters were then compared to clusters formed using Dynamic Time Warping (DTW) on raw ECG time series electrical signals. Afterwards, these clusters were analyzed for CKD enrichment.
Results: PR interval and QRS duration stood out as significant features and were used as the minimal feature set. After generating and comparing clusters (K-means with all extracted ECG features, K-means with minimal feature set, and DTW with full Lead II ECG waveform), the DTW generated clusters were most stable. ANOVA analysis also showed that several HFpEF clusters exhibited a deviation of CKD risk from baseline, allowing for further trajectory analysis. Specifically, the creatinine levels (a proxy for CKD) of several DTW created clusters showed significant differences from average. Based off Jaccard score, the DTW clusters also showed the greatest alignment to baseline comparison clusters created by clustering on creatinine. In comparison, the other two sets of clusters (created by all extracted ECG features and the minimal set) performed similarly.
Conclusions: This project validates both the known cardiorenal relationship between HFpEF and CKD and the importance of the PR interval and QRS duration. After exploring the use of ECG data for patient clustering and stratification, DTW clustering with Lead II waveforms resulted in the most clinically meaningful clusters in the context of HFpEF and CKD. This methodology may prove useful in exploring ECG clustering applications outside of HFpEF as well.
期刊介绍:
JMIR Medical Informatics (JMI, ISSN 2291-9694) is a top-rated, tier A journal which focuses on clinical informatics, big data in health and health care, decision support for health professionals, electronic health records, ehealth infrastructures and implementation. It has a focus on applied, translational research, with a broad readership including clinicians, CIOs, engineers, industry and health informatics professionals.
Published by JMIR Publications, publisher of the Journal of Medical Internet Research (JMIR), the leading eHealth/mHealth journal (Impact Factor 2016: 5.175), JMIR Med Inform has a slightly different scope (emphasizing more on applications for clinicians and health professionals rather than consumers/citizens, which is the focus of JMIR), publishes even faster, and also allows papers which are more technical or more formative than what would be published in the Journal of Medical Internet Research.