Development and validation of a machine learning model for online predicting the risk of in heart failure: based on the routine blood test and their derived parameters.

IF 2.8 3区医学 Q2 CARDIAC & CARDIOVASCULAR SYSTEMS

Frontiers in Cardiovascular Medicine Pub Date : 2025-03-17 eCollection Date: 2025-01-01 DOI:10.3389/fcvm.2025.1539966

Jianchen Pu, Yimin Yao, Xiaochun Wang

{"title":"Development and validation of a machine learning model for online predicting the risk of in heart failure: based on the routine blood test and their derived parameters.","authors":"Jianchen Pu, Yimin Yao, Xiaochun Wang","doi":"10.3389/fcvm.2025.1539966","DOIUrl":null,"url":null,"abstract":"Background: Heart failure (HF), a core component of cardiovascular diseases, is characterized by high morbidity and mortality worldwide. By collecting and analyzing routine blood data, machine learning models were built to identify the patterns of changes in blood indicators related to HF.Methods: We conducted a statistical analysis of routine blood data from 226 patients who visited Zhejiang Provincial Hospital of Traditional Chinese Medicine (Hubin) between May 1, 2024, and June 30, 2024. The patients were divided into an experimental group (HF patients) and a normal control group. Additionally, 211 patients from the Qiantang and Xixi centers formed an independent external validation cohort. This study used both univariate and multivariate analyses to identify the risk factors associated with HF. Variables associated with HF were selected using LASSO regression analysis. In addition, eight different machine learning algorithms were applied for prediction, and the prediction performances of these algorithms were comprehensively evaluated using the receiver operating characteristic curve, area under the curve (AUC), calibration curve analysis, and decision curve analysis and confusion matrix.Conclusions: Using LASSO regression analysis, leukocyte, neutrophil, red blood cell, hemoglobin, platelet, and monocyte-to-lymphocyte ratios were identified as risk factors for HF. Among the evaluated models, the random forest model exhibited the best performance. In the validation cohort, the area under the curve (AUC) of the model was 0.948, while that of the test cohort was 1.000. The calibration curve revealed good agreement between the actual and predicted probabilities, whereas the decision curve showed the significant clinical application of the model. Additionally, the AUC of the model in the external independent test cohort was 0.945.Discussion: We used an online predictive tool to develop a predictive machine-learning model. The main purpose of this model was to predict the probability of developing HF in the future. This prediction can provide strong support and references for clinicians when making decisions. This online forecasting tool not only processes a large amount of data but also continuously optimizes and adjusts the accuracy of the model according to the latest medical research and clinical data. We hope to identify high-risk patients for early intervention to reduce the incidence of HF and improve their quality of life.","PeriodicalId":12414,"journal":{"name":"Frontiers in Cardiovascular Medicine","volume":"12 ","pages":"1539966"},"PeriodicalIF":2.8000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11955618/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Cardiovascular Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fcvm.2025.1539966","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Heart failure (HF), a core component of cardiovascular diseases, is characterized by high morbidity and mortality worldwide. By collecting and analyzing routine blood data, machine learning models were built to identify the patterns of changes in blood indicators related to HF.

Methods: We conducted a statistical analysis of routine blood data from 226 patients who visited Zhejiang Provincial Hospital of Traditional Chinese Medicine (Hubin) between May 1, 2024, and June 30, 2024. The patients were divided into an experimental group (HF patients) and a normal control group. Additionally, 211 patients from the Qiantang and Xixi centers formed an independent external validation cohort. This study used both univariate and multivariate analyses to identify the risk factors associated with HF. Variables associated with HF were selected using LASSO regression analysis. In addition, eight different machine learning algorithms were applied for prediction, and the prediction performances of these algorithms were comprehensively evaluated using the receiver operating characteristic curve, area under the curve (AUC), calibration curve analysis, and decision curve analysis and confusion matrix.

Conclusions: Using LASSO regression analysis, leukocyte, neutrophil, red blood cell, hemoglobin, platelet, and monocyte-to-lymphocyte ratios were identified as risk factors for HF. Among the evaluated models, the random forest model exhibited the best performance. In the validation cohort, the area under the curve (AUC) of the model was 0.948, while that of the test cohort was 1.000. The calibration curve revealed good agreement between the actual and predicted probabilities, whereas the decision curve showed the significant clinical application of the model. Additionally, the AUC of the model in the external independent test cohort was 0.945.

Discussion: We used an online predictive tool to develop a predictive machine-learning model. The main purpose of this model was to predict the probability of developing HF in the future. This prediction can provide strong support and references for clinicians when making decisions. This online forecasting tool not only processes a large amount of data but also continuously optimizes and adjusts the accuracy of the model according to the latest medical research and clinical data. We hope to identify high-risk patients for early intervention to reduce the incidence of HF and improve their quality of life.

查看原文本刊更多论文

开发和验证用于在线预测心力衰竭风险的机器学习模型：基于常规血液检查及其衍生参数。

背景：心力衰竭（HF）是心血管疾病的核心组成部分，在世界范围内具有高发病率和死亡率的特点。通过收集和分析血常规数据，建立机器学习模型，识别HF相关血液指标的变化模式。方法：对2024年5月1日至6月30日在浙江省中医院（湖滨）就诊的226例患者的血常规资料进行统计分析。将患者分为实验组（HF患者）和正常对照组。此外，钱塘和西溪中心的211名患者组成了一个独立的外部验证队列。本研究采用单因素和多因素分析来确定与心衰相关的危险因素。采用LASSO回归分析选择与HF相关的变量。此外，采用8种不同的机器学习算法进行预测，并利用受试者工作特征曲线、曲线下面积（AUC）、校准曲线分析、决策曲线分析和混淆矩阵对这些算法的预测性能进行综合评价。结论：使用LASSO回归分析，白细胞、中性粒细胞、红细胞、血红蛋白、血小板和单核细胞/淋巴细胞比率被确定为HF的危险因素。在评价模型中，随机森林模型表现出最好的性能。在验证队列中，模型的曲线下面积（AUC）为0.948，而检验队列的AUC为1.000。校正曲线显示了实际概率与预测概率之间的良好一致性，而决策曲线显示了模型的显著临床应用。模型在外部独立检验队列中的AUC为0.945。讨论：我们使用在线预测工具来开发预测机器学习模型。该模型的主要目的是预测未来发生HF的概率。该预测可为临床医生的决策提供有力的支持和参考。该在线预测工具不仅处理大量数据，而且根据最新的医学研究和临床数据不断优化和调整模型的准确性。我们希望发现高危患者进行早期干预，降低心衰发生率，提高其生活质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Frontiers in Cardiovascular Medicine Medicine-Cardiology and Cardiovascular Medicine

CiteScore

3.80

自引率

11.10%

发文量

3529

审稿时长

14 weeks

期刊介绍： Frontiers? Which frontiers? Where exactly are the frontiers of cardiovascular medicine? And who should be defining these frontiers? At Frontiers in Cardiovascular Medicine we believe it is worth being curious to foresee and explore beyond the current frontiers. In other words, we would like, through the articles published by our community journal Frontiers in Cardiovascular Medicine, to anticipate the future of cardiovascular medicine, and thus better prevent cardiovascular disorders and improve therapeutic options and outcomes of our patients.