Development and validation of a machine learning model for online predicting the risk of in heart failure: based on the routine blood test and their derived parameters.

IF 2.8 3区 医学 Q2 CARDIAC & CARDIOVASCULAR SYSTEMS
Frontiers in Cardiovascular Medicine Pub Date : 2025-03-17 eCollection Date: 2025-01-01 DOI:10.3389/fcvm.2025.1539966
Jianchen Pu, Yimin Yao, Xiaochun Wang
{"title":"Development and validation of a machine learning model for online predicting the risk of in heart failure: based on the routine blood test and their derived parameters.","authors":"Jianchen Pu, Yimin Yao, Xiaochun Wang","doi":"10.3389/fcvm.2025.1539966","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Heart failure (HF), a core component of cardiovascular diseases, is characterized by high morbidity and mortality worldwide. By collecting and analyzing routine blood data, machine learning models were built to identify the patterns of changes in blood indicators related to HF.</p><p><strong>Methods: </strong>We conducted a statistical analysis of routine blood data from 226 patients who visited Zhejiang Provincial Hospital of Traditional Chinese Medicine (Hubin) between May 1, 2024, and June 30, 2024. The patients were divided into an experimental group (HF patients) and a normal control group. Additionally, 211 patients from the Qiantang and Xixi centers formed an independent external validation cohort. This study used both univariate and multivariate analyses to identify the risk factors associated with HF. Variables associated with HF were selected using LASSO regression analysis. In addition, eight different machine learning algorithms were applied for prediction, and the prediction performances of these algorithms were comprehensively evaluated using the receiver operating characteristic curve, area under the curve (AUC), calibration curve analysis, and decision curve analysis and confusion matrix.</p><p><strong>Conclusions: </strong>Using LASSO regression analysis, leukocyte, neutrophil, red blood cell, hemoglobin, platelet, and monocyte-to-lymphocyte ratios were identified as risk factors for HF. Among the evaluated models, the random forest model exhibited the best performance. In the validation cohort, the area under the curve (AUC) of the model was 0.948, while that of the test cohort was 1.000. The calibration curve revealed good agreement between the actual and predicted probabilities, whereas the decision curve showed the significant clinical application of the model. Additionally, the AUC of the model in the external independent test cohort was 0.945.</p><p><strong>Discussion: </strong>We used an online predictive tool to develop a predictive machine-learning model. The main purpose of this model was to predict the probability of developing HF in the future. This prediction can provide strong support and references for clinicians when making decisions. This online forecasting tool not only processes a large amount of data but also continuously optimizes and adjusts the accuracy of the model according to the latest medical research and clinical data. We hope to identify high-risk patients for early intervention to reduce the incidence of HF and improve their quality of life.</p>","PeriodicalId":12414,"journal":{"name":"Frontiers in Cardiovascular Medicine","volume":"12 ","pages":"1539966"},"PeriodicalIF":2.8000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11955618/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Cardiovascular Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fcvm.2025.1539966","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Heart failure (HF), a core component of cardiovascular diseases, is characterized by high morbidity and mortality worldwide. By collecting and analyzing routine blood data, machine learning models were built to identify the patterns of changes in blood indicators related to HF.

Methods: We conducted a statistical analysis of routine blood data from 226 patients who visited Zhejiang Provincial Hospital of Traditional Chinese Medicine (Hubin) between May 1, 2024, and June 30, 2024. The patients were divided into an experimental group (HF patients) and a normal control group. Additionally, 211 patients from the Qiantang and Xixi centers formed an independent external validation cohort. This study used both univariate and multivariate analyses to identify the risk factors associated with HF. Variables associated with HF were selected using LASSO regression analysis. In addition, eight different machine learning algorithms were applied for prediction, and the prediction performances of these algorithms were comprehensively evaluated using the receiver operating characteristic curve, area under the curve (AUC), calibration curve analysis, and decision curve analysis and confusion matrix.

Conclusions: Using LASSO regression analysis, leukocyte, neutrophil, red blood cell, hemoglobin, platelet, and monocyte-to-lymphocyte ratios were identified as risk factors for HF. Among the evaluated models, the random forest model exhibited the best performance. In the validation cohort, the area under the curve (AUC) of the model was 0.948, while that of the test cohort was 1.000. The calibration curve revealed good agreement between the actual and predicted probabilities, whereas the decision curve showed the significant clinical application of the model. Additionally, the AUC of the model in the external independent test cohort was 0.945.

Discussion: We used an online predictive tool to develop a predictive machine-learning model. The main purpose of this model was to predict the probability of developing HF in the future. This prediction can provide strong support and references for clinicians when making decisions. This online forecasting tool not only processes a large amount of data but also continuously optimizes and adjusts the accuracy of the model according to the latest medical research and clinical data. We hope to identify high-risk patients for early intervention to reduce the incidence of HF and improve their quality of life.

求助全文
约1分钟内获得全文 求助全文
来源期刊
Frontiers in Cardiovascular Medicine
Frontiers in Cardiovascular Medicine Medicine-Cardiology and Cardiovascular Medicine
CiteScore
3.80
自引率
11.10%
发文量
3529
审稿时长
14 weeks
期刊介绍: Frontiers? Which frontiers? Where exactly are the frontiers of cardiovascular medicine? And who should be defining these frontiers? At Frontiers in Cardiovascular Medicine we believe it is worth being curious to foresee and explore beyond the current frontiers. In other words, we would like, through the articles published by our community journal Frontiers in Cardiovascular Medicine, to anticipate the future of cardiovascular medicine, and thus better prevent cardiovascular disorders and improve therapeutic options and outcomes of our patients.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信