Predicting in-hospital mortality in patients with heart failure combined with atrial fibrillation using stacking ensemble model: an analysis of the medical information mart for intensive care IV (MIMIC-IV).

IF 3.3 3区医学 Q2 MEDICAL INFORMATICS

BMC Medical Informatics and Decision Making Pub Date : 2024-12-23 DOI:10.1186/s12911-024-02829-0

Panpan Chen, Junhua Sun, Yingjie Chu, Yujie Zhao

{"title":"Predicting in-hospital mortality in patients with heart failure combined with atrial fibrillation using stacking ensemble model: an analysis of the medical information mart for intensive care IV (MIMIC-IV).","authors":"Panpan Chen, Junhua Sun, Yingjie Chu, Yujie Zhao","doi":"10.1186/s12911-024-02829-0","DOIUrl":null,"url":null,"abstract":"Background: Heart failure (HF) and atrial fibrillation (AF) usually coexist and are associated with a poorer prognosis. This study aimed to develop a model to predict in-hospital mortality in patients with HF combined with AF.Methods: Patients with HF and AF were obtained from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database from 2008 to 2019. Feature selection was based on the Mann-Whitney U test and the least absolute shrinkage and selection operator (LASSO) regression model. Random Forest, eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LGBM), K-Nearest Neighbor (KNN) models, and their stacked model (the stacking ensemble model) were established. The area under of the curve (AUC) with 95% confidence interval (CI), sensitivity, specificity, as well as accuracy were applied to assess the performance of the predictive models.Results: A total of 5,998 patients with HF combined with AF were included, of which 4,198 patients were assigned to the training set and 1,800 to the testing set (7:3). Among these 4,198 patients, 624 (14.86%) died in-hospital and 3,574 (85.14%) survived. Twenty-two features were used to construct the predictive model. Among these four single models, the AUC was 0.747 (95%CI: 0.717-0.777) for the Random Forest model, 0.755 (95%CI: 0.725-0.785) for the XGBoost model, 0.754 (95%CI: 0.724-0.784) for the LGBM model, and 0.746 (95%CI: 0.716-0.776) for the KNN model in the testing set. The stacking ensemble model had the highest AUC compared to the four single models, with AUCs of 0.837 (95%CI: 0.821-0.852) and 0.768 (95%CI: 0.740-0.796) for the training set and testing set, respectively.Conclusion: The stacking ensemble model showed a good predictive effect in predicting in-hospital mortality in patients with HF combined with AF and may provide clinicians with a reference tool for early identification of mortality risk.","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"24 1","pages":"402"},"PeriodicalIF":3.3000,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11667998/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-024-02829-0","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Heart failure (HF) and atrial fibrillation (AF) usually coexist and are associated with a poorer prognosis. This study aimed to develop a model to predict in-hospital mortality in patients with HF combined with AF.

Methods: Patients with HF and AF were obtained from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database from 2008 to 2019. Feature selection was based on the Mann-Whitney U test and the least absolute shrinkage and selection operator (LASSO) regression model. Random Forest, eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LGBM), K-Nearest Neighbor (KNN) models, and their stacked model (the stacking ensemble model) were established. The area under of the curve (AUC) with 95% confidence interval (CI), sensitivity, specificity, as well as accuracy were applied to assess the performance of the predictive models.

Results: A total of 5,998 patients with HF combined with AF were included, of which 4,198 patients were assigned to the training set and 1,800 to the testing set (7:3). Among these 4,198 patients, 624 (14.86%) died in-hospital and 3,574 (85.14%) survived. Twenty-two features were used to construct the predictive model. Among these four single models, the AUC was 0.747 (95%CI: 0.717-0.777) for the Random Forest model, 0.755 (95%CI: 0.725-0.785) for the XGBoost model, 0.754 (95%CI: 0.724-0.784) for the LGBM model, and 0.746 (95%CI: 0.716-0.776) for the KNN model in the testing set. The stacking ensemble model had the highest AUC compared to the four single models, with AUCs of 0.837 (95%CI: 0.821-0.852) and 0.768 (95%CI: 0.740-0.796) for the training set and testing set, respectively.

Conclusion: The stacking ensemble model showed a good predictive effect in predicting in-hospital mortality in patients with HF combined with AF and may provide clinicians with a reference tool for early identification of mortality risk.

查看原文本刊更多论文

使用堆叠集合模型预测心力衰竭合并心房颤动患者的住院死亡率：重症监护医疗信息市场（MIMIC-IV）的分析

背景：心力衰竭（HF）和心房颤动（AF）通常共存，且预后较差。本研究旨在建立一个预测心衰合并房颤患者住院死亡率的模型。方法：从重症监护医学信息市场IV （MIMIC-IV）数据库中获取2008 - 2019年心衰合并房颤患者。特征选择基于Mann-Whitney U检验和最小绝对收缩和选择算子（LASSO）回归模型。建立了随机森林、极限梯度增强（XGBoost）、光梯度增强机（LGBM）、k -近邻（KNN）模型及其堆叠模型（堆叠集成模型）。采用曲线下面积（AUC）和95%置信区间（CI）、敏感性、特异性和准确性来评估预测模型的性能。结果：共纳入5998例HF合并AF患者，其中4198例分配到训练集，1800例分配到测试集（7:3）。4198例患者中，住院死亡624例（14.86%），存活3574例（85.14%）。22个特征被用来构建预测模型。在这四个单一模型中，随机森林模型的AUC为0.747 (95%CI: 0.717-0.777)， XGBoost模型的AUC为0.755 (95%CI: 0.725-0.785)， LGBM模型的AUC为0.754 (95%CI: 0.724-0.784)， KNN模型的AUC为0.746 （95%CI: 0.716-0.776）。与4个单一模型相比，叠加集成模型的AUC最高，训练集和测试集的AUC分别为0.837 （95%CI: 0.821-0.852）和0.768 （95%CI: 0.740-0.796）。结论：叠加集合模型对心衰合并房颤患者住院死亡率有较好的预测效果，可为临床医生早期识别死亡风险提供参考工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Medical Informatics and Decision Making 医学-医学：信息

CiteScore

7.20

自引率

5.70%

发文量

297

审稿时长

1 months

期刊介绍： BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.