使用堆叠集合模型预测心力衰竭合并心房颤动患者的住院死亡率:重症监护医疗信息市场(MIMIC-IV)的分析

IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS
Panpan Chen, Junhua Sun, Yingjie Chu, Yujie Zhao
{"title":"使用堆叠集合模型预测心力衰竭合并心房颤动患者的住院死亡率:重症监护医疗信息市场(MIMIC-IV)的分析","authors":"Panpan Chen, Junhua Sun, Yingjie Chu, Yujie Zhao","doi":"10.1186/s12911-024-02829-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Heart failure (HF) and atrial fibrillation (AF) usually coexist and are associated with a poorer prognosis. This study aimed to develop a model to predict in-hospital mortality in patients with HF combined with AF.</p><p><strong>Methods: </strong>Patients with HF and AF were obtained from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database from 2008 to 2019. Feature selection was based on the Mann-Whitney U test and the least absolute shrinkage and selection operator (LASSO) regression model. Random Forest, eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LGBM), K-Nearest Neighbor (KNN) models, and their stacked model (the stacking ensemble model) were established. The area under of the curve (AUC) with 95% confidence interval (CI), sensitivity, specificity, as well as accuracy were applied to assess the performance of the predictive models.</p><p><strong>Results: </strong>A total of 5,998 patients with HF combined with AF were included, of which 4,198 patients were assigned to the training set and 1,800 to the testing set (7:3). Among these 4,198 patients, 624 (14.86%) died in-hospital and 3,574 (85.14%) survived. Twenty-two features were used to construct the predictive model. Among these four single models, the AUC was 0.747 (95%CI: 0.717-0.777) for the Random Forest model, 0.755 (95%CI: 0.725-0.785) for the XGBoost model, 0.754 (95%CI: 0.724-0.784) for the LGBM model, and 0.746 (95%CI: 0.716-0.776) for the KNN model in the testing set. The stacking ensemble model had the highest AUC compared to the four single models, with AUCs of 0.837 (95%CI: 0.821-0.852) and 0.768 (95%CI: 0.740-0.796) for the training set and testing set, respectively.</p><p><strong>Conclusion: </strong>The stacking ensemble model showed a good predictive effect in predicting in-hospital mortality in patients with HF combined with AF and may provide clinicians with a reference tool for early identification of mortality risk.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"24 1","pages":"402"},"PeriodicalIF":3.3000,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11667998/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predicting in-hospital mortality in patients with heart failure combined with atrial fibrillation using stacking ensemble model: an analysis of the medical information mart for intensive care IV (MIMIC-IV).\",\"authors\":\"Panpan Chen, Junhua Sun, Yingjie Chu, Yujie Zhao\",\"doi\":\"10.1186/s12911-024-02829-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Heart failure (HF) and atrial fibrillation (AF) usually coexist and are associated with a poorer prognosis. This study aimed to develop a model to predict in-hospital mortality in patients with HF combined with AF.</p><p><strong>Methods: </strong>Patients with HF and AF were obtained from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database from 2008 to 2019. Feature selection was based on the Mann-Whitney U test and the least absolute shrinkage and selection operator (LASSO) regression model. Random Forest, eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LGBM), K-Nearest Neighbor (KNN) models, and their stacked model (the stacking ensemble model) were established. The area under of the curve (AUC) with 95% confidence interval (CI), sensitivity, specificity, as well as accuracy were applied to assess the performance of the predictive models.</p><p><strong>Results: </strong>A total of 5,998 patients with HF combined with AF were included, of which 4,198 patients were assigned to the training set and 1,800 to the testing set (7:3). Among these 4,198 patients, 624 (14.86%) died in-hospital and 3,574 (85.14%) survived. Twenty-two features were used to construct the predictive model. Among these four single models, the AUC was 0.747 (95%CI: 0.717-0.777) for the Random Forest model, 0.755 (95%CI: 0.725-0.785) for the XGBoost model, 0.754 (95%CI: 0.724-0.784) for the LGBM model, and 0.746 (95%CI: 0.716-0.776) for the KNN model in the testing set. The stacking ensemble model had the highest AUC compared to the four single models, with AUCs of 0.837 (95%CI: 0.821-0.852) and 0.768 (95%CI: 0.740-0.796) for the training set and testing set, respectively.</p><p><strong>Conclusion: </strong>The stacking ensemble model showed a good predictive effect in predicting in-hospital mortality in patients with HF combined with AF and may provide clinicians with a reference tool for early identification of mortality risk.</p>\",\"PeriodicalId\":9340,\"journal\":{\"name\":\"BMC Medical Informatics and Decision Making\",\"volume\":\"24 1\",\"pages\":\"402\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2024-12-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11667998/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Informatics and Decision Making\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12911-024-02829-0\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-024-02829-0","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

摘要

背景:心力衰竭(HF)和心房颤动(AF)通常共存,且预后较差。本研究旨在建立一个预测心衰合并房颤患者住院死亡率的模型。方法:从重症监护医学信息市场IV (MIMIC-IV)数据库中获取2008 - 2019年心衰合并房颤患者。特征选择基于Mann-Whitney U检验和最小绝对收缩和选择算子(LASSO)回归模型。建立了随机森林、极限梯度增强(XGBoost)、光梯度增强机(LGBM)、k -近邻(KNN)模型及其堆叠模型(堆叠集成模型)。采用曲线下面积(AUC)和95%置信区间(CI)、敏感性、特异性和准确性来评估预测模型的性能。结果:共纳入5998例HF合并AF患者,其中4198例分配到训练集,1800例分配到测试集(7:3)。4198例患者中,住院死亡624例(14.86%),存活3574例(85.14%)。22个特征被用来构建预测模型。在这四个单一模型中,随机森林模型的AUC为0.747 (95%CI: 0.717-0.777), XGBoost模型的AUC为0.755 (95%CI: 0.725-0.785), LGBM模型的AUC为0.754 (95%CI: 0.724-0.784), KNN模型的AUC为0.746 (95%CI: 0.716-0.776)。与4个单一模型相比,叠加集成模型的AUC最高,训练集和测试集的AUC分别为0.837 (95%CI: 0.821-0.852)和0.768 (95%CI: 0.740-0.796)。结论:叠加集合模型对心衰合并房颤患者住院死亡率有较好的预测效果,可为临床医生早期识别死亡风险提供参考工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predicting in-hospital mortality in patients with heart failure combined with atrial fibrillation using stacking ensemble model: an analysis of the medical information mart for intensive care IV (MIMIC-IV).

Background: Heart failure (HF) and atrial fibrillation (AF) usually coexist and are associated with a poorer prognosis. This study aimed to develop a model to predict in-hospital mortality in patients with HF combined with AF.

Methods: Patients with HF and AF were obtained from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database from 2008 to 2019. Feature selection was based on the Mann-Whitney U test and the least absolute shrinkage and selection operator (LASSO) regression model. Random Forest, eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LGBM), K-Nearest Neighbor (KNN) models, and their stacked model (the stacking ensemble model) were established. The area under of the curve (AUC) with 95% confidence interval (CI), sensitivity, specificity, as well as accuracy were applied to assess the performance of the predictive models.

Results: A total of 5,998 patients with HF combined with AF were included, of which 4,198 patients were assigned to the training set and 1,800 to the testing set (7:3). Among these 4,198 patients, 624 (14.86%) died in-hospital and 3,574 (85.14%) survived. Twenty-two features were used to construct the predictive model. Among these four single models, the AUC was 0.747 (95%CI: 0.717-0.777) for the Random Forest model, 0.755 (95%CI: 0.725-0.785) for the XGBoost model, 0.754 (95%CI: 0.724-0.784) for the LGBM model, and 0.746 (95%CI: 0.716-0.776) for the KNN model in the testing set. The stacking ensemble model had the highest AUC compared to the four single models, with AUCs of 0.837 (95%CI: 0.821-0.852) and 0.768 (95%CI: 0.740-0.796) for the training set and testing set, respectively.

Conclusion: The stacking ensemble model showed a good predictive effect in predicting in-hospital mortality in patients with HF combined with AF and may provide clinicians with a reference tool for early identification of mortality risk.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信