急性呼吸衰竭的有效死亡率预测:使用MIMIC数据库的资源受限机器学习方法

IF 13.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Muhammad Talha Khan, Maryam Gulzar, Arshad Ali, Aamir Wali, Rida Amir
{"title":"急性呼吸衰竭的有效死亡率预测:使用MIMIC数据库的资源受限机器学习方法","authors":"Muhammad Talha Khan,&nbsp;Maryam Gulzar,&nbsp;Arshad Ali,&nbsp;Aamir Wali,&nbsp;Rida Amir","doi":"10.1007/s10462-025-11387-4","DOIUrl":null,"url":null,"abstract":"<div><p>Accurate prediction of mortality in Acute Respiratory Failure (ARF) patients at intensive care unit (ICU) admission can improve patient outcomes and resource management. However, ICU environments often face challenges like missing test results and limited resources. This study presents a complete pipeline for predicting ARF mortality, focusing on effective feature extraction, data imputation, and class imbalance handling. Key preprocessing steps include iterative imputation for missing data and upsampling techniques like SMOTE and deep learning-based generators. Using the MIMIC-III and MIMIC-IV databases, logistic regression, random forest, extreme gradient boosting, and neural networks were tested. Findings demonstrate that neural networks, along with ensemble methods, achieved high sensitivity and <span>\\(\\hbox {F}_\\beta \\)</span> scores, which are essential for accurate mortality predictions. Notably, when class distribution was balanced, the models performed equally well on specificity and sensitivity. SMOTE proved particularly effective in addressing class imbalance, suggesting that advanced upsampling methods like GANs could further enhance prediction accuracy without reducing dataset size. </p><h3>Graphical abstract</h3><p>This graphical abstract of the work that illustrates that a patient is admitted to the hospital, admission to the ICU is determined, the test results of the first 24 hours are collected, missing parameters are imputed, the data are normalized and a machine learning model is applied to predict mortality outcomes.</p><div><figure><div><div><picture><source><img></source></picture></div><div><p>Graphical abstract illustrating the process</p></div></div></figure></div></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 12","pages":""},"PeriodicalIF":13.9000,"publicationDate":"2025-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11387-4.pdf","citationCount":"0","resultStr":"{\"title\":\"Efficient mortality prediction for acute respiratory failure: a resource-constrained machine learning approach using MIMIC databases\",\"authors\":\"Muhammad Talha Khan,&nbsp;Maryam Gulzar,&nbsp;Arshad Ali,&nbsp;Aamir Wali,&nbsp;Rida Amir\",\"doi\":\"10.1007/s10462-025-11387-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Accurate prediction of mortality in Acute Respiratory Failure (ARF) patients at intensive care unit (ICU) admission can improve patient outcomes and resource management. However, ICU environments often face challenges like missing test results and limited resources. This study presents a complete pipeline for predicting ARF mortality, focusing on effective feature extraction, data imputation, and class imbalance handling. Key preprocessing steps include iterative imputation for missing data and upsampling techniques like SMOTE and deep learning-based generators. Using the MIMIC-III and MIMIC-IV databases, logistic regression, random forest, extreme gradient boosting, and neural networks were tested. Findings demonstrate that neural networks, along with ensemble methods, achieved high sensitivity and <span>\\\\(\\\\hbox {F}_\\\\beta \\\\)</span> scores, which are essential for accurate mortality predictions. Notably, when class distribution was balanced, the models performed equally well on specificity and sensitivity. SMOTE proved particularly effective in addressing class imbalance, suggesting that advanced upsampling methods like GANs could further enhance prediction accuracy without reducing dataset size. </p><h3>Graphical abstract</h3><p>This graphical abstract of the work that illustrates that a patient is admitted to the hospital, admission to the ICU is determined, the test results of the first 24 hours are collected, missing parameters are imputed, the data are normalized and a machine learning model is applied to predict mortality outcomes.</p><div><figure><div><div><picture><source><img></source></picture></div><div><p>Graphical abstract illustrating the process</p></div></div></figure></div></div>\",\"PeriodicalId\":8449,\"journal\":{\"name\":\"Artificial Intelligence Review\",\"volume\":\"58 12\",\"pages\":\"\"},\"PeriodicalIF\":13.9000,\"publicationDate\":\"2025-10-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s10462-025-11387-4.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial Intelligence Review\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10462-025-11387-4\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-025-11387-4","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

在重症监护病房(ICU)入院时准确预测急性呼吸衰竭(ARF)患者的死亡率可以改善患者的预后和资源管理。然而,ICU环境经常面临诸如缺少测试结果和资源有限等挑战。本研究提出了一个完整的预测ARF死亡率的管道,重点是有效的特征提取,数据输入和类不平衡处理。关键的预处理步骤包括缺失数据的迭代输入和上采样技术,如SMOTE和基于深度学习的生成器。使用MIMIC-III和MIMIC-IV数据库,对逻辑回归、随机森林、极端梯度增强和神经网络进行了测试。研究结果表明,神经网络与集成方法一起实现了高灵敏度和\(\hbox {F}_\beta \)分数,这对于准确预测死亡率至关重要。值得注意的是,当类别分布平衡时,模型在特异性和敏感性上表现同样良好。SMOTE被证明在解决类不平衡方面特别有效,这表明像gan这样的高级上采样方法可以在不减少数据集大小的情况下进一步提高预测精度。该工作的图形摘要说明了患者入院,确定ICU住院,收集前24小时的测试结果,输入缺失参数,将数据归一化并应用机器学习模型来预测死亡率结果。图解摘要说明过程
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Efficient mortality prediction for acute respiratory failure: a resource-constrained machine learning approach using MIMIC databases

Accurate prediction of mortality in Acute Respiratory Failure (ARF) patients at intensive care unit (ICU) admission can improve patient outcomes and resource management. However, ICU environments often face challenges like missing test results and limited resources. This study presents a complete pipeline for predicting ARF mortality, focusing on effective feature extraction, data imputation, and class imbalance handling. Key preprocessing steps include iterative imputation for missing data and upsampling techniques like SMOTE and deep learning-based generators. Using the MIMIC-III and MIMIC-IV databases, logistic regression, random forest, extreme gradient boosting, and neural networks were tested. Findings demonstrate that neural networks, along with ensemble methods, achieved high sensitivity and \(\hbox {F}_\beta \) scores, which are essential for accurate mortality predictions. Notably, when class distribution was balanced, the models performed equally well on specificity and sensitivity. SMOTE proved particularly effective in addressing class imbalance, suggesting that advanced upsampling methods like GANs could further enhance prediction accuracy without reducing dataset size.

Graphical abstract

This graphical abstract of the work that illustrates that a patient is admitted to the hospital, admission to the ICU is determined, the test results of the first 24 hours are collected, missing parameters are imputed, the data are normalized and a machine learning model is applied to predict mortality outcomes.

Graphical abstract illustrating the process

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Artificial Intelligence Review
Artificial Intelligence Review 工程技术-计算机:人工智能
CiteScore
22.00
自引率
3.30%
发文量
194
审稿时长
5.3 months
期刊介绍: Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信