Muhammad Talha Khan, Maryam Gulzar, Arshad Ali, Aamir Wali, Rida Amir
{"title":"急性呼吸衰竭的有效死亡率预测:使用MIMIC数据库的资源受限机器学习方法","authors":"Muhammad Talha Khan, Maryam Gulzar, Arshad Ali, Aamir Wali, Rida Amir","doi":"10.1007/s10462-025-11387-4","DOIUrl":null,"url":null,"abstract":"<div><p>Accurate prediction of mortality in Acute Respiratory Failure (ARF) patients at intensive care unit (ICU) admission can improve patient outcomes and resource management. However, ICU environments often face challenges like missing test results and limited resources. This study presents a complete pipeline for predicting ARF mortality, focusing on effective feature extraction, data imputation, and class imbalance handling. Key preprocessing steps include iterative imputation for missing data and upsampling techniques like SMOTE and deep learning-based generators. Using the MIMIC-III and MIMIC-IV databases, logistic regression, random forest, extreme gradient boosting, and neural networks were tested. Findings demonstrate that neural networks, along with ensemble methods, achieved high sensitivity and <span>\\(\\hbox {F}_\\beta \\)</span> scores, which are essential for accurate mortality predictions. Notably, when class distribution was balanced, the models performed equally well on specificity and sensitivity. SMOTE proved particularly effective in addressing class imbalance, suggesting that advanced upsampling methods like GANs could further enhance prediction accuracy without reducing dataset size. </p><h3>Graphical abstract</h3><p>This graphical abstract of the work that illustrates that a patient is admitted to the hospital, admission to the ICU is determined, the test results of the first 24 hours are collected, missing parameters are imputed, the data are normalized and a machine learning model is applied to predict mortality outcomes.</p><div><figure><div><div><picture><source><img></source></picture></div><div><p>Graphical abstract illustrating the process</p></div></div></figure></div></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 12","pages":""},"PeriodicalIF":13.9000,"publicationDate":"2025-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11387-4.pdf","citationCount":"0","resultStr":"{\"title\":\"Efficient mortality prediction for acute respiratory failure: a resource-constrained machine learning approach using MIMIC databases\",\"authors\":\"Muhammad Talha Khan, Maryam Gulzar, Arshad Ali, Aamir Wali, Rida Amir\",\"doi\":\"10.1007/s10462-025-11387-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Accurate prediction of mortality in Acute Respiratory Failure (ARF) patients at intensive care unit (ICU) admission can improve patient outcomes and resource management. However, ICU environments often face challenges like missing test results and limited resources. This study presents a complete pipeline for predicting ARF mortality, focusing on effective feature extraction, data imputation, and class imbalance handling. Key preprocessing steps include iterative imputation for missing data and upsampling techniques like SMOTE and deep learning-based generators. Using the MIMIC-III and MIMIC-IV databases, logistic regression, random forest, extreme gradient boosting, and neural networks were tested. Findings demonstrate that neural networks, along with ensemble methods, achieved high sensitivity and <span>\\\\(\\\\hbox {F}_\\\\beta \\\\)</span> scores, which are essential for accurate mortality predictions. Notably, when class distribution was balanced, the models performed equally well on specificity and sensitivity. SMOTE proved particularly effective in addressing class imbalance, suggesting that advanced upsampling methods like GANs could further enhance prediction accuracy without reducing dataset size. </p><h3>Graphical abstract</h3><p>This graphical abstract of the work that illustrates that a patient is admitted to the hospital, admission to the ICU is determined, the test results of the first 24 hours are collected, missing parameters are imputed, the data are normalized and a machine learning model is applied to predict mortality outcomes.</p><div><figure><div><div><picture><source><img></source></picture></div><div><p>Graphical abstract illustrating the process</p></div></div></figure></div></div>\",\"PeriodicalId\":8449,\"journal\":{\"name\":\"Artificial Intelligence Review\",\"volume\":\"58 12\",\"pages\":\"\"},\"PeriodicalIF\":13.9000,\"publicationDate\":\"2025-10-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s10462-025-11387-4.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial Intelligence Review\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10462-025-11387-4\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-025-11387-4","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Efficient mortality prediction for acute respiratory failure: a resource-constrained machine learning approach using MIMIC databases
Accurate prediction of mortality in Acute Respiratory Failure (ARF) patients at intensive care unit (ICU) admission can improve patient outcomes and resource management. However, ICU environments often face challenges like missing test results and limited resources. This study presents a complete pipeline for predicting ARF mortality, focusing on effective feature extraction, data imputation, and class imbalance handling. Key preprocessing steps include iterative imputation for missing data and upsampling techniques like SMOTE and deep learning-based generators. Using the MIMIC-III and MIMIC-IV databases, logistic regression, random forest, extreme gradient boosting, and neural networks were tested. Findings demonstrate that neural networks, along with ensemble methods, achieved high sensitivity and \(\hbox {F}_\beta \) scores, which are essential for accurate mortality predictions. Notably, when class distribution was balanced, the models performed equally well on specificity and sensitivity. SMOTE proved particularly effective in addressing class imbalance, suggesting that advanced upsampling methods like GANs could further enhance prediction accuracy without reducing dataset size.
Graphical abstract
This graphical abstract of the work that illustrates that a patient is admitted to the hospital, admission to the ICU is determined, the test results of the first 24 hours are collected, missing parameters are imputed, the data are normalized and a machine learning model is applied to predict mortality outcomes.
期刊介绍:
Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.