M. Hasan, S. Hamdan, S. Poudel, J. Vargas, K. Poudel
{"title":"基于MIMIC-III数据库的机器学习预测重症监护病房(ICU)住院时间","authors":"M. Hasan, S. Hamdan, S. Poudel, J. Vargas, K. Poudel","doi":"10.1109/CAI54212.2023.00142","DOIUrl":null,"url":null,"abstract":"The length-of-stay (LOS) is critical for patient care and accommodation in the intensive care unit (ICU). In this work, we developed a framework to predict the LOS using the Medical Information Mart for Intensive Care (MIMIC-III) database. We extracted six features from individual patients and submitted them to the regressors model and examined how well these features could predict LOS. We considered four prediction regimes; extreme gradient boosting (XGBoost), support vector regressor, random forest, and voting regressor. Our analysis reveals that XGBoost yields the best result among other regressors with R2 0.86 and root mean square error (RMSE) 1.2. Remarkably, our results show that ICD9 (9th International classification of diseases code), saline intake per hour, and drug rates are the top three critical features for predicting the LOS.","PeriodicalId":129324,"journal":{"name":"2023 IEEE Conference on Artificial Intelligence (CAI)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction of Length-of-stay at Intensive Care Unit (ICU) Using Machine Learning based on MIMIC-III Database\",\"authors\":\"M. Hasan, S. Hamdan, S. Poudel, J. Vargas, K. Poudel\",\"doi\":\"10.1109/CAI54212.2023.00142\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The length-of-stay (LOS) is critical for patient care and accommodation in the intensive care unit (ICU). In this work, we developed a framework to predict the LOS using the Medical Information Mart for Intensive Care (MIMIC-III) database. We extracted six features from individual patients and submitted them to the regressors model and examined how well these features could predict LOS. We considered four prediction regimes; extreme gradient boosting (XGBoost), support vector regressor, random forest, and voting regressor. Our analysis reveals that XGBoost yields the best result among other regressors with R2 0.86 and root mean square error (RMSE) 1.2. Remarkably, our results show that ICD9 (9th International classification of diseases code), saline intake per hour, and drug rates are the top three critical features for predicting the LOS.\",\"PeriodicalId\":129324,\"journal\":{\"name\":\"2023 IEEE Conference on Artificial Intelligence (CAI)\",\"volume\":\"41 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE Conference on Artificial Intelligence (CAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CAI54212.2023.00142\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE Conference on Artificial Intelligence (CAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CAI54212.2023.00142","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Prediction of Length-of-stay at Intensive Care Unit (ICU) Using Machine Learning based on MIMIC-III Database
The length-of-stay (LOS) is critical for patient care and accommodation in the intensive care unit (ICU). In this work, we developed a framework to predict the LOS using the Medical Information Mart for Intensive Care (MIMIC-III) database. We extracted six features from individual patients and submitted them to the regressors model and examined how well these features could predict LOS. We considered four prediction regimes; extreme gradient boosting (XGBoost), support vector regressor, random forest, and voting regressor. Our analysis reveals that XGBoost yields the best result among other regressors with R2 0.86 and root mean square error (RMSE) 1.2. Remarkably, our results show that ICD9 (9th International classification of diseases code), saline intake per hour, and drug rates are the top three critical features for predicting the LOS.