A Machine Learning Pipeline for Mortality Prediction in the ICU

Y. Sun, Yi‐Hui Zhou
{"title":"A Machine Learning Pipeline for Mortality Prediction in the ICU","authors":"Y. Sun, Yi‐Hui Zhou","doi":"10.29337/ijdh.44","DOIUrl":null,"url":null,"abstract":"Mortality risk prediction for patients admitted into the intensive care unit (ICU) is a crucial and challenging task, so that clinicians are able to respond with timely and appropriate clinical intervention. This becomes more urgent under the background of COVID-19 as a global pandemic. In recent years, electronic health records (EHR) have been widely adopted, and have the potential to greatly improve clinical services and diagnostics. However, the large proportion of missing data in EHR poses challenges that may reduce the accuracy of prediction methods. We propose a cohort study that builds a pipeline that extracts ICD-9 codes and laboratory tests from public available electronic ICU databases, and improve the in-hospital mortality prediction accuracy using a combination of neural network missing data imputation approach and decision tree based outcome prediction algorithm. We show the proposed approach achieves a higher area under the ROC curve, ranging from 0.88-0.98, compared with other well-known machine learning methods applied to similar target population. It also offers clinical interpretations through variable selection. Our analysis also shows that mortality prediction for neonates was more challenging than for adults, and that prediction accuracy decreases as patients stayed longer in the ICU.","PeriodicalId":372681,"journal":{"name":"International Journal of Digital Health","volume":"190 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Digital Health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29337/ijdh.44","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Mortality risk prediction for patients admitted into the intensive care unit (ICU) is a crucial and challenging task, so that clinicians are able to respond with timely and appropriate clinical intervention. This becomes more urgent under the background of COVID-19 as a global pandemic. In recent years, electronic health records (EHR) have been widely adopted, and have the potential to greatly improve clinical services and diagnostics. However, the large proportion of missing data in EHR poses challenges that may reduce the accuracy of prediction methods. We propose a cohort study that builds a pipeline that extracts ICD-9 codes and laboratory tests from public available electronic ICU databases, and improve the in-hospital mortality prediction accuracy using a combination of neural network missing data imputation approach and decision tree based outcome prediction algorithm. We show the proposed approach achieves a higher area under the ROC curve, ranging from 0.88-0.98, compared with other well-known machine learning methods applied to similar target population. It also offers clinical interpretations through variable selection. Our analysis also shows that mortality prediction for neonates was more challenging than for adults, and that prediction accuracy decreases as patients stayed longer in the ICU.
ICU死亡率预测的机器学习管道
重症监护病房(ICU)患者的死亡风险预测是一项至关重要且具有挑战性的任务,以便临床医生能够及时采取适当的临床干预措施。在COVID-19成为全球大流行的背景下,这一点变得更加紧迫。近年来,电子健康记录(EHR)已被广泛采用,并有可能大大改善临床服务和诊断。然而,电子病历中大量缺失的数据可能会降低预测方法的准确性。我们提出了一项队列研究,建立一个从公共电子ICU数据库中提取ICD-9代码和实验室测试的管道,并使用神经网络缺失数据插入方法和基于决策树的结果预测算法相结合来提高院内死亡率预测的准确性。我们表明,与应用于类似目标人群的其他知名机器学习方法相比,所提出的方法在ROC曲线下实现了更高的面积,范围为0.88-0.98。它还通过变量选择提供临床解释。我们的分析还表明,预测新生儿的死亡率比预测成人的死亡率更具挑战性,而且随着患者在ICU的时间延长,预测的准确性也会降低。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信