Relational Learning Improves Prediction of Mortality in COVID-19 in the Intensive Care Unit

IF 7.5 3区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Transactions on Big Data Pub Date : 2020-12-31 DOI:10.1109/TBDATA.2020.3048644

Tingyi Wanyan;Akhil Vaid;Jessica K De Freitas;Sulaiman Somani;Riccardo Miotto;Girish N. Nadkarni;Ariful Azad;Ying Ding;Benjamin S. Glicksberg

{"title":"Relational Learning Improves Prediction of Mortality in COVID-19 in the Intensive Care Unit","authors":"Tingyi Wanyan;Akhil Vaid;Jessica K De Freitas;Sulaiman Somani;Riccardo Miotto;Girish N. Nadkarni;Ariful Azad;Ying Ding;Benjamin S. Glicksberg","doi":"10.1109/TBDATA.2020.3048644","DOIUrl":null,"url":null,"abstract":"Traditional Machine Learning (ML) models have had limited success in predicting Coronoavirus-19 (COVID-19) outcomes using Electronic Health Record (EHR) data partially due to not effectively capturing the inter-connectivity patterns between various data modalities. In this work, we propose a novel framework that utilizes relational learning based on a heterogeneous graph model (HGM) for predicting mortality at different time windows in COVID-19 patients within the intensive care unit (ICU). We utilize the EHRs of one of the largest and most diverse patient populations across five hospitals in major health system in New York City. In our model, we use an LSTM for processing time varying patient data and apply our proposed relational learning strategy in the final output layer along with other static features. Here, we replace the traditional softmax layer with a Skip-Gram relational learning strategy to compare the similarity between a patient and outcome embedding representation. We demonstrate that the construction of a HGM can robustly learn the patterns classifying patient representations of outcomes through leveraging patterns within the embeddings of similar patients. Our experimental results show that our relational learning-based HGM model achieves higher area under the receiver operating characteristic curve (auROC) than both comparator models in all prediction time windows, with dramatic improvements to recall.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"7 1","pages":"38-44"},"PeriodicalIF":7.5000,"publicationDate":"2020-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TBDATA.2020.3048644","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Big Data","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/9311826/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 8

Abstract

Traditional Machine Learning (ML) models have had limited success in predicting Coronoavirus-19 (COVID-19) outcomes using Electronic Health Record (EHR) data partially due to not effectively capturing the inter-connectivity patterns between various data modalities. In this work, we propose a novel framework that utilizes relational learning based on a heterogeneous graph model (HGM) for predicting mortality at different time windows in COVID-19 patients within the intensive care unit (ICU). We utilize the EHRs of one of the largest and most diverse patient populations across five hospitals in major health system in New York City. In our model, we use an LSTM for processing time varying patient data and apply our proposed relational learning strategy in the final output layer along with other static features. Here, we replace the traditional softmax layer with a Skip-Gram relational learning strategy to compare the similarity between a patient and outcome embedding representation. We demonstrate that the construction of a HGM can robustly learn the patterns classifying patient representations of outcomes through leveraging patterns within the embeddings of similar patients. Our experimental results show that our relational learning-based HGM model achieves higher area under the receiver operating characteristic curve (auROC) than both comparator models in all prediction time windows, with dramatic improvements to recall.

Abstract Image

查看原文本刊更多论文

关系学习改善重症监护病房新冠肺炎死亡率预测

传统的机器学习（ML）模型在使用电子健康记录（EHR）数据预测冠状病毒肺炎（新冠肺炎）结果方面取得的成功有限，部分原因是没有有效地捕捉各种数据模式之间的相互联系模式。在这项工作中，我们提出了一种新的框架，该框架利用基于异质图模型（HGM）的关系学习来预测重症监护室（ICU）内新冠肺炎患者在不同时间窗的死亡率。我们利用了纽约市主要卫生系统五家医院中最大、最多样化的患者群体之一的EHR。在我们的模型中，我们使用LSTM来处理时变患者数据，并将我们提出的关系学习策略与其他静态特征一起应用于最终输出层。在这里，我们用Skip Gram关系学习策略取代了传统的softmax层，以比较患者和结果嵌入表示之间的相似性。我们证明，HGM的构建可以通过利用相似患者嵌入中的模式，稳健地学习对患者结果表示进行分类的模式。我们的实验结果表明，在所有预测时间窗口中，我们基于关系学习的HGM模型在接收器工作特性曲线（auROC）下的面积都比两个比较器模型高，召回率显著提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Big Data Multiple-

CiteScore

11.80

自引率

2.80%

发文量

114

期刊介绍： The IEEE Transactions on Big Data publishes peer-reviewed articles focusing on big data. These articles present innovative research ideas and application results across disciplines, including novel theories, algorithms, and applications. Research areas cover a wide range, such as big data analytics, visualization, curation, management, semantics, infrastructure, standards, performance analysis, intelligence extraction, scientific discovery, security, privacy, and legal issues specific to big data. The journal also prioritizes applications of big data in fields generating massive datasets.