Variationally regularized graph-based representation learning for electronic health records

Proceedings of the ACM Conference on Health, Inference, and Learning Pub Date : 2019-12-08 DOI:10.1145/3450439.3451855

Weicheng Zhu, N. Razavian

{"title":"Variationally regularized graph-based representation learning for electronic health records","authors":"Weicheng Zhu, N. Razavian","doi":"10.1145/3450439.3451855","DOIUrl":null,"url":null,"abstract":"Electronic Health Records (EHR) are high-dimensional data with implicit connections among thousands of medical concepts. These connections, for instance, the co-occurrence of diseases and lab-disease correlations can be informative when only a subset of these variables is documented by the clinician. A feasible approach to improving the representation learning of EHR data is to associate relevant medical concepts and utilize these connections. Existing medical ontologies can be the reference for EHR structures, but they place numerous constraints on the data source. Recent progress on graph neural networks (GNN) enables end-to-end learning of topological structures for non-grid or non-sequential data. However, there are problems to be addressed on how to learn the medical graph adaptively and how to understand the effect of medical graph on representation learning. In this paper, we propose a variationally regularized encoder-decoder graph network that achieves more robustness in graph structure learning by regularizing node representations. Our model outperforms the existing graph and non-graph based methods in various EHR predictive tasks based on both public data and real-world clinical data. Besides the improvements in empirical experiment performances, we provide an interpretation of the effect of variational regularization compared to standard graph neural network, using singular value analysis.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":"44 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ACM Conference on Health, Inference, and Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3450439.3451855","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 19

Abstract

Electronic Health Records (EHR) are high-dimensional data with implicit connections among thousands of medical concepts. These connections, for instance, the co-occurrence of diseases and lab-disease correlations can be informative when only a subset of these variables is documented by the clinician. A feasible approach to improving the representation learning of EHR data is to associate relevant medical concepts and utilize these connections. Existing medical ontologies can be the reference for EHR structures, but they place numerous constraints on the data source. Recent progress on graph neural networks (GNN) enables end-to-end learning of topological structures for non-grid or non-sequential data. However, there are problems to be addressed on how to learn the medical graph adaptively and how to understand the effect of medical graph on representation learning. In this paper, we propose a variationally regularized encoder-decoder graph network that achieves more robustness in graph structure learning by regularizing node representations. Our model outperforms the existing graph and non-graph based methods in various EHR predictive tasks based on both public data and real-world clinical data. Besides the improvements in empirical experiment performances, we provide an interpretation of the effect of variational regularization compared to standard graph neural network, using singular value analysis.

查看原文本刊更多论文

基于变分正则化图的电子健康记录表示学习

电子健康记录(EHR)是高维数据，在数千个医疗概念之间具有隐式联系。例如，当临床医生只记录这些变量的一个子集时，这些联系，疾病的共发生和实验室疾病的相关性可以提供信息。将相关的医学概念关联起来并利用这些联系是改善电子病历数据表示学习的一种可行方法。现有的医学本体可以作为EHR结构的参考，但是它们对数据源施加了许多限制。图神经网络(GNN)的最新进展使非网格或非顺序数据的拓扑结构的端到端学习成为可能。然而，如何自适应地学习医学图，以及如何理解医学图对表征学习的影响，都是有待解决的问题。在本文中，我们提出了一种变正则化的编码器-解码器图网络，该网络通过正则化节点表示来实现图结构学习的鲁棒性。我们的模型在基于公共数据和真实临床数据的各种EHR预测任务中优于现有的基于图和非基于图的方法。除了经验实验性能的改进之外，我们还提供了与标准图神经网络相比，使用奇异值分析的变分正则化效果的解释。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the ACM Conference on Health, Inference, and Learning

自引率

0.00%

发文量