A New Temporal Abstraction for Health Diagnosis Prediction using Deep Recurrent Networks

Proceedings of the 21st International Database Engineering & Applications Symposium Pub Date : 2017-07-12 DOI:10.1145/3105831.3105858

Alireza Manashty, Janet V. Light-Thompson

{"title":"A New Temporal Abstraction for Health Diagnosis Prediction using Deep Recurrent Networks","authors":"Alireza Manashty, Janet V. Light-Thompson","doi":"10.1145/3105831.3105858","DOIUrl":null,"url":null,"abstract":"Temporal health data, either as electronic health record or from nursery home care units, usually include multivariate sparse temporal health data different from a regular time-series. Conventional neural network models cannot be used in such data; recurrent neural networks (RNN) (such as with long-term short memory (LSTM) cells) are used to model time-series. However, long-term variable-length sparse temporal data are not suitable for an efficient learning with RNN models. This research presents a novel pattern extraction technique for use in diagnosis prediction using deep learning techniques in recurrent neural networks. To predict diagnosis from such data, a window-based data abstraction technique called intensity temporal sequence (ITS) is proposed and tested. ITS enables presenting long-term sparse temporal data as a fixed-length sequence suitable for training by deep recurrent networks. To evaluate the method against other techniques, such as recent temporal patterns (RTP), a pattern simulator and anomaly injection method is developed to generate 100,000 patient records with 10 possible diseases over 10,000 units of time. The results indicate that ITS performs slightly better than RTP in terms of accuracy when using techniques other than LSTM. However, only ITS is suitable for learning LSTM; a model which performs better in terms of accuracy.","PeriodicalId":319729,"journal":{"name":"Proceedings of the 21st International Database Engineering & Applications Symposium","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 21st International Database Engineering & Applications Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3105831.3105858","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Temporal health data, either as electronic health record or from nursery home care units, usually include multivariate sparse temporal health data different from a regular time-series. Conventional neural network models cannot be used in such data; recurrent neural networks (RNN) (such as with long-term short memory (LSTM) cells) are used to model time-series. However, long-term variable-length sparse temporal data are not suitable for an efficient learning with RNN models. This research presents a novel pattern extraction technique for use in diagnosis prediction using deep learning techniques in recurrent neural networks. To predict diagnosis from such data, a window-based data abstraction technique called intensity temporal sequence (ITS) is proposed and tested. ITS enables presenting long-term sparse temporal data as a fixed-length sequence suitable for training by deep recurrent networks. To evaluate the method against other techniques, such as recent temporal patterns (RTP), a pattern simulator and anomaly injection method is developed to generate 100,000 patient records with 10 possible diseases over 10,000 units of time. The results indicate that ITS performs slightly better than RTP in terms of accuracy when using techniques other than LSTM. However, only ITS is suitable for learning LSTM; a model which performs better in terms of accuracy.

查看原文本刊更多论文

一种新的基于深度循环网络的健康诊断预测时间抽象

时间健康数据，无论是作为电子健康记录还是来自托儿所家庭护理单位，通常包括不同于常规时间序列的多变量稀疏时间健康数据。传统的神经网络模型不能用于这类数据;递归神经网络(RNN)(如长短时记忆(LSTM)细胞)用于时间序列建模。然而，长期变长稀疏时态数据不适合RNN模型的高效学习。本研究提出了一种新的模式提取技术，用于在递归神经网络中使用深度学习技术进行诊断预测。为了从这些数据中预测诊断，提出并测试了一种基于窗口的数据抽象技术，称为强度时间序列(ITS)。ITS能够将长期稀疏时间数据呈现为适合深度循环网络训练的固定长度序列。为了将该方法与其他技术(如近期时间模式(RTP))进行比较，开发了模式模拟器和异常注射方法，以在10,000个单位时间内生成100,000个患者记录，其中包含10种可能的疾病。结果表明，当使用LSTM以外的技术时，ITS在准确性方面的表现略好于RTP。然而，只有ITS适合学习LSTM;一个在准确性方面表现更好的模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 21st International Database Engineering & Applications Symposium

自引率

0.00%

发文量