Deep Computational Phenotyping

Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Pub Date : 2015-08-10 DOI:10.1145/2783258.2783365

Zhengping Che, David C. Kale, Wenzhe Li, M. T. Bahadori, Yan Liu

{"title":"Deep Computational Phenotyping","authors":"Zhengping Che, David C. Kale, Wenzhe Li, M. T. Bahadori, Yan Liu","doi":"10.1145/2783258.2783365","DOIUrl":null,"url":null,"abstract":"We apply deep learning to the problem of discovery and detection of characteristic patterns of physiology in clinical time series data. We propose two novel modifications to standard neural net training that address challenges and exploit properties that are peculiar, if not exclusive, to medical data. First, we examine a general framework for using prior knowledge to regularize parameters in the topmost layers. This framework can leverage priors of any form, ranging from formal ontologies (e.g., ICD9 codes) to data-derived similarity. Second, we describe a scalable procedure for training a collection of neural networks of different sizes but with partially shared architectures. Both of these innovations are well-suited to medical applications, where available data are not yet Internet scale and have many sparse outputs (e.g., rare diagnoses) but which have exploitable structure (e.g., temporal order and relationships between labels). However, both techniques are sufficiently general to be applied to other problems and domains. We demonstrate the empirical efficacy of both techniques on two real-world hospital data sets and show that the resulting neural nets learn interpretable and clinically relevant features.","PeriodicalId":243428,"journal":{"name":"Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"246","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2783258.2783365","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 246

Abstract

We apply deep learning to the problem of discovery and detection of characteristic patterns of physiology in clinical time series data. We propose two novel modifications to standard neural net training that address challenges and exploit properties that are peculiar, if not exclusive, to medical data. First, we examine a general framework for using prior knowledge to regularize parameters in the topmost layers. This framework can leverage priors of any form, ranging from formal ontologies (e.g., ICD9 codes) to data-derived similarity. Second, we describe a scalable procedure for training a collection of neural networks of different sizes but with partially shared architectures. Both of these innovations are well-suited to medical applications, where available data are not yet Internet scale and have many sparse outputs (e.g., rare diagnoses) but which have exploitable structure (e.g., temporal order and relationships between labels). However, both techniques are sufficiently general to be applied to other problems and domains. We demonstrate the empirical efficacy of both techniques on two real-world hospital data sets and show that the resulting neural nets learn interpretable and clinically relevant features.

查看原文本刊更多论文

深度计算表型

我们将深度学习应用于临床时间序列数据中生理学特征模式的发现和检测问题。我们对标准神经网络训练提出了两种新的修改，以解决挑战并利用医疗数据特有的(如果不是独家的)属性。首先，我们研究了一个使用先验知识来正则化最顶层参数的一般框架。这个框架可以利用任何形式的先验，从形式本体(例如，ICD9代码)到数据派生的相似性。其次，我们描述了一个可扩展的过程，用于训练不同大小但具有部分共享架构的神经网络集合。这两项创新都非常适合医疗应用，其中可用数据尚未达到互联网规模，并且具有许多稀疏输出(例如，罕见诊断)，但具有可利用的结构(例如，时间顺序和标签之间的关系)。然而，这两种技术都足够通用，可以应用于其他问题和领域。我们在两个真实世界的医院数据集上展示了这两种技术的经验有效性，并表明由此产生的神经网络学习了可解释的和临床相关的特征。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

自引率

0.00%

发文量