Effective Patient Similarity Computation for Clinical Decision Support Using Time Series and Static Data

Proceedings of the Australasian Computer Science Week Multiconference Pub Date : 2020-01-29 DOI:10.1145/3373017.3373050

M. Masud, Kadhim Hayawi, S. Mathew, A. Dirir, Muhsin Cheratta

{"title":"Effective Patient Similarity Computation for Clinical Decision Support Using Time Series and Static Data","authors":"M. Masud, Kadhim Hayawi, S. Mathew, A. Dirir, Muhsin Cheratta","doi":"10.1145/3373017.3373050","DOIUrl":null,"url":null,"abstract":"This paper presents a technique for computing patient similarity using time series data effectively combined with static data. Time series data of inpatients, such as heart rate, blood pressure, Oxygen saturation, respiration are measured at regular intervals, especially for inpatients in intensive care unit (ICU). The static data are mainly patient background and demographic data, including age, weight, height and gender. The similarity computation is done in unsupervised way. It is therefore free from data labeling requirement. However, such patient similarity can be very useful in developing various clinical decision support systems including treatment, medication, hospital admission and diagnosis. Our proposed technique works in three main steps. First, patient similarity is computed for each individual time series. Second, patients are grouped by clustering the static data. Finally, similarities from individual time series are combined and effectively blended with the patient group information to create a nearest neighborhood model. This model consists of a collection of the nearest neighbors for a given patient. We encounter several challenges for this task, including dealing with multi-variate time series data, variable sampling quantities and rates, missing values, and combining time-series with static data. We evaluate the proposed technique on a real patient database on two target features, namely, ‘diagnosis’ and ‘admission type’. Notable performance is recorded for both targets, achieving f1-score as high as 0.8. We believe this technique can effectively combine different types of clinical data and develop an efficient unsupervised framework for computing patient similarity to be utilized for clinical decision support systems.","PeriodicalId":297760,"journal":{"name":"Proceedings of the Australasian Computer Science Week Multiconference","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Australasian Computer Science Week Multiconference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3373017.3373050","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

This paper presents a technique for computing patient similarity using time series data effectively combined with static data. Time series data of inpatients, such as heart rate, blood pressure, Oxygen saturation, respiration are measured at regular intervals, especially for inpatients in intensive care unit (ICU). The static data are mainly patient background and demographic data, including age, weight, height and gender. The similarity computation is done in unsupervised way. It is therefore free from data labeling requirement. However, such patient similarity can be very useful in developing various clinical decision support systems including treatment, medication, hospital admission and diagnosis. Our proposed technique works in three main steps. First, patient similarity is computed for each individual time series. Second, patients are grouped by clustering the static data. Finally, similarities from individual time series are combined and effectively blended with the patient group information to create a nearest neighborhood model. This model consists of a collection of the nearest neighbors for a given patient. We encounter several challenges for this task, including dealing with multi-variate time series data, variable sampling quantities and rates, missing values, and combining time-series with static data. We evaluate the proposed technique on a real patient database on two target features, namely, ‘diagnosis’ and ‘admission type’. Notable performance is recorded for both targets, achieving f1-score as high as 0.8. We believe this technique can effectively combine different types of clinical data and develop an efficient unsupervised framework for computing patient similarity to be utilized for clinical decision support systems.

查看原文本刊更多论文

基于时间序列和静态数据的临床决策支持的有效患者相似度计算

本文提出了一种将时间序列数据与静态数据有效结合计算患者相似度的方法。定期测量住院患者的心率、血压、血氧饱和度、呼吸等时间序列数据，特别是对重症监护病房(ICU)的住院患者。静态数据主要是患者背景和人口统计数据，包括年龄、体重、身高和性别。相似度计算采用无监督方式进行。因此，它不需要数据标签要求。然而，这种患者相似性在开发各种临床决策支持系统(包括治疗、用药、住院和诊断)时非常有用。我们提出的技术分为三个主要步骤。首先，计算每个时间序列的患者相似度。其次，对静态数据进行聚类，对患者进行分组。最后，将单个时间序列的相似度与患者群体信息进行组合并有效混合，从而创建最近邻模型。该模型由给定患者的最近邻居的集合组成。我们在这项任务中遇到了几个挑战，包括处理多变量时间序列数据、可变采样数量和速率、缺失值以及将时间序列与静态数据相结合。我们在真实的患者数据库中评估了两个目标特征，即“诊断”和“入院类型”。这两个目标都取得了显著的成绩，f1得分高达0.8。我们相信这种技术可以有效地结合不同类型的临床数据，并开发一个有效的无监督框架，用于计算临床决策支持系统的患者相似性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Australasian Computer Science Week Multiconference

自引率

0.00%

发文量