Chengliang Yang, C. Delcher, E. Shenkman, S. Ranka
{"title":"高利用率医疗保健接触间到达时间的聚类","authors":"Chengliang Yang, C. Delcher, E. Shenkman, S. Ranka","doi":"10.1109/HealthCom.2018.8531173","DOIUrl":null,"url":null,"abstract":"Patients with a large number of health care encounters receive great attention in health care research because their expensive and problematic health care utilization has important implications for the US health care system. The large volume of emergency department (ED) visits and inpatient hospital stays through time provides a unique opportunity to apply data-driven methods for identifying temporal signals associated with these patients. The micro variations in the inter-arrival time of these encounters within this population are not well-studied. Computational approaches for distinguishing various temporal visiting patterns leads to the problem of efficiently clustering asynchronous time series. Thus, we propose a Wasserstein distance based spectral clustering for this problem. Asynchronous time series are first represented as histograms of inter-arrival time and their pairwise similarities are computed under Wasserstein distance. Spectral clustering operates on the similarity matrix as input thereby avoiding the computational bottleneck of Wasserstein barycenters. The effectiveness of this method is demonstrated by synthetic data and application to a large real world health insurance encounters dataset, identifying potential associations between temporal visiting patterns and health factors from a population of frequent ED and inpatient hospital users.","PeriodicalId":232709,"journal":{"name":"2018 IEEE 20th International Conference on e-Health Networking, Applications and Services (Healthcom)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Clustering Inter-Arrival Time of Health Care Encounters for High Utilizers\",\"authors\":\"Chengliang Yang, C. Delcher, E. Shenkman, S. Ranka\",\"doi\":\"10.1109/HealthCom.2018.8531173\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Patients with a large number of health care encounters receive great attention in health care research because their expensive and problematic health care utilization has important implications for the US health care system. The large volume of emergency department (ED) visits and inpatient hospital stays through time provides a unique opportunity to apply data-driven methods for identifying temporal signals associated with these patients. The micro variations in the inter-arrival time of these encounters within this population are not well-studied. Computational approaches for distinguishing various temporal visiting patterns leads to the problem of efficiently clustering asynchronous time series. Thus, we propose a Wasserstein distance based spectral clustering for this problem. Asynchronous time series are first represented as histograms of inter-arrival time and their pairwise similarities are computed under Wasserstein distance. Spectral clustering operates on the similarity matrix as input thereby avoiding the computational bottleneck of Wasserstein barycenters. The effectiveness of this method is demonstrated by synthetic data and application to a large real world health insurance encounters dataset, identifying potential associations between temporal visiting patterns and health factors from a population of frequent ED and inpatient hospital users.\",\"PeriodicalId\":232709,\"journal\":{\"name\":\"2018 IEEE 20th International Conference on e-Health Networking, Applications and Services (Healthcom)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 20th International Conference on e-Health Networking, Applications and Services (Healthcom)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HealthCom.2018.8531173\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 20th International Conference on e-Health Networking, Applications and Services (Healthcom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HealthCom.2018.8531173","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Clustering Inter-Arrival Time of Health Care Encounters for High Utilizers
Patients with a large number of health care encounters receive great attention in health care research because their expensive and problematic health care utilization has important implications for the US health care system. The large volume of emergency department (ED) visits and inpatient hospital stays through time provides a unique opportunity to apply data-driven methods for identifying temporal signals associated with these patients. The micro variations in the inter-arrival time of these encounters within this population are not well-studied. Computational approaches for distinguishing various temporal visiting patterns leads to the problem of efficiently clustering asynchronous time series. Thus, we propose a Wasserstein distance based spectral clustering for this problem. Asynchronous time series are first represented as histograms of inter-arrival time and their pairwise similarities are computed under Wasserstein distance. Spectral clustering operates on the similarity matrix as input thereby avoiding the computational bottleneck of Wasserstein barycenters. The effectiveness of this method is demonstrated by synthetic data and application to a large real world health insurance encounters dataset, identifying potential associations between temporal visiting patterns and health factors from a population of frequent ED and inpatient hospital users.