Jingyun Choi, Yejin Kim, Hun‐Sung Kim, I. Choi, Hwanjo Yu
{"title":"基于张量因子的表型分析使用组信息:他汀类药物疗效的案例研究","authors":"Jingyun Choi, Yejin Kim, Hun‐Sung Kim, I. Choi, Hwanjo Yu","doi":"10.1145/3107411.3107423","DOIUrl":null,"url":null,"abstract":"To automatically extract medical concepts from raw electronic health records (EHRs), several applications based on machine learning techniques have been proposed. Among the various techniques, tensor factorization methods have attracted considerable attention because tensor representations can capture interactions among high-dimensional EHRs. Most of the existing tensor factorization methods for computational phenotyping are only designed to derive individual phenotypes that approximate the original data. However, deriving grouped phenotypes is desirable because patients form natural groups of interest (i.e., efficacy of treatment and disease categories). In this paper, we propose Supervised Non-negative Tensor Factorization with Multinomial Logistic Regression (SNTFL) to derive grouped phenotypes that are discriminative. We define a discriminative constraint to derive grouped phenotypes and jointly optimize a multinomial logistic regression during the tensor factorization process. Our case study on a hyperlipidemia dataset demonstrates that our proposed method obtains better discrimination on patient groups compared to the baselines and successfully discovers meaningful patient subgroups.","PeriodicalId":246388,"journal":{"name":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Tensor-Factorization-Based Phenotyping using Group Information: Case Study on the Efficacy of Statins\",\"authors\":\"Jingyun Choi, Yejin Kim, Hun‐Sung Kim, I. Choi, Hwanjo Yu\",\"doi\":\"10.1145/3107411.3107423\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To automatically extract medical concepts from raw electronic health records (EHRs), several applications based on machine learning techniques have been proposed. Among the various techniques, tensor factorization methods have attracted considerable attention because tensor representations can capture interactions among high-dimensional EHRs. Most of the existing tensor factorization methods for computational phenotyping are only designed to derive individual phenotypes that approximate the original data. However, deriving grouped phenotypes is desirable because patients form natural groups of interest (i.e., efficacy of treatment and disease categories). In this paper, we propose Supervised Non-negative Tensor Factorization with Multinomial Logistic Regression (SNTFL) to derive grouped phenotypes that are discriminative. We define a discriminative constraint to derive grouped phenotypes and jointly optimize a multinomial logistic regression during the tensor factorization process. Our case study on a hyperlipidemia dataset demonstrates that our proposed method obtains better discrimination on patient groups compared to the baselines and successfully discovers meaningful patient subgroups.\",\"PeriodicalId\":246388,\"journal\":{\"name\":\"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3107411.3107423\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3107411.3107423","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Tensor-Factorization-Based Phenotyping using Group Information: Case Study on the Efficacy of Statins
To automatically extract medical concepts from raw electronic health records (EHRs), several applications based on machine learning techniques have been proposed. Among the various techniques, tensor factorization methods have attracted considerable attention because tensor representations can capture interactions among high-dimensional EHRs. Most of the existing tensor factorization methods for computational phenotyping are only designed to derive individual phenotypes that approximate the original data. However, deriving grouped phenotypes is desirable because patients form natural groups of interest (i.e., efficacy of treatment and disease categories). In this paper, we propose Supervised Non-negative Tensor Factorization with Multinomial Logistic Regression (SNTFL) to derive grouped phenotypes that are discriminative. We define a discriminative constraint to derive grouped phenotypes and jointly optimize a multinomial logistic regression during the tensor factorization process. Our case study on a hyperlipidemia dataset demonstrates that our proposed method obtains better discrimination on patient groups compared to the baselines and successfully discovers meaningful patient subgroups.