{"title":"预测非依从性的文本挖掘方法","authors":"Yufan Wang, Mahsa Mohaghegh","doi":"10.1109/ICMLA52953.2021.00236","DOIUrl":null,"url":null,"abstract":"Companies operating patient support programs for chronic diseases have been dedicated to enhancing treatment adherence by utilizing data from various interventions of the programs. The purpose of this paper is to examine whether the textual patient notes recorded by program coordinators can be beneficial to predict non-adherence and provide useful insights. In this paper we show work in processing and analyzing over 20,000 patient notes corresponding to 1313 Psoriasis patients using statistical analysis and several NLP methods, such as term representation, sentiment analysis and topic modelling. To build predictive models, Support Vector Machine (SVM), Random Forest (RF) and Logistic Regression (LR) are tested with different feature subsets. The best performing model is SVM with 93% accuracy and 91% recall of non-adherent. Additionally, we also present patterns to differentiate non-adherent and adherent patients in terms of completion efficiency of call objectives and uncontactable problem. Accordingly, high-risk patients can be targeted to take interventions.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"12 1","pages":"1468-1471"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Text Mining Approach To Predict Non-Adherence\",\"authors\":\"Yufan Wang, Mahsa Mohaghegh\",\"doi\":\"10.1109/ICMLA52953.2021.00236\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Companies operating patient support programs for chronic diseases have been dedicated to enhancing treatment adherence by utilizing data from various interventions of the programs. The purpose of this paper is to examine whether the textual patient notes recorded by program coordinators can be beneficial to predict non-adherence and provide useful insights. In this paper we show work in processing and analyzing over 20,000 patient notes corresponding to 1313 Psoriasis patients using statistical analysis and several NLP methods, such as term representation, sentiment analysis and topic modelling. To build predictive models, Support Vector Machine (SVM), Random Forest (RF) and Logistic Regression (LR) are tested with different feature subsets. The best performing model is SVM with 93% accuracy and 91% recall of non-adherent. Additionally, we also present patterns to differentiate non-adherent and adherent patients in terms of completion efficiency of call objectives and uncontactable problem. Accordingly, high-risk patients can be targeted to take interventions.\",\"PeriodicalId\":6750,\"journal\":{\"name\":\"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)\",\"volume\":\"12 1\",\"pages\":\"1468-1471\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLA52953.2021.00236\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA52953.2021.00236","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Companies operating patient support programs for chronic diseases have been dedicated to enhancing treatment adherence by utilizing data from various interventions of the programs. The purpose of this paper is to examine whether the textual patient notes recorded by program coordinators can be beneficial to predict non-adherence and provide useful insights. In this paper we show work in processing and analyzing over 20,000 patient notes corresponding to 1313 Psoriasis patients using statistical analysis and several NLP methods, such as term representation, sentiment analysis and topic modelling. To build predictive models, Support Vector Machine (SVM), Random Forest (RF) and Logistic Regression (LR) are tested with different feature subsets. The best performing model is SVM with 93% accuracy and 91% recall of non-adherent. Additionally, we also present patterns to differentiate non-adherent and adherent patients in terms of completion efficiency of call objectives and uncontactable problem. Accordingly, high-risk patients can be targeted to take interventions.