通过可解释机器学习算法分析患者满意度

Jamunadevi C, Subith R, D. S, Pandikumar S
{"title":"通过可解释机器学习算法分析患者满意度","authors":"Jamunadevi C, Subith R, D. S, Pandikumar S","doi":"10.1109/ICECAA58104.2023.10212377","DOIUrl":null,"url":null,"abstract":"This research study intends to reduce the features and predict whether the patients are satisfied with the service provided by the hospitals. The proposed system classifies top five features and give more accuracy using the machine learning algorithm. The existing system has a limitation that it requires an optimization solver and increases the computing work if the number of variables become large. The proposed system considers 17 attributes in the dataset and five features are selected to evaluate the system to increase the efficiency. Since the correlation of several dataset features is nearly equal, they are eliminated. Chi-square test is one of the most efficient feature selection method to reduce the unwanted data or unwanted features from the dataset before training and testing the model for attaining better accuracy and reducing the complexity of the model. The taken dataset is imbalanced, it affects the accuracy, so SMOTE technique is used to balance the dataset. The acquired dataset is cleared of any potential irregular data and pre-processed with several methods followed by feature selection and model building. The SVM, Random Forest, XGBOOST and Ensembling of Random Forest and XGBoost are the classifiers that were employed. When using a machine learning approach for both training and testing, Random Forest ultimately has higher accuracy compared to other algorithms. This method has the amazing capacity to increase categorization and forecasting precision.","PeriodicalId":114624,"journal":{"name":"2023 2nd International Conference on Edge Computing and Applications (ICECAA)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis of Patient Satisfaction through Interpretable Machine Learning Algorithms\",\"authors\":\"Jamunadevi C, Subith R, D. S, Pandikumar S\",\"doi\":\"10.1109/ICECAA58104.2023.10212377\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This research study intends to reduce the features and predict whether the patients are satisfied with the service provided by the hospitals. The proposed system classifies top five features and give more accuracy using the machine learning algorithm. The existing system has a limitation that it requires an optimization solver and increases the computing work if the number of variables become large. The proposed system considers 17 attributes in the dataset and five features are selected to evaluate the system to increase the efficiency. Since the correlation of several dataset features is nearly equal, they are eliminated. Chi-square test is one of the most efficient feature selection method to reduce the unwanted data or unwanted features from the dataset before training and testing the model for attaining better accuracy and reducing the complexity of the model. The taken dataset is imbalanced, it affects the accuracy, so SMOTE technique is used to balance the dataset. The acquired dataset is cleared of any potential irregular data and pre-processed with several methods followed by feature selection and model building. The SVM, Random Forest, XGBOOST and Ensembling of Random Forest and XGBoost are the classifiers that were employed. When using a machine learning approach for both training and testing, Random Forest ultimately has higher accuracy compared to other algorithms. This method has the amazing capacity to increase categorization and forecasting precision.\",\"PeriodicalId\":114624,\"journal\":{\"name\":\"2023 2nd International Conference on Edge Computing and Applications (ICECAA)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 2nd International Conference on Edge Computing and Applications (ICECAA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECAA58104.2023.10212377\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 2nd International Conference on Edge Computing and Applications (ICECAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECAA58104.2023.10212377","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本研究旨在减少这些特征,预测患者对医院提供的服务是否满意。提出的系统对前五个特征进行分类,并使用机器学习算法提供更高的准确性。现有的系统存在一个局限性,即需要一个优化求解器,并且当变量数量变大时,计算量会增加。该系统考虑了数据集中的17个属性,并选择了5个特征来评估系统,以提高效率。由于几个数据集特征的相关性几乎相等,因此它们被消除。卡方检验是一种最有效的特征选择方法,可以在训练和测试模型之前从数据集中减少不需要的数据或不需要的特征,以达到更好的准确性和降低模型的复杂性。由于采集的数据不平衡,影响了精度,因此采用SMOTE技术对数据进行平衡。采集的数据集清除任何潜在的不规则数据,然后使用几种方法进行预处理,然后进行特征选择和模型构建。SVM、Random Forest、XGBOOST以及Random Forest和XGBOOST的集合是我们使用的分类器。当使用机器学习方法进行训练和测试时,随机森林最终比其他算法具有更高的准确性。该方法具有提高分类和预测精度的惊人能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Analysis of Patient Satisfaction through Interpretable Machine Learning Algorithms
This research study intends to reduce the features and predict whether the patients are satisfied with the service provided by the hospitals. The proposed system classifies top five features and give more accuracy using the machine learning algorithm. The existing system has a limitation that it requires an optimization solver and increases the computing work if the number of variables become large. The proposed system considers 17 attributes in the dataset and five features are selected to evaluate the system to increase the efficiency. Since the correlation of several dataset features is nearly equal, they are eliminated. Chi-square test is one of the most efficient feature selection method to reduce the unwanted data or unwanted features from the dataset before training and testing the model for attaining better accuracy and reducing the complexity of the model. The taken dataset is imbalanced, it affects the accuracy, so SMOTE technique is used to balance the dataset. The acquired dataset is cleared of any potential irregular data and pre-processed with several methods followed by feature selection and model building. The SVM, Random Forest, XGBOOST and Ensembling of Random Forest and XGBoost are the classifiers that were employed. When using a machine learning approach for both training and testing, Random Forest ultimately has higher accuracy compared to other algorithms. This method has the amazing capacity to increase categorization and forecasting precision.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信