通过可解释机器学习算法分析患者满意度

2023 2nd International Conference on Edge Computing and Applications (ICECAA) Pub Date : 2023-07-19 DOI:10.1109/ICECAA58104.2023.10212377

Jamunadevi C, Subith R, D. S, Pandikumar S

{"title":"通过可解释机器学习算法分析患者满意度","authors":"Jamunadevi C, Subith R, D. S, Pandikumar S","doi":"10.1109/ICECAA58104.2023.10212377","DOIUrl":null,"url":null,"abstract":"This research study intends to reduce the features and predict whether the patients are satisfied with the service provided by the hospitals. The proposed system classifies top five features and give more accuracy using the machine learning algorithm. The existing system has a limitation that it requires an optimization solver and increases the computing work if the number of variables become large. The proposed system considers 17 attributes in the dataset and five features are selected to evaluate the system to increase the efficiency. Since the correlation of several dataset features is nearly equal, they are eliminated. Chi-square test is one of the most efficient feature selection method to reduce the unwanted data or unwanted features from the dataset before training and testing the model for attaining better accuracy and reducing the complexity of the model. The taken dataset is imbalanced, it affects the accuracy, so SMOTE technique is used to balance the dataset. The acquired dataset is cleared of any potential irregular data and pre-processed with several methods followed by feature selection and model building. The SVM, Random Forest, XGBOOST and Ensembling of Random Forest and XGBoost are the classifiers that were employed. When using a machine learning approach for both training and testing, Random Forest ultimately has higher accuracy compared to other algorithms. This method has the amazing capacity to increase categorization and forecasting precision.","PeriodicalId":114624,"journal":{"name":"2023 2nd International Conference on Edge Computing and Applications (ICECAA)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis of Patient Satisfaction through Interpretable Machine Learning Algorithms\",\"authors\":\"Jamunadevi C, Subith R, D. S, Pandikumar S\",\"doi\":\"10.1109/ICECAA58104.2023.10212377\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This research study intends to reduce the features and predict whether the patients are satisfied with the service provided by the hospitals. The proposed system classifies top five features and give more accuracy using the machine learning algorithm. The existing system has a limitation that it requires an optimization solver and increases the computing work if the number of variables become large. The proposed system considers 17 attributes in the dataset and five features are selected to evaluate the system to increase the efficiency. Since the correlation of several dataset features is nearly equal, they are eliminated. Chi-square test is one of the most efficient feature selection method to reduce the unwanted data or unwanted features from the dataset before training and testing the model for attaining better accuracy and reducing the complexity of the model. The taken dataset is imbalanced, it affects the accuracy, so SMOTE technique is used to balance the dataset. The acquired dataset is cleared of any potential irregular data and pre-processed with several methods followed by feature selection and model building. The SVM, Random Forest, XGBOOST and Ensembling of Random Forest and XGBoost are the classifiers that were employed. When using a machine learning approach for both training and testing, Random Forest ultimately has higher accuracy compared to other algorithms. This method has the amazing capacity to increase categorization and forecasting precision.\",\"PeriodicalId\":114624,\"journal\":{\"name\":\"2023 2nd International Conference on Edge Computing and Applications (ICECAA)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 2nd International Conference on Edge Computing and Applications (ICECAA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECAA58104.2023.10212377\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 2nd International Conference on Edge Computing and Applications (ICECAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECAA58104.2023.10212377","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本研究旨在减少这些特征，预测患者对医院提供的服务是否满意。提出的系统对前五个特征进行分类，并使用机器学习算法提供更高的准确性。现有的系统存在一个局限性，即需要一个优化求解器，并且当变量数量变大时，计算量会增加。该系统考虑了数据集中的17个属性，并选择了5个特征来评估系统，以提高效率。由于几个数据集特征的相关性几乎相等，因此它们被消除。卡方检验是一种最有效的特征选择方法，可以在训练和测试模型之前从数据集中减少不需要的数据或不需要的特征，以达到更好的准确性和降低模型的复杂性。由于采集的数据不平衡，影响了精度，因此采用SMOTE技术对数据进行平衡。采集的数据集清除任何潜在的不规则数据，然后使用几种方法进行预处理，然后进行特征选择和模型构建。SVM、Random Forest、XGBOOST以及Random Forest和XGBOOST的集合是我们使用的分类器。当使用机器学习方法进行训练和测试时，随机森林最终比其他算法具有更高的准确性。该方法具有提高分类和预测精度的惊人能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Analysis of Patient Satisfaction through Interpretable Machine Learning Algorithms

This research study intends to reduce the features and predict whether the patients are satisfied with the service provided by the hospitals. The proposed system classifies top five features and give more accuracy using the machine learning algorithm. The existing system has a limitation that it requires an optimization solver and increases the computing work if the number of variables become large. The proposed system considers 17 attributes in the dataset and five features are selected to evaluate the system to increase the efficiency. Since the correlation of several dataset features is nearly equal, they are eliminated. Chi-square test is one of the most efficient feature selection method to reduce the unwanted data or unwanted features from the dataset before training and testing the model for attaining better accuracy and reducing the complexity of the model. The taken dataset is imbalanced, it affects the accuracy, so SMOTE technique is used to balance the dataset. The acquired dataset is cleared of any potential irregular data and pre-processed with several methods followed by feature selection and model building. The SVM, Random Forest, XGBOOST and Ensembling of Random Forest and XGBoost are the classifiers that were employed. When using a machine learning approach for both training and testing, Random Forest ultimately has higher accuracy compared to other algorithms. This method has the amazing capacity to increase categorization and forecasting precision.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2023 2nd International Conference on Edge Computing and Applications (ICECAA)

自引率

0.00%

发文量