{"title":"Heart Disease Diagnosis: Performance Evaluation of Supervised Machine Learning and Feature Selection Techniques","authors":"Palak Khurana, Shakshi Sharma, Anjali Goyal","doi":"10.1109/SPIN52536.2021.9565963","DOIUrl":null,"url":null,"abstract":"Heart diseases are the leading cause of deaths nowadays. Due to the high severity of the problem, it has attracted several researchers around the globe. Researchers have considered the heart diagnosis as a classification problem where meaningful patterns are detected using data mining techniques. This paper presents an evaluation of various supervised learning algorithms and feature selection techniques for heart disease prediction. The performance of six machine learning classifiers (Naïve Bayes, Decision Tree, Logistic Regression, Random Forest, Support Vector Machine, k-Nearest Neighbour) and five feature selection techniques (Chi-Square, Gain Ratio, Information Gain, One-R and RELIEF) have been investigated on the benchmark dataset obtained from UCI Machine Learning Repository, Cleveland. The experimental results show that machine learning classifiers can achieve prediction accuracy up to 82.81% for heart disease prediction. The feature selection techniques further improve the classification performance and achieve prediction accuracy up to 83.41%.","PeriodicalId":343177,"journal":{"name":"2021 8th International Conference on Signal Processing and Integrated Networks (SPIN)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 8th International Conference on Signal Processing and Integrated Networks (SPIN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPIN52536.2021.9565963","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Heart diseases are the leading cause of deaths nowadays. Due to the high severity of the problem, it has attracted several researchers around the globe. Researchers have considered the heart diagnosis as a classification problem where meaningful patterns are detected using data mining techniques. This paper presents an evaluation of various supervised learning algorithms and feature selection techniques for heart disease prediction. The performance of six machine learning classifiers (Naïve Bayes, Decision Tree, Logistic Regression, Random Forest, Support Vector Machine, k-Nearest Neighbour) and five feature selection techniques (Chi-Square, Gain Ratio, Information Gain, One-R and RELIEF) have been investigated on the benchmark dataset obtained from UCI Machine Learning Repository, Cleveland. The experimental results show that machine learning classifiers can achieve prediction accuracy up to 82.81% for heart disease prediction. The feature selection techniques further improve the classification performance and achieve prediction accuracy up to 83.41%.