Improving Heart Disease Prediction of Classifiers with Data Transformation using PCA and Relief Feature Selection

Guggulla Varshini, Ananthaneni Ramya, Chitrakavi Lakshmi Sravya, Vinod Kumar, Brajesh K. Shukla
{"title":"Improving Heart Disease Prediction of Classifiers with Data Transformation using PCA and Relief Feature Selection","authors":"Guggulla Varshini, Ananthaneni Ramya, Chitrakavi Lakshmi Sravya, Vinod Kumar, Brajesh K. Shukla","doi":"10.1109/ICEARS56392.2023.10085401","DOIUrl":null,"url":null,"abstract":"Cardiovascular disorders (CVD) are the key cause of mortality worldwide. One in three male premature deaths and one in five female premature deaths are thought to be attributable to Cardiovascular disorders. Early prediction of CVDs may help to attenuate the disease, potentially lowering death rates. The existence of cardiac disease can be predicted using machine learning approaches; however, the effectiveness of the classifiers may be enhanced by applying PCA, relief feature selection, and data transformation techniques. The objective of employing data transformation, PCA, and relief feature selection approaches is to enhance classifier performance and increase the interpretability and ability of classifiers to predict heart disease. Heart disease anticipating is a challenging problem in the field of healthcare. This uses popular supervised machine learning (ML) algorithms including k-NN, LR, DT, RF, SVM, and ANN to help healthcare practitioners and specialists easily identify the prevalence of heart-related illnesses in patients. In these trials, data transformation is achieved using PCA, normalized features, and relief techniques, and RF surpasses all other classifiers with a prediction accuracy of 90%, followed by ANN and DT with AUCs of 87% and 86%, respectively. SVM and Naive Bayes classifiers were shown to be lesser effective at predicting heart disease.","PeriodicalId":338611,"journal":{"name":"2023 Second International Conference on Electronics and Renewable Systems (ICEARS)","volume":"192 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Second International Conference on Electronics and Renewable Systems (ICEARS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEARS56392.2023.10085401","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Cardiovascular disorders (CVD) are the key cause of mortality worldwide. One in three male premature deaths and one in five female premature deaths are thought to be attributable to Cardiovascular disorders. Early prediction of CVDs may help to attenuate the disease, potentially lowering death rates. The existence of cardiac disease can be predicted using machine learning approaches; however, the effectiveness of the classifiers may be enhanced by applying PCA, relief feature selection, and data transformation techniques. The objective of employing data transformation, PCA, and relief feature selection approaches is to enhance classifier performance and increase the interpretability and ability of classifiers to predict heart disease. Heart disease anticipating is a challenging problem in the field of healthcare. This uses popular supervised machine learning (ML) algorithms including k-NN, LR, DT, RF, SVM, and ANN to help healthcare practitioners and specialists easily identify the prevalence of heart-related illnesses in patients. In these trials, data transformation is achieved using PCA, normalized features, and relief techniques, and RF surpasses all other classifiers with a prediction accuracy of 90%, followed by ANN and DT with AUCs of 87% and 86%, respectively. SVM and Naive Bayes classifiers were shown to be lesser effective at predicting heart disease.
基于PCA和浮雕特征选择的数据变换改进分类器的心脏病预测
心血管疾病(CVD)是世界范围内导致死亡的主要原因。据认为,三分之一的男性过早死亡和五分之一的女性过早死亡可归因于心血管疾病。心血管疾病的早期预测可能有助于减轻疾病,从而潜在地降低死亡率。使用机器学习方法可以预测心脏病的存在;然而,分类器的有效性可以通过应用PCA、地形特征选择和数据转换技术来增强。采用数据转换、主成分分析和缓解特征选择方法的目的是提高分类器的性能,提高分类器预测心脏病的可解释性和能力。心脏病预测是医疗保健领域的一个具有挑战性的问题。它使用流行的监督机器学习(ML)算法,包括k-NN、LR、DT、RF、SVM和ANN,以帮助医疗保健从业者和专家轻松识别患者中心脏相关疾病的患病率。在这些试验中,使用PCA、归一化特征和缓解技术实现数据转换,RF以90%的预测准确率超过所有其他分类器,其次是ANN和DT, auc分别为87%和86%。支持向量机和朴素贝叶斯分类器在预测心脏病方面效果较差。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信