An Improved Framework for Reliable Cardiovascular Disease Prediction Using Hybrid Ensemble Learning

Tanjim Mahmud, Anik Barua, M. Begum, Eipshita Chakma, Sudhakar Das, Nahed Sharmen
{"title":"An Improved Framework for Reliable Cardiovascular Disease Prediction Using Hybrid Ensemble Learning","authors":"Tanjim Mahmud, Anik Barua, M. Begum, Eipshita Chakma, Sudhakar Das, Nahed Sharmen","doi":"10.1109/ECCE57851.2023.10101564","DOIUrl":null,"url":null,"abstract":"Cardiovascular diseases (CVDs), which include heart disorders, are the most prevalent and significant causes of death worldwide, including Bangladesh. Blood artery problems, rhythm issues, chest pain, heart attacks, strokes, and erratic blood pressure are a few of these. In Bangladesh, cardiovascular disease is the main factor in both male and female fatalities. More than 80% of CVD deaths are caused by heart disease and strokes, which are the predominant causes. To be able to examine the effectiveness of the various models, this research article explains the underlying methods as Support vector machines (SVM), K-Nearest Neighbors (KNN), Logistic Regression (LR), Random Forest (RF), Decision Tree (DT), and XGBoost (XGB), wherein Random Forest perform better when their hyperparameters are tuned (RandomizedSearchCV). There suggested ensemble technique such as Bagging, Voting, Stacking. Additionally, it is suggested that a hybrid strategy using Bagging and stacking ensemble approaches can boost the predictability of cardiovascular disease. For this analysis of patient performance, we used a dataset from Kaggle that comprises of 70,000 unique data values. According to the experiment's findings, the proposed model had the best disease prediction accuracy, coming in at 84.03%.","PeriodicalId":131537,"journal":{"name":"2023 International Conference on Electrical, Computer and Communication Engineering (ECCE)","volume":"206 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Electrical, Computer and Communication Engineering (ECCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ECCE57851.2023.10101564","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Cardiovascular diseases (CVDs), which include heart disorders, are the most prevalent and significant causes of death worldwide, including Bangladesh. Blood artery problems, rhythm issues, chest pain, heart attacks, strokes, and erratic blood pressure are a few of these. In Bangladesh, cardiovascular disease is the main factor in both male and female fatalities. More than 80% of CVD deaths are caused by heart disease and strokes, which are the predominant causes. To be able to examine the effectiveness of the various models, this research article explains the underlying methods as Support vector machines (SVM), K-Nearest Neighbors (KNN), Logistic Regression (LR), Random Forest (RF), Decision Tree (DT), and XGBoost (XGB), wherein Random Forest perform better when their hyperparameters are tuned (RandomizedSearchCV). There suggested ensemble technique such as Bagging, Voting, Stacking. Additionally, it is suggested that a hybrid strategy using Bagging and stacking ensemble approaches can boost the predictability of cardiovascular disease. For this analysis of patient performance, we used a dataset from Kaggle that comprises of 70,000 unique data values. According to the experiment's findings, the proposed model had the best disease prediction accuracy, coming in at 84.03%.
基于混合集成学习的可靠心血管疾病预测改进框架
包括心脏病在内的心血管疾病是全世界(包括孟加拉国)最普遍和最重要的死亡原因。血液动脉问题、节律问题、胸痛、心脏病发作、中风和不稳定的血压就是其中的一些。在孟加拉国,心血管疾病是男性和女性死亡的主要因素。超过80%的心血管疾病死亡是由心脏病和中风引起的,这是主要原因。为了能够检验各种模型的有效性,本文解释了支持向量机(SVM)、k近邻(KNN)、逻辑回归(LR)、随机森林(RF)、决策树(DT)和XGBoost (XGB)等基本方法,其中随机森林在超参数调整时表现更好(RandomizedSearchCV)。有建议的综合技术,如Bagging, Voting, Stacking。此外,建议使用Bagging和堆叠集成方法的混合策略可以提高心血管疾病的可预测性。为了分析病人的表现,我们使用了一个来自Kaggle的数据集,它包含了7万个独特的数据值。根据实验结果,所提出的模型具有最佳的疾病预测精度,达到84.03%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信