Cardio-Vascular Disease Prediction using Machine Learning Techniques

Srinivas Konda, N. K. Kar, Padmaja Pulicherla, G. Shivakanth, R. C
{"title":"Cardio-Vascular Disease Prediction using Machine Learning Techniques","authors":"Srinivas Konda, N. K. Kar, Padmaja Pulicherla, G. Shivakanth, R. C","doi":"10.1109/ICICACS57338.2023.10099769","DOIUrl":null,"url":null,"abstract":"The main goal of this study is to use Data Mining Method and Artificial Neural Network to develop a system that can automatically and rapidly predict the risk of coronary heart disease (ANN). The IRT Perundurai Medical College and Hospital's master health checkup data on occupational drivers were used to test this idea (PMCH). Analysis for risk identification is performed in the first stage of the hybrid approach suggested in this study, and level prediction is performed in the second. The sensitivity, specificity, precision, receiver operating curve, area under curve, 10-fold cross validation technique, and the F-measure are used for this investigation. The initial step of the study involves thinking about the most common and changeable dangers. Systolic blood pressure, diastolic blood pressure, and body mass index (BMI) are three biophysical variables, whereas fasting blood sugar, postprandial blood sugar, and triglyceride levels are three blood chemical factors (TG). All of these characteristics have a predetermined margin value that is based on WHO guidelines. Support Vector Machine (SVM), Naive Bayes (NB), and the C4.5 algorithm in Decision Tree are the three approaches used to categorize these variables and forecast the risk (DT). The C4.5 algorithm fared best in forecasting CHD risk when the three approaches were compared using the performance metrics, as discovered by the investigation. The decision tree C4.5 method outperformed the other two classifiers with an improved 99.5% accuracy and 99.67% sensitivity. The increased percentage demonstrates that the Decision tree method delivered consistent results that were better to those produced by the Naive Bayes and SVM models.","PeriodicalId":274807,"journal":{"name":"2023 IEEE International Conference on Integrated Circuits and Communication Systems (ICICACS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Integrated Circuits and Communication Systems (ICICACS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICACS57338.2023.10099769","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The main goal of this study is to use Data Mining Method and Artificial Neural Network to develop a system that can automatically and rapidly predict the risk of coronary heart disease (ANN). The IRT Perundurai Medical College and Hospital's master health checkup data on occupational drivers were used to test this idea (PMCH). Analysis for risk identification is performed in the first stage of the hybrid approach suggested in this study, and level prediction is performed in the second. The sensitivity, specificity, precision, receiver operating curve, area under curve, 10-fold cross validation technique, and the F-measure are used for this investigation. The initial step of the study involves thinking about the most common and changeable dangers. Systolic blood pressure, diastolic blood pressure, and body mass index (BMI) are three biophysical variables, whereas fasting blood sugar, postprandial blood sugar, and triglyceride levels are three blood chemical factors (TG). All of these characteristics have a predetermined margin value that is based on WHO guidelines. Support Vector Machine (SVM), Naive Bayes (NB), and the C4.5 algorithm in Decision Tree are the three approaches used to categorize these variables and forecast the risk (DT). The C4.5 algorithm fared best in forecasting CHD risk when the three approaches were compared using the performance metrics, as discovered by the investigation. The decision tree C4.5 method outperformed the other two classifiers with an improved 99.5% accuracy and 99.67% sensitivity. The increased percentage demonstrates that the Decision tree method delivered consistent results that were better to those produced by the Naive Bayes and SVM models.
使用机器学习技术进行心血管疾病预测
本研究的主要目标是利用数据挖掘方法和人工神经网络技术,开发一个能够自动快速预测冠心病(ANN)风险的系统。IRT Perundurai医学院和医院对职业司机的总体健康检查数据被用来检验这一想法(PMCH)。本研究提出的混合方法在第一阶段进行风险识别分析,第二阶段进行水平预测。采用灵敏度、特异度、精密度、受检者工作曲线、曲线下面积、10倍交叉验证技术和f测量法进行研究。研究的第一步包括考虑最常见和最多变的危险。收缩压、舒张压和身体质量指数(BMI)是三个生物物理变量,而空腹血糖、餐后血糖和甘油三酯水平是三个血液化学因子(TG)。所有这些特征都有一个基于世卫组织准则的预定边际值。支持向量机(SVM)、朴素贝叶斯(NB)和决策树中的C4.5算法是对这些变量进行分类和预测风险(DT)的三种方法。调查发现,当使用性能指标对三种方法进行比较时,C4.5算法在预测冠心病风险方面表现最好。决策树C4.5方法优于其他两种分类器,准确率提高了99.5%,灵敏度提高了99.67%。增加的百分比表明决策树方法提供了一致的结果,比朴素贝叶斯和支持向量机模型产生的结果更好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信