使用机器学习技术对心脏病进行分类

Perivitta Rajendran, S. Haw, Palaichamy Naveen
{"title":"使用机器学习技术对心脏病进行分类","authors":"Perivitta Rajendran, S. Haw, Palaichamy Naveen","doi":"10.1145/3488466.3488482","DOIUrl":null,"url":null,"abstract":"The most crucial task in the medical field is diagnosing an illness. If a disease is determined at the early stage then many lives can be saved. The purpose of this paper is to use the medical data to predict cardiovascular heart disease using both supervised and unsupervised learning techniques and to show the effects of feature correlation on the classification model with over four different algorithms namely, Logistic Regression, Naive Bayes, Random Forest and Artificial Neural Networks. For the performance assessment, it incorporates F1-score, precision, Area under curve and recall. Overall, Logistic Regression algorithm tends to perform well for both Hungary and Statlog dataset whereas for Cleveland dataset, Artificial Neural Networks performs better than Logistic Regression in terms of accuracy. In terms of area under curve score, Logistic Regression performance is higher in all the dataset compared to Naive Bayes, Random Forest and Artificial Neural Networks. The results tabulated evidently prove that the designed diagnostic system is capable of predicting the risk level of heart disease effectively when compared to other approaches.","PeriodicalId":196340,"journal":{"name":"Proceedings of the 5th International Conference on Digital Technology in Education","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Classification of Heart Disease Using Machine Learning Techniques\",\"authors\":\"Perivitta Rajendran, S. Haw, Palaichamy Naveen\",\"doi\":\"10.1145/3488466.3488482\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The most crucial task in the medical field is diagnosing an illness. If a disease is determined at the early stage then many lives can be saved. The purpose of this paper is to use the medical data to predict cardiovascular heart disease using both supervised and unsupervised learning techniques and to show the effects of feature correlation on the classification model with over four different algorithms namely, Logistic Regression, Naive Bayes, Random Forest and Artificial Neural Networks. For the performance assessment, it incorporates F1-score, precision, Area under curve and recall. Overall, Logistic Regression algorithm tends to perform well for both Hungary and Statlog dataset whereas for Cleveland dataset, Artificial Neural Networks performs better than Logistic Regression in terms of accuracy. In terms of area under curve score, Logistic Regression performance is higher in all the dataset compared to Naive Bayes, Random Forest and Artificial Neural Networks. The results tabulated evidently prove that the designed diagnostic system is capable of predicting the risk level of heart disease effectively when compared to other approaches.\",\"PeriodicalId\":196340,\"journal\":{\"name\":\"Proceedings of the 5th International Conference on Digital Technology in Education\",\"volume\":\"71 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th International Conference on Digital Technology in Education\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3488466.3488482\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th International Conference on Digital Technology in Education","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3488466.3488482","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

医学领域最关键的任务是诊断疾病。如果一种疾病在早期阶段得到诊断,那么许多生命就可以得到挽救。本文的目的是利用医学数据,使用监督和无监督学习技术来预测心血管心脏病,并展示特征相关性对四种不同算法(逻辑回归,朴素贝叶斯,随机森林和人工神经网络)分类模型的影响。对于性能评估,它包括f1分数,精度,曲线下面积和召回率。总体而言,逻辑回归算法在匈牙利和Statlog数据集上都表现良好,而对于克利夫兰数据集,人工神经网络在准确性方面表现优于逻辑回归。在曲线下面积得分方面,与朴素贝叶斯、随机森林和人工神经网络相比,逻辑回归在所有数据集中的表现都更高。结果表明,与其他方法相比,所设计的诊断系统能够有效地预测心脏病的风险水平。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Classification of Heart Disease Using Machine Learning Techniques
The most crucial task in the medical field is diagnosing an illness. If a disease is determined at the early stage then many lives can be saved. The purpose of this paper is to use the medical data to predict cardiovascular heart disease using both supervised and unsupervised learning techniques and to show the effects of feature correlation on the classification model with over four different algorithms namely, Logistic Regression, Naive Bayes, Random Forest and Artificial Neural Networks. For the performance assessment, it incorporates F1-score, precision, Area under curve and recall. Overall, Logistic Regression algorithm tends to perform well for both Hungary and Statlog dataset whereas for Cleveland dataset, Artificial Neural Networks performs better than Logistic Regression in terms of accuracy. In terms of area under curve score, Logistic Regression performance is higher in all the dataset compared to Naive Bayes, Random Forest and Artificial Neural Networks. The results tabulated evidently prove that the designed diagnostic system is capable of predicting the risk level of heart disease effectively when compared to other approaches.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信