机器学习方法在糖尿病预测中的性能分析

S. Sakib, N. Yasmin, Ihtyaz Kader Tasawar, A. Aziz, Md. Abu Bakr Siddique, Mohammad Mahmudur Rahman Khan
{"title":"机器学习方法在糖尿病预测中的性能分析","authors":"S. Sakib, N. Yasmin, Ihtyaz Kader Tasawar, A. Aziz, Md. Abu Bakr Siddique, Mohammad Mahmudur Rahman Khan","doi":"10.1109/R10-HTC53172.2021.9641737","DOIUrl":null,"url":null,"abstract":"Diabetes is a major chronic syndrome caused by a series of metabolic abnormalities in which blood glucose levels are abnormally high for an indeterminate amount of time. It influences various organs in the human body, resulting in a variety of complex diseases such as stroke, renal disease, pulmonary embolism, eyesight, and so on. Diabetes Disorders (DD) are presently one of the healthcare top causes of mortality. Predictive analytics in the health care system is a huge obstacle, but if accurate early prediction is achieved, the potential risk and degree of diabetes may be significantly decreased. Machine learning (ML) techniques are now used to analyze medical datasets at an earlier stage of life in keeping people safe. In this research, we utilized several ML approaches notably Logistic Regression, Decision Tree (DT), XGBoost, Support Vector Machine (SVM), K-Nearest Neighbor (KNN), and Random Forest (RF) on PIMA Indian Diabetes Dataset in order to monitor and evaluate their performances in diabetes prediction. The performance of the various ML algorithms employed in this research suggests which algorithm is most suitable in diabetes prediction. It is observed that among all the models XGBoost had outperformed the other ML techniques with an accuracy of 80.73% while SVM was the second-best performing model with a classification accuracy of 80.21%. Thus, employing ML techniques, this study aims to assist doctors as well as clinicians in the early detection of diabetes.","PeriodicalId":117626,"journal":{"name":"2021 IEEE 9th Region 10 Humanitarian Technology Conference (R10-HTC)","volume":"107 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Performance Analysis of Machine Learning Approaches in Diabetes Prediction\",\"authors\":\"S. Sakib, N. Yasmin, Ihtyaz Kader Tasawar, A. Aziz, Md. Abu Bakr Siddique, Mohammad Mahmudur Rahman Khan\",\"doi\":\"10.1109/R10-HTC53172.2021.9641737\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Diabetes is a major chronic syndrome caused by a series of metabolic abnormalities in which blood glucose levels are abnormally high for an indeterminate amount of time. It influences various organs in the human body, resulting in a variety of complex diseases such as stroke, renal disease, pulmonary embolism, eyesight, and so on. Diabetes Disorders (DD) are presently one of the healthcare top causes of mortality. Predictive analytics in the health care system is a huge obstacle, but if accurate early prediction is achieved, the potential risk and degree of diabetes may be significantly decreased. Machine learning (ML) techniques are now used to analyze medical datasets at an earlier stage of life in keeping people safe. In this research, we utilized several ML approaches notably Logistic Regression, Decision Tree (DT), XGBoost, Support Vector Machine (SVM), K-Nearest Neighbor (KNN), and Random Forest (RF) on PIMA Indian Diabetes Dataset in order to monitor and evaluate their performances in diabetes prediction. The performance of the various ML algorithms employed in this research suggests which algorithm is most suitable in diabetes prediction. It is observed that among all the models XGBoost had outperformed the other ML techniques with an accuracy of 80.73% while SVM was the second-best performing model with a classification accuracy of 80.21%. Thus, employing ML techniques, this study aims to assist doctors as well as clinicians in the early detection of diabetes.\",\"PeriodicalId\":117626,\"journal\":{\"name\":\"2021 IEEE 9th Region 10 Humanitarian Technology Conference (R10-HTC)\",\"volume\":\"107 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 9th Region 10 Humanitarian Technology Conference (R10-HTC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/R10-HTC53172.2021.9641737\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 9th Region 10 Humanitarian Technology Conference (R10-HTC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/R10-HTC53172.2021.9641737","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

糖尿病是一种主要的慢性综合征,由一系列代谢异常引起,其中血糖水平在不确定的时间内异常高。它影响人体的各个器官,导致中风、肾病、肺栓塞、视力等多种复杂疾病。糖尿病疾病(DD)是目前医疗保健死亡的主要原因之一。在医疗保健系统中,预测分析是一个巨大的障碍,但如果能够实现准确的早期预测,糖尿病的潜在风险和程度可能会显著降低。机器学习(ML)技术现在被用于分析生命早期阶段的医疗数据集,以确保人们的安全。在这项研究中,我们在PIMA印度糖尿病数据集上使用了几种ML方法,包括逻辑回归、决策树(DT)、XGBoost、支持向量机(SVM)、k -近邻(KNN)和随机森林(RF),以监测和评估它们在糖尿病预测中的性能。本研究中使用的各种ML算法的性能表明哪种算法最适合于糖尿病预测。可以观察到,在所有模型中,XGBoost以80.73%的准确率优于其他ML技术,而SVM以80.21%的准确率排名第二。因此,本研究旨在利用机器学习技术,帮助医生和临床医生早期发现糖尿病。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Performance Analysis of Machine Learning Approaches in Diabetes Prediction
Diabetes is a major chronic syndrome caused by a series of metabolic abnormalities in which blood glucose levels are abnormally high for an indeterminate amount of time. It influences various organs in the human body, resulting in a variety of complex diseases such as stroke, renal disease, pulmonary embolism, eyesight, and so on. Diabetes Disorders (DD) are presently one of the healthcare top causes of mortality. Predictive analytics in the health care system is a huge obstacle, but if accurate early prediction is achieved, the potential risk and degree of diabetes may be significantly decreased. Machine learning (ML) techniques are now used to analyze medical datasets at an earlier stage of life in keeping people safe. In this research, we utilized several ML approaches notably Logistic Regression, Decision Tree (DT), XGBoost, Support Vector Machine (SVM), K-Nearest Neighbor (KNN), and Random Forest (RF) on PIMA Indian Diabetes Dataset in order to monitor and evaluate their performances in diabetes prediction. The performance of the various ML algorithms employed in this research suggests which algorithm is most suitable in diabetes prediction. It is observed that among all the models XGBoost had outperformed the other ML techniques with an accuracy of 80.73% while SVM was the second-best performing model with a classification accuracy of 80.21%. Thus, employing ML techniques, this study aims to assist doctors as well as clinicians in the early detection of diabetes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信