Early Detection of Diabetes Using Random Forest Algorithm

Cindy Nabila Noviyanti, Alamsyah Alamsyah
{"title":"Early Detection of Diabetes Using Random Forest Algorithm","authors":"Cindy Nabila Noviyanti, Alamsyah Alamsyah","doi":"10.52465/joiser.v2i1.245","DOIUrl":null,"url":null,"abstract":"Diabetes is one of the most chronic and deadly diseases. According to data from WHO in 2021, there were approximately 422 million adults living with diabetes worldwide, and this number is expected to continue to increase in the future due to various factors. Many studies have been conducted for early detection of diabetes by focusing on improving accuracy. However, a big problem in diabetes prediction is the selection of the right classification algorithm. This study aims to improve the accuracy of early detection of diabetes by implementing the Random Forest algorithm model. This research was conducted with the stages of data collection, data preprocessing, split data, modeling, and evaluation. This research uses the Pima Indian Diabetes data set. The results showed that the diabetes early detection model using the Random Forest algorithm produced an accuracy of 87%. This research shows that by using the Random Forest algorithm model, the performance of early detection of diabetes can be improved. However, there is still room for optimization of this performance, which is recommended for further research to carry out feature selection, data balancing, more complex model building, and exploring larger data.","PeriodicalId":499822,"journal":{"name":"Journal of Information System Exploration and Research","volume":"14 2","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Information System Exploration and Research","FirstCategoryId":"0","ListUrlMain":"https://doi.org/10.52465/joiser.v2i1.245","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Diabetes is one of the most chronic and deadly diseases. According to data from WHO in 2021, there were approximately 422 million adults living with diabetes worldwide, and this number is expected to continue to increase in the future due to various factors. Many studies have been conducted for early detection of diabetes by focusing on improving accuracy. However, a big problem in diabetes prediction is the selection of the right classification algorithm. This study aims to improve the accuracy of early detection of diabetes by implementing the Random Forest algorithm model. This research was conducted with the stages of data collection, data preprocessing, split data, modeling, and evaluation. This research uses the Pima Indian Diabetes data set. The results showed that the diabetes early detection model using the Random Forest algorithm produced an accuracy of 87%. This research shows that by using the Random Forest algorithm model, the performance of early detection of diabetes can be improved. However, there is still room for optimization of this performance, which is recommended for further research to carry out feature selection, data balancing, more complex model building, and exploring larger data.
使用随机森林算法早期检测糖尿病
糖尿病是最慢性、最致命的疾病之一。根据世卫组织 2021 年的数据,全球约有 4.22 亿成年人患有糖尿病,由于各种因素,预计这一数字在未来还会继续增加。为提高糖尿病早期检测的准确性,已经开展了许多研究。然而,糖尿病预测的一个大问题是选择正确的分类算法。本研究旨在通过采用随机森林算法模型来提高糖尿病早期检测的准确性。本研究分为数据收集、数据预处理、数据拆分、建模和评估等阶段。本研究使用了皮马印第安糖尿病数据集。结果显示,使用随机森林算法的糖尿病早期检测模型的准确率为 87%。这项研究表明,通过使用随机森林算法模型,可以提高糖尿病早期检测的性能。然而,这一性能仍有优化的空间,建议进一步研究,开展特征选择、数据平衡、更复杂的模型构建和更大数据的探索。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信