Performance Evaluation of Data Mining and Neural Network Based Models For Diabetes Prediction

Priyabrata Sahu, J. K. Mantri
{"title":"Performance Evaluation of Data Mining and Neural Network Based Models For Diabetes Prediction","authors":"Priyabrata Sahu, J. K. Mantri","doi":"10.1109/ICSTSN57873.2023.10151474","DOIUrl":null,"url":null,"abstract":"Diabetes, often known as diabetes mellitus, is a disease that disrupts the body’s normal response to blood sugar. The pancreas releases insulin, which aids in the uptake of glucose from meals into cells to be used as fuel. Hyper-glycemia,or high blood sugar, is a typical result of uncontrolled diabetes and is associated with several significant health complications, most notably those affecting the nerves and blood vessels. Statistics indicate that in 2014, adults 18 and above had diabetes, and in 2019, diabetes was responsible for 1.5 million fatalities worldwide. Machine learning and deep learning predictive models have seen tremendous development throughout industries, including health care, making early diagnosis of diabetes a breeze. Many potentially fatal diseases, such as cancer, diabetes, heart disease, thyroid disease, etc., may be predicted or diagnosed with the use of machine learning classifiers. The treatment of chronic diabetes, one of the world’smost prevalent illnesses, might benefit greatly from improved diagnostic efficiency.Here, we examine the relative merits among several ML and DL approaches to the problem of early diabetic illness prediction. The fundamental purpose of this research is to organize and conduct out diabetes prognosis with several ML learning approaches and then analyze the results of these methods to determine which one is the most accurate classifier. In this work, we take a multifaceted approach to diabetes and its prediction by investigating a wide range of disease-related characteristics. We use the classic Dataset Based on PIDD, and we apply several Machine Learning and Deep learning classifiers to it, including Random Forest (RF), Logisticregression (LR), Support Vector Machine, Artificial Neural Network (ANN), Multilayer Perceptron (MLP), and Decision Tree, Gradient Boost (GB), XGBoost (GB), Adaboost (GB), CATBOOST (GB), and LightGBM (LGBM). There is a wide range of precision amongst the models used here. A technology that can precisely predict diabetes is shown in this research. The findings of this research indicate that one of the Data mining models, random forest (RF), and the ANN Model from the category of neural network models have superior accuracy in making diabetes forecasts.","PeriodicalId":325019,"journal":{"name":"2023 2nd International Conference on Smart Technologies and Systems for Next Generation Computing (ICSTSN)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 2nd International Conference on Smart Technologies and Systems for Next Generation Computing (ICSTSN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSTSN57873.2023.10151474","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Diabetes, often known as diabetes mellitus, is a disease that disrupts the body’s normal response to blood sugar. The pancreas releases insulin, which aids in the uptake of glucose from meals into cells to be used as fuel. Hyper-glycemia,or high blood sugar, is a typical result of uncontrolled diabetes and is associated with several significant health complications, most notably those affecting the nerves and blood vessels. Statistics indicate that in 2014, adults 18 and above had diabetes, and in 2019, diabetes was responsible for 1.5 million fatalities worldwide. Machine learning and deep learning predictive models have seen tremendous development throughout industries, including health care, making early diagnosis of diabetes a breeze. Many potentially fatal diseases, such as cancer, diabetes, heart disease, thyroid disease, etc., may be predicted or diagnosed with the use of machine learning classifiers. The treatment of chronic diabetes, one of the world’smost prevalent illnesses, might benefit greatly from improved diagnostic efficiency.Here, we examine the relative merits among several ML and DL approaches to the problem of early diabetic illness prediction. The fundamental purpose of this research is to organize and conduct out diabetes prognosis with several ML learning approaches and then analyze the results of these methods to determine which one is the most accurate classifier. In this work, we take a multifaceted approach to diabetes and its prediction by investigating a wide range of disease-related characteristics. We use the classic Dataset Based on PIDD, and we apply several Machine Learning and Deep learning classifiers to it, including Random Forest (RF), Logisticregression (LR), Support Vector Machine, Artificial Neural Network (ANN), Multilayer Perceptron (MLP), and Decision Tree, Gradient Boost (GB), XGBoost (GB), Adaboost (GB), CATBOOST (GB), and LightGBM (LGBM). There is a wide range of precision amongst the models used here. A technology that can precisely predict diabetes is shown in this research. The findings of this research indicate that one of the Data mining models, random forest (RF), and the ANN Model from the category of neural network models have superior accuracy in making diabetes forecasts.
基于数据挖掘和神经网络的糖尿病预测模型的性能评价
糖尿病,通常被称为糖尿病,是一种破坏人体对血糖的正常反应的疾病。胰腺释放胰岛素,帮助从食物中摄取葡萄糖进入细胞作为燃料。高血糖症是糖尿病不受控制的典型结果,与几种重要的健康并发症有关,最明显的是影响神经和血管的并发症。统计数据显示,2014年,18岁及以上的成年人患有糖尿病,2019年,全球有150万人死于糖尿病。机器学习和深度学习预测模型在包括医疗保健在内的各个行业都取得了巨大的发展,使糖尿病的早期诊断变得轻而易举。许多潜在的致命疾病,如癌症、糖尿病、心脏病、甲状腺疾病等,都可以通过使用机器学习分类器来预测或诊断。作为世界上最普遍的疾病之一,慢性糖尿病的治疗可能会从提高诊断效率中受益匪浅。在这里,我们研究了几种ML和DL方法在早期糖尿病疾病预测问题中的相对优点。本研究的根本目的是利用几种ML学习方法组织和进行糖尿病预后,然后分析这些方法的结果,确定哪一种分类器最准确。在这项工作中,我们通过调查广泛的疾病相关特征,采取多方面的方法来研究糖尿病及其预测。我们使用基于PIDD的经典数据集,并对其应用了几种机器学习和深度学习分类器,包括随机森林(RF)、逻辑回归(LR)、支持向量机、人工神经网络(ANN)、多层感知器(MLP)和决策树、梯度Boost (GB)、XGBoost (GB)、Adaboost (GB)、CATBOOST (GB)和LightGBM (LGBM)。这里使用的模型精度范围很广。这项研究展示了一种可以精确预测糖尿病的技术。本研究结果表明,数据挖掘模型中的随机森林模型(random forest, RF)和神经网络模型中的人工神经网络模型(ANN Model)在糖尿病预测中具有较好的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信