Predicting Life Style of Early Diabetes Mellitus using Machine Learning Technique

Q3 Computer Science

International Journal of Computing Pub Date : 2023-10-01 DOI:10.47839/ijc.22.3.3230

Salliah Shafi Bhat, Venkatesan Selvam, Gufran Ahmad Ansari

{"title":"Predicting Life Style of Early Diabetes Mellitus using Machine Learning Technique","authors":"Salliah Shafi Bhat, Venkatesan Selvam, Gufran Ahmad Ansari","doi":"10.47839/ijc.22.3.3230","DOIUrl":null,"url":null,"abstract":"A branch of artificial intelligence called Machine Learning (ML) enables machines to learn without having to be emphatically instructed. Machine Learning Techniques (MLT) have been used to forecast a variety of chronic diseases in the healthcare sector. Improvement in clinical approaches is necessary for early diabetes prediction to prevent complications and prolong the diagnosis of diabetes. Diabetes is growing fast in this world. In this paper MLT based Framework is recommended for early prediction of Diabetes Mellitus (DM). In this Paper the authors make use of PIDD data set. Different MLTs are used including Support Vector Classification (SVC), Logistic Regression (LR), K Nearest Neighbor (KNN) and Random Forest (RF). Data analysis is the first step in our method after which the information is transferred for data pre-processing and feature selection methods. RF performed better than other models with a 92.85 % accuracy rate followed by SVC (91.5%), LR (83.11) and KNN (89.6). K-fold cross-validation technique is utilized to verify the outcomes. The contribution of lifestyle characteristics is calculated using a feature engineering process. As a result, comprehensive overall comparative assessments of all the algorithms are performed taking into account variables such as accuracy, precision, sensitivity, recall, F1 score and ROC-AUC. The medical field can use the proposed framework to make early diabetes predictions. Additionally, it can be applied to other datasets that have data in common with diabetes.","PeriodicalId":37669,"journal":{"name":"International Journal of Computing","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47839/ijc.22.3.3230","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 0

Abstract

A branch of artificial intelligence called Machine Learning (ML) enables machines to learn without having to be emphatically instructed. Machine Learning Techniques (MLT) have been used to forecast a variety of chronic diseases in the healthcare sector. Improvement in clinical approaches is necessary for early diabetes prediction to prevent complications and prolong the diagnosis of diabetes. Diabetes is growing fast in this world. In this paper MLT based Framework is recommended for early prediction of Diabetes Mellitus (DM). In this Paper the authors make use of PIDD data set. Different MLTs are used including Support Vector Classification (SVC), Logistic Regression (LR), K Nearest Neighbor (KNN) and Random Forest (RF). Data analysis is the first step in our method after which the information is transferred for data pre-processing and feature selection methods. RF performed better than other models with a 92.85 % accuracy rate followed by SVC (91.5%), LR (83.11) and KNN (89.6). K-fold cross-validation technique is utilized to verify the outcomes. The contribution of lifestyle characteristics is calculated using a feature engineering process. As a result, comprehensive overall comparative assessments of all the algorithms are performed taking into account variables such as accuracy, precision, sensitivity, recall, F1 score and ROC-AUC. The medical field can use the proposed framework to make early diabetes predictions. Additionally, it can be applied to other datasets that have data in common with diabetes.

查看原文本刊更多论文

利用机器学习技术预测早期糖尿病患者的生活方式

人工智能的一个分支被称为机器学习(ML)，它使机器能够在不需要强调指示的情况下学习。机器学习技术(MLT)已被用于预测医疗保健领域的各种慢性疾病。改进临床方法是糖尿病早期预测预防并发症和延长糖尿病诊断的必要条件。糖尿病在这个世界上增长迅速。本文推荐基于MLT的框架用于糖尿病的早期预测。本文利用了PIDD数据集。使用不同的mlt，包括支持向量分类(SVC)，逻辑回归(LR)， K近邻(KNN)和随机森林(RF)。数据分析是该方法的第一步，然后将信息传递给数据预处理和特征选择方法。RF以92.85%的准确率优于其他模型，其次是SVC(91.5%)、LR(83.11)和KNN(89.6)。使用K-fold交叉验证技术来验证结果。使用特征工程过程计算生活方式特征的贡献。因此，考虑到准确性、精密度、灵敏度、召回率、F1分数和ROC-AUC等变量，对所有算法进行了全面的总体比较评估。医学领域可以使用提出的框架进行早期糖尿病预测。此外，它还可以应用于与糖尿病有共同数据的其他数据集。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Computing Computer Science-Computer Science (miscellaneous)

CiteScore

2.20

自引率

0.00%

发文量

期刊介绍： The International Journal of Computing Journal was established in 2002 on the base of Branch Research Laboratory for Automated Systems and Networks, since 2005 it’s renamed as Research Institute of Intelligent Computer Systems. A goal of the Journal is to publish papers with the novel results in Computing Science and Computer Engineering and Information Technologies and Software Engineering and Information Systems within the Journal topics. The official language of the Journal is English; also papers abstracts in both Ukrainian and Russian languages are published there. The issues of the Journal are published quarterly. The Editorial Board consists of about 30 recognized worldwide scientists.