R. Deepa, M. Sakthivadivel, S. Saravanakumar, V. Ganesh Karthikeyan, S. Madumitha
{"title":"Early diabetes risk classification using supervised learning algorithms","authors":"R. Deepa, M. Sakthivadivel, S. Saravanakumar, V. Ganesh Karthikeyan, S. Madumitha","doi":"10.1109/InCACCT57535.2023.10141713","DOIUrl":null,"url":null,"abstract":"Diabetes is one of the most devastating diseases and affects many people. Diabetes can be caused by a variety of causes, including ageing, obesity, inactivity, genetics, a poor diet, high blood pressure, and others. Diabetes increases the likelihood of developing several illnesses, including heart disease, renal disease, stroke, eye problems, nerve damage, etc. The information needed to diagnose diabetes is currently gathered through a variety of tests used in hospitals, and the diagnosis is then used to determine the best course of treatment. The healthcare sector has a considerable application for machine learning (ML). Databases in the healthcare sector are very vast. Big datasets can be examined using ML techniques to find hidden information and patterns, allowing one to learn from the data and predict outcomes properly. Using the existing methods, the forecast’s accuracy is not very good. In this study, we proposed an early diabetes prediction model that incorporates several extrinsic characteristics that contribute to the development of diabetes together with more widely used measures like polyuria, weight loss, polyphagia, visual blurring, alopecia, obesity, etc. The Support Vector Machine (SVM), the Logistic Regression (LOR), the Boosted Tree (BOT), and the Bagged Tree (BAT) are four different classifiers that are utilized in this paper to predict diabetes early on. The device’s performance is assessed in terms of accuracy, recall, specificity, precision, and f-measure. Results show that among the classifiers, BAT has the highest accuracy, at 98%.","PeriodicalId":405272,"journal":{"name":"2023 International Conference on Advancement in Computation & Computer Technologies (InCACCT)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Advancement in Computation & Computer Technologies (InCACCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/InCACCT57535.2023.10141713","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Diabetes is one of the most devastating diseases and affects many people. Diabetes can be caused by a variety of causes, including ageing, obesity, inactivity, genetics, a poor diet, high blood pressure, and others. Diabetes increases the likelihood of developing several illnesses, including heart disease, renal disease, stroke, eye problems, nerve damage, etc. The information needed to diagnose diabetes is currently gathered through a variety of tests used in hospitals, and the diagnosis is then used to determine the best course of treatment. The healthcare sector has a considerable application for machine learning (ML). Databases in the healthcare sector are very vast. Big datasets can be examined using ML techniques to find hidden information and patterns, allowing one to learn from the data and predict outcomes properly. Using the existing methods, the forecast’s accuracy is not very good. In this study, we proposed an early diabetes prediction model that incorporates several extrinsic characteristics that contribute to the development of diabetes together with more widely used measures like polyuria, weight loss, polyphagia, visual blurring, alopecia, obesity, etc. The Support Vector Machine (SVM), the Logistic Regression (LOR), the Boosted Tree (BOT), and the Bagged Tree (BAT) are four different classifiers that are utilized in this paper to predict diabetes early on. The device’s performance is assessed in terms of accuracy, recall, specificity, precision, and f-measure. Results show that among the classifiers, BAT has the highest accuracy, at 98%.