{"title":"Multiclass Diabetes Detection Using Random Forest Classification","authors":"Amjed Al-mousa, Laith AlKhdour, Hatem Bishawi, Fares AlShubeliat","doi":"10.1109/JEEIT58638.2023.10185679","DOIUrl":null,"url":null,"abstract":"Detecting diabetes at an early stage can help save lives and improve the patients quality of life significantly. Diabetes can be detected with the assistance of information regarding the patient's lifestyle and health. This work aims to predict diabetic patients using different machine-learning classification algorithms and a dataset about diabetic and healthy patients. The work employs a data balancing technique to handle the data imbalance issue, as well as using cross-validation. In addition, it compares these machine-learning algorithms according to several performance indicators like accuracy, precision, recall, and Fl-score. Accordingly, the Random Forest classifier proved to produce the best results with accuracy, precision, recall, and an Fl-score, all equal to 89%.","PeriodicalId":177556,"journal":{"name":"2023 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT)","volume":"127 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JEEIT58638.2023.10185679","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Detecting diabetes at an early stage can help save lives and improve the patients quality of life significantly. Diabetes can be detected with the assistance of information regarding the patient's lifestyle and health. This work aims to predict diabetic patients using different machine-learning classification algorithms and a dataset about diabetic and healthy patients. The work employs a data balancing technique to handle the data imbalance issue, as well as using cross-validation. In addition, it compares these machine-learning algorithms according to several performance indicators like accuracy, precision, recall, and Fl-score. Accordingly, the Random Forest classifier proved to produce the best results with accuracy, precision, recall, and an Fl-score, all equal to 89%.