{"title":"Diabetic Prediction using Feature Selection based Random Forest and Fine Tuned K-Nearest Neighbor Classifier Algorithm-A Design Thinking Approach","authors":"S. Ramya, Dr T. Vijayaraghavan, D. Kalaivani","doi":"10.1109/ICESC57686.2023.10193333","DOIUrl":null,"url":null,"abstract":"In low- and middle-income nations today, diabetes affects the majority of the population, according to a World Health organization (WHO) research. The WHO report suggested that 80% of the deaths would be due to the diabetes from 2016 to 2030. However, the current method continues to provide findings that are erroneous, which has a substantial negative impact on performance. To overcome the abovementioned issue, in this work, Random Forest (RF) algorithm and Fine tuned K-Nearest Neighbor (FKNN) classifier algorithm is proposed. Pre-processing, feature selection, and classification are the three primary stages of this project. Initially, preprocessing is performing for improving the final dataset results more accurately. Preprocessing is the process of cleaning the database into correct format. In order to choose more relevant and useful data from the dataset, the feature selection is then carried out utilizing the RF algorithm. It also minimizes the risk of over fitting with minimum features. Finally, diabetic prediction and classification is done by using FKNN classifier algorithm is used for categorizing items in the feature space based on training samples that are the most similar to the objects being classified. According to the experimental results, the suggested RF+FKNN method outperforms the current algorithms in accuracy, precision, recall, and f-measure.","PeriodicalId":235381,"journal":{"name":"2023 4th International Conference on Electronics and Sustainable Communication Systems (ICESC)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 4th International Conference on Electronics and Sustainable Communication Systems (ICESC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICESC57686.2023.10193333","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In low- and middle-income nations today, diabetes affects the majority of the population, according to a World Health organization (WHO) research. The WHO report suggested that 80% of the deaths would be due to the diabetes from 2016 to 2030. However, the current method continues to provide findings that are erroneous, which has a substantial negative impact on performance. To overcome the abovementioned issue, in this work, Random Forest (RF) algorithm and Fine tuned K-Nearest Neighbor (FKNN) classifier algorithm is proposed. Pre-processing, feature selection, and classification are the three primary stages of this project. Initially, preprocessing is performing for improving the final dataset results more accurately. Preprocessing is the process of cleaning the database into correct format. In order to choose more relevant and useful data from the dataset, the feature selection is then carried out utilizing the RF algorithm. It also minimizes the risk of over fitting with minimum features. Finally, diabetic prediction and classification is done by using FKNN classifier algorithm is used for categorizing items in the feature space based on training samples that are the most similar to the objects being classified. According to the experimental results, the suggested RF+FKNN method outperforms the current algorithms in accuracy, precision, recall, and f-measure.