Md. Abdur Rahman, S. M. Shoaib, Md. Al Amin, Rafia Nishat Toma, M. Moni, M. Awal
{"title":"A Bayesian Optimization Framework for the Prediction of Diabetes Mellitus","authors":"Md. Abdur Rahman, S. M. Shoaib, Md. Al Amin, Rafia Nishat Toma, M. Moni, M. Awal","doi":"10.1109/ICAEE48663.2019.8975480","DOIUrl":null,"url":null,"abstract":"The advances of bioinformatics and medical sciences have generated an enormous amount of data which can be used by machine learning (ML) and data mining (DT) methods to transform the data into valuable knowledge and can improve diagnosis, prediction, and management of most chronic diseases. One of the most life-threatening and widespread chronic diseases is Type 2 Diabetes Mellitus (T2DM), characterized by impaired operation of glucose homeostasis. We used several cutting-edge machine learning algorithms including Random Forest (RF), Support Vector Machine (SVM), Decision Tree (DT), Naive Bayes (NB) on diabetes data. A state-of-the-art Bayesian Optimization (BO) has been proposed to optimize the hyper-parameters of machine learning classifiers for the Diabetes Mellitus (DM). The optimized hyperparameters using BO achieved an accuracy of 77.60% with RF, 76.04% with SVM, 71.61% for DT, 73.96% for NB classifier. We also achieved 64.06% accuracy without BO optimized SVM. We justified our models using confusion matrix for each classifier. The statistical comparison among different classifier’s performances has been presented using the Boxplot and Analysis of variance (ANOVA) test.","PeriodicalId":138634,"journal":{"name":"2019 5th International Conference on Advances in Electrical Engineering (ICAEE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 5th International Conference on Advances in Electrical Engineering (ICAEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAEE48663.2019.8975480","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The advances of bioinformatics and medical sciences have generated an enormous amount of data which can be used by machine learning (ML) and data mining (DT) methods to transform the data into valuable knowledge and can improve diagnosis, prediction, and management of most chronic diseases. One of the most life-threatening and widespread chronic diseases is Type 2 Diabetes Mellitus (T2DM), characterized by impaired operation of glucose homeostasis. We used several cutting-edge machine learning algorithms including Random Forest (RF), Support Vector Machine (SVM), Decision Tree (DT), Naive Bayes (NB) on diabetes data. A state-of-the-art Bayesian Optimization (BO) has been proposed to optimize the hyper-parameters of machine learning classifiers for the Diabetes Mellitus (DM). The optimized hyperparameters using BO achieved an accuracy of 77.60% with RF, 76.04% with SVM, 71.61% for DT, 73.96% for NB classifier. We also achieved 64.06% accuracy without BO optimized SVM. We justified our models using confusion matrix for each classifier. The statistical comparison among different classifier’s performances has been presented using the Boxplot and Analysis of variance (ANOVA) test.