Noor Azmiya Bt Sirajun Noor, I. Elamvazuthi, N. Yahya
{"title":"用集成算法对糖尿病进行分类","authors":"Noor Azmiya Bt Sirajun Noor, I. Elamvazuthi, N. Yahya","doi":"10.1109/ICIAS49414.2021.9642508","DOIUrl":null,"url":null,"abstract":"Diabetes Mellitus (DM) is one of the most prevalent diseases in the world today which is associated by having high glucose levels in the body either due to inadequate production of insulin or the body cell’s not responding towards the produced insulin. Data mining and machine learning techniques can be extremely useful in classification of DM considering the need to have a shift from current traditional methods which use sharp needles to draw blood towards a non - invasive method. The objective of this study is to perform DM classification using various machine learning algorithms. In this paper, individual classifiers such as Support Vector Machine, Naïve Bayes, Bayes Net, Decision Stump, k - Nearest Neighbors, Logistic Regression, Multilayer Perceptron and Decision Tree are experimented. Apart from that, ensemble methods such as bagging, boosting, hybrid classifier using combinations of Random Forest with other base classifiers and ensemble algorithm which is the Random Forest has also been studied. Proposed DM classification model is chosen based on an optimized model reflected by their accuracy and performance of the model. In this research, it was found that performance of ensemble method using hybrid classifier of Random Forest - Bayes Net model has proven to be the best DM classification model with an accuracy of 83.91% and AUC of 0.904 using the Pima Indian Diabetes Dataset (PIDD).","PeriodicalId":212635,"journal":{"name":"2020 8th International Conference on Intelligent and Advanced Systems (ICIAS)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Classification of Diabetes Mellitus using Ensemble Algorithms\",\"authors\":\"Noor Azmiya Bt Sirajun Noor, I. Elamvazuthi, N. Yahya\",\"doi\":\"10.1109/ICIAS49414.2021.9642508\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Diabetes Mellitus (DM) is one of the most prevalent diseases in the world today which is associated by having high glucose levels in the body either due to inadequate production of insulin or the body cell’s not responding towards the produced insulin. Data mining and machine learning techniques can be extremely useful in classification of DM considering the need to have a shift from current traditional methods which use sharp needles to draw blood towards a non - invasive method. The objective of this study is to perform DM classification using various machine learning algorithms. In this paper, individual classifiers such as Support Vector Machine, Naïve Bayes, Bayes Net, Decision Stump, k - Nearest Neighbors, Logistic Regression, Multilayer Perceptron and Decision Tree are experimented. Apart from that, ensemble methods such as bagging, boosting, hybrid classifier using combinations of Random Forest with other base classifiers and ensemble algorithm which is the Random Forest has also been studied. Proposed DM classification model is chosen based on an optimized model reflected by their accuracy and performance of the model. In this research, it was found that performance of ensemble method using hybrid classifier of Random Forest - Bayes Net model has proven to be the best DM classification model with an accuracy of 83.91% and AUC of 0.904 using the Pima Indian Diabetes Dataset (PIDD).\",\"PeriodicalId\":212635,\"journal\":{\"name\":\"2020 8th International Conference on Intelligent and Advanced Systems (ICIAS)\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 8th International Conference on Intelligent and Advanced Systems (ICIAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIAS49414.2021.9642508\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 8th International Conference on Intelligent and Advanced Systems (ICIAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIAS49414.2021.9642508","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Classification of Diabetes Mellitus using Ensemble Algorithms
Diabetes Mellitus (DM) is one of the most prevalent diseases in the world today which is associated by having high glucose levels in the body either due to inadequate production of insulin or the body cell’s not responding towards the produced insulin. Data mining and machine learning techniques can be extremely useful in classification of DM considering the need to have a shift from current traditional methods which use sharp needles to draw blood towards a non - invasive method. The objective of this study is to perform DM classification using various machine learning algorithms. In this paper, individual classifiers such as Support Vector Machine, Naïve Bayes, Bayes Net, Decision Stump, k - Nearest Neighbors, Logistic Regression, Multilayer Perceptron and Decision Tree are experimented. Apart from that, ensemble methods such as bagging, boosting, hybrid classifier using combinations of Random Forest with other base classifiers and ensemble algorithm which is the Random Forest has also been studied. Proposed DM classification model is chosen based on an optimized model reflected by their accuracy and performance of the model. In this research, it was found that performance of ensemble method using hybrid classifier of Random Forest - Bayes Net model has proven to be the best DM classification model with an accuracy of 83.91% and AUC of 0.904 using the Pima Indian Diabetes Dataset (PIDD).