Olawumi Olasunkanmi, O. D. Olanloye, Abdulquadri Adegbiji
{"title":"Comparison analysis of decision tree and ensemble models in the classification of chronic kidney diseases","authors":"Olawumi Olasunkanmi, O. D. Olanloye, Abdulquadri Adegbiji","doi":"10.18100/ijamec.792863","DOIUrl":null,"url":null,"abstract":"The world is now in the era of big data and processing, and exploring the data has become one of the significant challenges. Hence, researchers have done a lot to analyse these data in the health sector to enhance disease detection and classification using artificial intelligence and ML principles. Kidney disease is one of the terrible conditions in which its late detection has sent many people to untimely graves. ML classifiers have been employed in many dimensions to classify heart disease, but, existing works have not explored the variants of each method for selection of best model parameters. An attempt is being made in this research to study the behaviour of three (3) variants each from two(2) tree-based models in the classification of Kidney Disease. Three of the variants are Complex, Medium and Simple models of Decision tree classifier and the other one are Boosted, Bagged and RUSBoosted of Ensemble Classifiers. Using MATLAB for implementation, the model performance established that the accuracy of Ensemble Classifier (Bagged tree model) is the best, concerning the speed, Decision tree (Complex and Simple tree models have the same and highest value). Hence, the two are the best. In terms of training time, Decision tree(Simple tree) has the least time and therefore the best.","PeriodicalId":120305,"journal":{"name":"International Journal of Applied Mathematics Electronics and Computers","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Applied Mathematics Electronics and Computers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18100/ijamec.792863","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The world is now in the era of big data and processing, and exploring the data has become one of the significant challenges. Hence, researchers have done a lot to analyse these data in the health sector to enhance disease detection and classification using artificial intelligence and ML principles. Kidney disease is one of the terrible conditions in which its late detection has sent many people to untimely graves. ML classifiers have been employed in many dimensions to classify heart disease, but, existing works have not explored the variants of each method for selection of best model parameters. An attempt is being made in this research to study the behaviour of three (3) variants each from two(2) tree-based models in the classification of Kidney Disease. Three of the variants are Complex, Medium and Simple models of Decision tree classifier and the other one are Boosted, Bagged and RUSBoosted of Ensemble Classifiers. Using MATLAB for implementation, the model performance established that the accuracy of Ensemble Classifier (Bagged tree model) is the best, concerning the speed, Decision tree (Complex and Simple tree models have the same and highest value). Hence, the two are the best. In terms of training time, Decision tree(Simple tree) has the least time and therefore the best.