Rayrone Zirtany Nunes Marques, L. Coutinho, T. B. Borchartt, S. Vale, Francisco Silva
{"title":"An Experimental Evaluation of Data Mining Algorithms Using Hyperparameter Optimization","authors":"Rayrone Zirtany Nunes Marques, L. Coutinho, T. B. Borchartt, S. Vale, Francisco Silva","doi":"10.1109/MICAI.2015.29","DOIUrl":null,"url":null,"abstract":"The challenge to choose the best algorithm and its best parameters for a given problem is known as Combined Algorithm Selection and Hyperparameter Optimization Problem. Among all the classification algorithms available are those based on human comprehensible representations, such as decision trees and classification rule induction. These algorithms are usually chosen by the clarity of the results obtained and the interpretability of its models. In this paper, we evaluated the six most used algorithms based on human comprehension. We conducted experiments with 28 datasets often used in the literature in different ways: using default parameters, using ExpDB parameters and using a tool based in genetic algorithm to find the best parameter combination. The results obtained have shown the strategy of combining the data from ExpDB via GA is effective in finding classification models with good accuracy.","PeriodicalId":448255,"journal":{"name":"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)","volume":"160 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MICAI.2015.29","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
The challenge to choose the best algorithm and its best parameters for a given problem is known as Combined Algorithm Selection and Hyperparameter Optimization Problem. Among all the classification algorithms available are those based on human comprehensible representations, such as decision trees and classification rule induction. These algorithms are usually chosen by the clarity of the results obtained and the interpretability of its models. In this paper, we evaluated the six most used algorithms based on human comprehension. We conducted experiments with 28 datasets often used in the literature in different ways: using default parameters, using ExpDB parameters and using a tool based in genetic algorithm to find the best parameter combination. The results obtained have shown the strategy of combining the data from ExpDB via GA is effective in finding classification models with good accuracy.