A comparative analysis of discretization algorithms for data mining
Xie Ming, Xinping Xiao
2009 IEEE International Conference on Grey Systems and Intelligent Services (GSIS 2009)
Published: 2009-11-01
DOI: 10.1109/GSIS.2009.5408138 (https://doi.org/10.1109/GSIS.2009.5408138)
Citations: 1
Abstract
This paper compares four typical discretization algorithms on worked examples from two perspectives: first, how the quality of classification and the accuracy of approximation vary under different parameter values; second, the degree of similarity between the reduced variable sets and the original variable set. To determine the reduced variable sets, the reduction is treated as a multi-objective optimization problem and solved with a genetic algorithm, and the optimal reduced variable sets are then identified through their inclusion degrees. Finally, a consistent conclusion on the preference among the discretization algorithms is reached.
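
The two evaluation measures named in the abstract can be illustrated with a small sketch. The abstract does not name the four algorithms or their parameters, so the example below assumes equal-width binning with the bin count `k` as the parameter, and assumes the standard rough-set definitions: quality of classification is the fraction of objects lying in the lower approximation of some decision class, and accuracy of approximation is the total size of the lower approximations divided by the total size of the upper approximations. The function names `equal_width_bins` and `approximation_measures` and the toy data are hypothetical, not taken from the paper.

```python
from collections import defaultdict


def equal_width_bins(values, k):
    """Map each numeric value to one of k equal-width interval indices.

    Equal-width binning is an assumed stand-in; the paper's four
    algorithms are not named in the abstract.
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0 for _ in values]
    width = (hi - lo) / k
    # The maximum value would land in bin k, so clamp it into the last bin.
    return [min(int((v - lo) / width), k - 1) for v in values]


def approximation_measures(condition_rows, decisions):
    """Return (quality of classification, accuracy of approximation)."""
    # Equivalence classes: objects with identical discretized condition values.
    blocks = defaultdict(set)
    for i, row in enumerate(condition_rows):
        blocks[tuple(row)].add(i)

    # Decision classes: objects sharing the same decision label.
    classes = defaultdict(set)
    for i, d in enumerate(decisions):
        classes[d].add(i)

    lower_total = 0  # summed sizes of the lower approximations
    upper_total = 0  # summed sizes of the upper approximations
    for members in classes.values():
        for block in blocks.values():
            if block <= members:   # block lies entirely inside the class
                lower_total += len(block)
            if block & members:    # block overlaps the class
                upper_total += len(block)

    quality = lower_total / len(decisions)
    accuracy = lower_total / upper_total
    return quality, accuracy


if __name__ == "__main__":
    # Toy data (hypothetical): one numeric attribute, binary decision.
    values = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
    decisions = [0, 0, 0, 1, 0, 1, 1, 1]
    for k in (2, 4, 8):
        rows = [[b] for b in equal_width_bins(values, k)]
        print(k, approximation_measures(rows, decisions))
```

On this toy data a coarser discretization merges objects with different decisions into the same interval, so both measures fall as `k` shrinks. That sensitivity to the parameter is the kind of behaviour the first comparison in the abstract examines.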