{"title":"将自适应离散化方法引入遗传规划中进行数据分类","authors":"Emmanuel Dufourq, N. Pillay","doi":"10.1109/WICT.2013.7113123","DOIUrl":null,"url":null,"abstract":"Genetic programming (GP) for data classification using decision trees has been successful in creating models which obtain high classification accuracies. When categorical data is used GP is able to directly use decision trees to create models, however when the data contains continuous attributes discretization is required as a pre-processing step prior to learning. There has been no attempt to incorporate the discretization mechanism into the GP algorithm and this serves as the rationale for this paper. This paper proposes an adaptive discretization method for inclusion into the GP algorithm by randomly creating intervals during the execution of the algorithm through the use of a new genetic operator. This proposed approach was tested on five data sets and serves as an initial attempt at dynamically altering the intervals of GP decision trees while simultaneously searching for an optimal solution during the learning phase. The proposed method performs well when compared to other non-GP adaptive methods.","PeriodicalId":235292,"journal":{"name":"2013 Third World Congress on Information and Communication Technologies (WICT 2013)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Incorporating adaptive discretization into genetic programming for data classification\",\"authors\":\"Emmanuel Dufourq, N. Pillay\",\"doi\":\"10.1109/WICT.2013.7113123\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Genetic programming (GP) for data classification using decision trees has been successful in creating models which obtain high classification accuracies. When categorical data is used GP is able to directly use decision trees to create models, however when the data contains continuous attributes discretization is required as a pre-processing step prior to learning. There has been no attempt to incorporate the discretization mechanism into the GP algorithm and this serves as the rationale for this paper. This paper proposes an adaptive discretization method for inclusion into the GP algorithm by randomly creating intervals during the execution of the algorithm through the use of a new genetic operator. This proposed approach was tested on five data sets and serves as an initial attempt at dynamically altering the intervals of GP decision trees while simultaneously searching for an optimal solution during the learning phase. The proposed method performs well when compared to other non-GP adaptive methods.\",\"PeriodicalId\":235292,\"journal\":{\"name\":\"2013 Third World Congress on Information and Communication Technologies (WICT 2013)\",\"volume\":\"64 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 Third World Congress on Information and Communication Technologies (WICT 2013)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WICT.2013.7113123\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 Third World Congress on Information and Communication Technologies (WICT 2013)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WICT.2013.7113123","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Incorporating adaptive discretization into genetic programming for data classification
Genetic programming (GP) for data classification using decision trees has been successful in creating models which obtain high classification accuracies. When categorical data is used GP is able to directly use decision trees to create models, however when the data contains continuous attributes discretization is required as a pre-processing step prior to learning. There has been no attempt to incorporate the discretization mechanism into the GP algorithm and this serves as the rationale for this paper. This paper proposes an adaptive discretization method for inclusion into the GP algorithm by randomly creating intervals during the execution of the algorithm through the use of a new genetic operator. This proposed approach was tested on five data sets and serves as an initial attempt at dynamically altering the intervals of GP decision trees while simultaneously searching for an optimal solution during the learning phase. The proposed method performs well when compared to other non-GP adaptive methods.