{"title":"Genetic algorithm based multiple decision tree induction","authors":"Z. Bandar, H. Al-Attar, D. Mclean","doi":"10.1109/ICONIP.1999.845633","DOIUrl":null,"url":null,"abstract":"There are two fundamental weaknesses which may have a great impact on the performance of decision tree (DT) induction. These are the limitations in the ability of the DT language to represent some of the underlying patterns of the domain and the degradation in the quality of evidence available to the induction process caused by its recursive partitioning of the training data. The impact of these two weaknesses is greatest when the induction process attempts to overcome the first weakness by resorting to more partitioning of the training data, thus increasing its vulnerability to the second weakness. The authors investigate the use of multiple DT models as a method of overcoming the limitations of the DT modeling language and describe a new and novel algorithm to automatically generate multiple DT models from the same training data. The algorithm is compared to a single-tree classifier by experiments on two well known data sets. Results clearly demonstrate the superiority of our algorithm.","PeriodicalId":237855,"journal":{"name":"ICONIP'99. ANZIIS'99 & ANNES'99 & ACNN'99. 6th International Conference on Neural Information Processing. Proceedings (Cat. No.99EX378)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ICONIP'99. ANZIIS'99 & ANNES'99 & ACNN'99. 6th International Conference on Neural Information Processing. Proceedings (Cat. No.99EX378)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICONIP.1999.845633","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 14
Abstract
There are two fundamental weaknesses that can have a great impact on the performance of decision tree (DT) induction. These are the limited ability of the DT language to represent some of the underlying patterns of the domain, and the degradation in the quality of evidence available to the induction process caused by its recursive partitioning of the training data. The impact of these two weaknesses is greatest when the induction process attempts to overcome the first weakness by resorting to further partitioning of the training data, thus increasing its vulnerability to the second weakness. The authors investigate the use of multiple DT models as a method of overcoming the limitations of the DT modelling language and describe a novel algorithm to automatically generate multiple DT models from the same training data. The algorithm is compared to a single-tree classifier through experiments on two well known data sets. Results clearly demonstrate the superiority of our algorithm.
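The abstract does not specify the paper's genome encoding, genetic operators, or fitness function, so the following is only a minimal illustrative sketch of the general idea: evolve a population of decision trees (here, ordinary scikit-learn trees whose "genome" is a hyperparameter and feature-subset pair) with selection, crossover, and mutation, then combine the evolved trees by majority vote and compare against the best single tree. All names, parameters, and design choices below are assumptions, not the authors' algorithm.

```python
# Hypothetical sketch of GA-driven multiple decision tree induction.
# Genome (assumed): (max_depth, min_samples_leaf, boolean feature mask).
# For brevity the held-out split doubles as both fitness and evaluation
# data; a real experiment would use a separate validation set.
import random

import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

rng = random.Random(0)
X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)
n_features = X.shape[1]


def random_genome():
    mask = np.array([rng.random() < 0.5 for _ in range(n_features)])
    if not mask.any():
        mask[rng.randrange(n_features)] = True
    return {"max_depth": rng.randint(2, 10),
            "min_leaf": rng.randint(1, 20),
            "mask": mask}


def fitness(genome):
    """Train the tree described by a genome and score it on held-out data."""
    tree = DecisionTreeClassifier(max_depth=genome["max_depth"],
                                  min_samples_leaf=genome["min_leaf"],
                                  random_state=0)
    tree.fit(X_tr[:, genome["mask"]], y_tr)
    return tree.score(X_te[:, genome["mask"]], y_te), tree


def crossover(a, b):
    # Uniform crossover over the feature mask, random pick of hyperparameters.
    child_mask = np.where([rng.random() < 0.5 for _ in range(n_features)],
                          a["mask"], b["mask"])
    if not child_mask.any():
        child_mask[rng.randrange(n_features)] = True
    return {"max_depth": rng.choice([a["max_depth"], b["max_depth"]]),
            "min_leaf": rng.choice([a["min_leaf"], b["min_leaf"]]),
            "mask": child_mask}


def mutate(g, rate=0.1):
    if rng.random() < rate:
        g["max_depth"] = rng.randint(2, 10)
    if rng.random() < rate:
        g["min_leaf"] = rng.randint(1, 20)
    flip = np.array([rng.random() < rate for _ in range(n_features)])
    g["mask"] = np.logical_xor(g["mask"], flip)
    if not g["mask"].any():
        g["mask"][rng.randrange(n_features)] = True
    return g


# Evolve a small population of tree genomes for a few generations.
population = [random_genome() for _ in range(20)]
for generation in range(10):
    scored = sorted(population, key=lambda g: fitness(g)[0], reverse=True)
    parents = scored[:10]  # truncation selection
    children = [mutate(crossover(rng.choice(parents), rng.choice(parents)))
                for _ in range(10)]
    population = parents + children

# Combine the final population of trees by unweighted majority vote
# and compare against the best single evolved tree.
trees = [(g["mask"], fitness(g)[1]) for g in population]
votes = np.array([t.predict(X_te[:, m]) for m, t in trees])
ensemble_pred = (votes.mean(axis=0) > 0.5).astype(int)
print("best single tree accuracy:", max(fitness(g)[0] for g in population))
print("multi-tree vote accuracy :", (ensemble_pred == y_te).mean())
```

The vote here is unweighted and the fitness is plain hold-out accuracy; the paper's actual combination scheme and fitness criterion may differ, and the sketch is only meant to show how a GA can maintain a population of diverse trees induced from the same training data rather than a single recursively grown tree.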