Genetic Based ID3 Classification Algorithm Diagnosis and Prognosis of Oral Cancer

International Journal of Data Mining Techniques and Applications Pub Date : 2016-12-15 DOI:10.20894/IJDMTA.102.005.002.009

K. Jamberi, E. Ramaraj

{"title":"Genetic Based ID3 Classification Algorithm Diagnosis and Prognosis of Oral Cancer","authors":"K. Jamberi, E. Ramaraj","doi":"10.20894/IJDMTA.102.005.002.009","DOIUrl":null,"url":null,"abstract":": In order to analyse the chosen data from various points of view, data mining is used as the effective process. This process is also used to sum-up all those views into useful information. There are several types of algorithms in data mining such as Classification algorithms, Regression, Segmentation algorithms, association algorithms, sequence analysis algorithms, etc.,. The classification algorithm can be usedto bifurcate the data set from the given data set and foretell one or more discrete variables, based on the other attributes in the dataset. The ID3 (Iterative Dichotomiser 3) algorithm is an original data set S as the root node. An unutilised attribute of the data set S calculates the entropy H(S) (or Information gain IG (A)) of the attribute. Upon its selection, the attribute should have the smallest entropy (or largest information gain) value. A genetic algorithm (GA) is a heuristic quest that imitates the process of natural selection. Genetic algorithm can easily select cancer data set, from the given data set using GA operators, such as mutation, selection, and crossover. A method existed earlier (KNN+GA) was not successful for oral cancer and primary tumor. Our method of creating new algorithm GA+ID3 easily identifiesoral cancer data set from the given data set. The genetic based ID3 classification algorithm diagnosis and prognosis of oral cancer data set is identified by this paper.","PeriodicalId":414709,"journal":{"name":"International Journal of Data Mining Techniques and Applications","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Data Mining Techniques and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20894/IJDMTA.102.005.002.009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

: In order to analyse the chosen data from various points of view, data mining is used as the effective process. This process is also used to sum-up all those views into useful information. There are several types of algorithms in data mining such as Classification algorithms, Regression, Segmentation algorithms, association algorithms, sequence analysis algorithms, etc.,. The classification algorithm can be usedto bifurcate the data set from the given data set and foretell one or more discrete variables, based on the other attributes in the dataset. The ID3 (Iterative Dichotomiser 3) algorithm is an original data set S as the root node. An unutilised attribute of the data set S calculates the entropy H(S) (or Information gain IG (A)) of the attribute. Upon its selection, the attribute should have the smallest entropy (or largest information gain) value. A genetic algorithm (GA) is a heuristic quest that imitates the process of natural selection. Genetic algorithm can easily select cancer data set, from the given data set using GA operators, such as mutation, selection, and crossover. A method existed earlier (KNN+GA) was not successful for oral cancer and primary tumor. Our method of creating new algorithm GA+ID3 easily identifiesoral cancer data set from the given data set. The genetic based ID3 classification algorithm diagnosis and prognosis of oral cancer data set is identified by this paper.

查看原文本刊更多论文

基于遗传的ID3分类算法对口腔癌诊断及预后的影响

为了从不同的角度分析所选择的数据，数据挖掘是一种有效的方法。该过程还用于将所有这些视图总结为有用的信息。在数据挖掘中有几种类型的算法，如分类算法、回归算法、分割算法、关联算法、序列分析算法等。该分类算法可用于从给定数据集中分岔数据集，并根据数据集中的其他属性预测一个或多个离散变量。ID3 (Iterative Dichotomiser 3)算法是将原始数据集S作为根节点。数据集S的未使用属性计算该属性的熵H(S)(或信息增益IG (A))。选择后，属性应该具有最小的熵(或最大的信息增益)值。遗传算法(GA)是一种模仿自然选择过程的启发式探索。遗传算法可以很容易地选择癌症数据集，从给定的数据集中使用遗传算子，如突变、选择和交叉。早期存在的一种方法(KNN+GA)对口腔癌和原发肿瘤不成功。我们创建的新算法GA+ID3很容易从给定的数据集中识别口腔癌数据集。本文提出了基于遗传的口腔癌数据集诊断与预后的ID3分类算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Data Mining Techniques and Applications

自引率

0.00%

发文量