{"title":"Attribute subset selection by mixed weighting mean classification method","authors":"Adidela Daveedu Raju, M. N. Sri, G. L. Devi","doi":"10.1109/ICEEOT.2016.7755359","DOIUrl":null,"url":null,"abstract":"The discovery of knowledge from the huge available data is the highest mount setback in practical pattern classification and knowledge discovery problem. The preprocessing of data plays a major role in knowledge discovery as it consequently improves the accuracy of the classifier. One of the preprocessing techniques, attribute subset selection has major importance as the selection leads to better performance of the classifier and the cost of the classification is sensitive to the choice of attributes that used to construct the classifier. This paper proposes a new attribute subset selection method named as Mixed Weighting Mean Classification (MWM-C) method. It evaluates the weights of the available attributes by using 5 major weighting functions such as information gain, information gain ratio, gini index, correlation, chi squared statistic. The five methods are chosen to bias the results of one another. The proposed method is examined on soybean data set and conferred satisfactory results.","PeriodicalId":383674,"journal":{"name":"2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEEOT.2016.7755359","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The discovery of knowledge from the huge available data is the highest mount setback in practical pattern classification and knowledge discovery problem. The preprocessing of data plays a major role in knowledge discovery as it consequently improves the accuracy of the classifier. One of the preprocessing techniques, attribute subset selection has major importance as the selection leads to better performance of the classifier and the cost of the classification is sensitive to the choice of attributes that used to construct the classifier. This paper proposes a new attribute subset selection method named as Mixed Weighting Mean Classification (MWM-C) method. It evaluates the weights of the available attributes by using 5 major weighting functions such as information gain, information gain ratio, gini index, correlation, chi squared statistic. The five methods are chosen to bias the results of one another. The proposed method is examined on soybean data set and conferred satisfactory results.