S. Sasikala, S. Geetha, A. Christopher, S. Balamurugan
{"title":"A predictive model using improved Normalized Point Wise Mutual Information (INPMI)","authors":"S. Sasikala, S. Geetha, A. Christopher, S. Balamurugan","doi":"10.1109/ICTKE.2013.6756284","DOIUrl":null,"url":null,"abstract":"In machine learning, selection of optimal features for the classifier is a critical problem. In order to address this problem a novel feature selection method named “Improved Normalized Point wise Mutual Information (INPMI)” is proposed. The proposed INPMI method coupled with Sequential forward search (SFS) finds the best feature subset to aid feature selection process. The key properties of evaluating feature subset i.e. relevancy and redundancy are analysed well. The classifiers like Naive Bayes, Support Vector Machine and J48 are used to determine the accuracy for the choice of features selected. Experimental results with benchmark medical datasets from UCI (University of California Irvine) machine learning database show that proposed INPMI-NB model with SFS, INPMI-SVM model with SFS, INPMI-J48model with SFS achieves 98.36 %, 98.90 %, 94.53 % classification accuracy and selects 22 features for erythemato-squamous diseases. Also the proposed work is evaluated on a World Aircraft dataset to prove its generalization ability. Experimental results prove that the proposed INPMI method outperforms the existing systems.","PeriodicalId":122281,"journal":{"name":"2013 Eleventh International Conference on ICT and Knowledge Engineering","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 Eleventh International Conference on ICT and Knowledge Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTKE.2013.6756284","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In machine learning, selection of optimal features for the classifier is a critical problem. In order to address this problem a novel feature selection method named “Improved Normalized Point wise Mutual Information (INPMI)” is proposed. The proposed INPMI method coupled with Sequential forward search (SFS) finds the best feature subset to aid feature selection process. The key properties of evaluating feature subset i.e. relevancy and redundancy are analysed well. The classifiers like Naive Bayes, Support Vector Machine and J48 are used to determine the accuracy for the choice of features selected. Experimental results with benchmark medical datasets from UCI (University of California Irvine) machine learning database show that proposed INPMI-NB model with SFS, INPMI-SVM model with SFS, INPMI-J48model with SFS achieves 98.36 %, 98.90 %, 94.53 % classification accuracy and selects 22 features for erythemato-squamous diseases. Also the proposed work is evaluated on a World Aircraft dataset to prove its generalization ability. Experimental results prove that the proposed INPMI method outperforms the existing systems.
在机器学习中,为分类器选择最优特征是一个关键问题。为了解决这一问题,提出了一种新的特征选择方法“改进归一化逐点互信息(INPMI)”。提出的INPMI方法结合顺序前向搜索(SFS)找到最佳特征子集,以辅助特征选择过程。分析了评价特征子集的关键属性,即相关性和冗余性。使用朴素贝叶斯、支持向量机和J48等分类器来确定所选特征选择的准确性。基于UCI (University of California Irvine)机器学习数据库的基准医学数据集的实验结果表明,采用SFS的INPMI-NB模型、采用SFS的INPMI-SVM模型、采用SFS的inpmi - j48模型分别达到了98.36%、98.90%和94.53%的分类准确率,并选择了22个红斑鳞状疾病的特征。并在世界飞机数据集上进行了评估,以证明其泛化能力。实验结果表明,所提出的INPMI方法优于现有系统。