Feature Selection and Discretization based on Mutual Information

S. Sharmin, A. Ali, Muhammad Asif Hossain Khan, M. Shoyaib

Published in: 2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR)
DOI: 10.1109/ICIVPR.2017.7890885 (https://doi.org/10.1109/ICIVPR.2017.7890885)
Citations: 19

Abstract

Feature selection and discretization are both important research topics in pattern recognition and data mining. However, existing research rarely addresses the two problems together. This paper addresses both with a heuristic called discretization and selection of features based on mutual information (DSM). Experimental results on 15 datasets show that the proposed DSM outperforms a number of state-of-the-art feature selection or discretization algorithms. On average, with a Support Vector Machine classifier, its accuracy surpasses that of the best-performing state-of-the-art algorithms by 5%. Moreover, for datasets with a large number of features, it achieves promising accuracy with far fewer features than the competing algorithms.
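
The abstract does not describe how DSM works internally, so the following is not the authors' algorithm. It is only a minimal sketch of the quantity the method is built on: the mutual information between a discretized feature and the class label, used here to rank features. The equal-width binning, the function names (`equal_width_bins`, `mutual_information`, `rank_features`), and the toy data are all assumptions made for illustration.

```python
import math
from collections import Counter


def equal_width_bins(values, n_bins):
    """Map each continuous value to a bin index in [0, n_bins - 1].

    Equal-width binning is an assumed stand-in; DSM's own
    discretization rule is not given in the abstract.
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0] * len(values)
    width = (hi - lo) / n_bins
    # The maximum value would land in bin n_bins, so clamp it.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def mutual_information(xs, ys):
    """Plug-in estimate of I(X; Y) in bits from paired discrete samples."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px = Counter(xs)
    py = Counter(ys)
    mi = 0.0
    for (x, y), count in joint.items():
        # p(x, y) * log2(p(x, y) / (p(x) * p(y))), written with raw counts
        mi += (count / n) * math.log2(count * n / (px[x] * py[y]))
    return mi


def rank_features(columns, labels, n_bins=2):
    """Return (name, score) pairs sorted by decreasing mutual information."""
    scores = {
        name: mutual_information(equal_width_bins(col, n_bins), labels)
        for name, col in columns.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data: one feature tracks the label, one does not.
labels = [0, 0, 1, 1]
columns = {
    "informative": [0.1, 0.2, 0.9, 1.0],
    "noise": [0.1, 0.9, 0.2, 1.0],
}
ranking = rank_features(columns, labels)
```

In this sketch the bin count is fixed in advance and each feature is scored independently. A joint method such as the one the abstract describes would choose the discretization and the feature subset together, which this example does not attempt.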