A Supervised Feature Selection Algorithm through Minimum Spanning Tree Clustering

Qin Liu, Jingxiao Zhang, Jiakai Xiao, Hongming Zhu, Qinpei Zhao
{"title":"A Supervised Feature Selection Algorithm through Minimum Spanning Tree Clustering","authors":"Qin Liu, Jingxiao Zhang, Jiakai Xiao, Hongming Zhu, Qinpei Zhao","doi":"10.1109/ICTAI.2014.47","DOIUrl":null,"url":null,"abstract":"In different types of feature selection algorithms, feature clustering is an emerging subset generation paradigm. In this paper, a Minimum spanning tree based Feature Clustering (MFC) algorithm is proposed. In the algorithm, an information-theoretic based measure, i.e., Variation of information, is utilized as the feature redundancy and relevance metric. At the clustering phase, the sum of pair wise feature redundancy is minimized. Then, a representative feature is selected from each cluster, where the relevance between representative features and the target label is maximized. The algorithm is supervised since it is designed for various supervised learning problems, such as classification and regression. The proposed MFC is compared with three conventional feature selection algorithms, two of which are also feature clustering method. The MFC obtains well separated feature clusters in the experiment and considerable better classification accuracies applied on several real data sets.","PeriodicalId":142794,"journal":{"name":"2014 IEEE 26th International Conference on Tools with Artificial Intelligence","volume":"189 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE 26th International Conference on Tools with Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTAI.2014.47","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

Abstract

In different types of feature selection algorithms, feature clustering is an emerging subset generation paradigm. In this paper, a Minimum spanning tree based Feature Clustering (MFC) algorithm is proposed. In the algorithm, an information-theoretic based measure, i.e., Variation of information, is utilized as the feature redundancy and relevance metric. At the clustering phase, the sum of pair wise feature redundancy is minimized. Then, a representative feature is selected from each cluster, where the relevance between representative features and the target label is maximized. The algorithm is supervised since it is designed for various supervised learning problems, such as classification and regression. The proposed MFC is compared with three conventional feature selection algorithms, two of which are also feature clustering method. The MFC obtains well separated feature clusters in the experiment and considerable better classification accuracies applied on several real data sets.
基于最小生成树聚类的有监督特征选择算法
在不同类型的特征选择算法中,特征聚类是一种新兴的子集生成范式。提出了一种基于最小生成树的特征聚类(MFC)算法。该算法采用基于信息理论的度量,即信息变异量作为特征冗余度和相关性度量。在聚类阶段,最小化成对特征冗余的总和。然后,从每个聚类中选择一个代表性特征,使代表性特征与目标标签之间的相关性最大化。该算法是有监督的,因为它是为各种监督学习问题而设计的,比如分类和回归。将该方法与三种传统的特征选择算法进行了比较,其中两种算法也是特征聚类方法。MFC在实验中得到了分离良好的特征聚类,在多个真实数据集上得到了相当好的分类精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信