Unsupervised feature selection for linked data

Rachana T. Nemade, R. Makhijani
{"title":"Unsupervised feature selection for linked data","authors":"Rachana T. Nemade, R. Makhijani","doi":"10.1109/ICRAIE.2014.6909131","DOIUrl":null,"url":null,"abstract":"The widespread use of social media web sites gives high dimensional linked data. For limiting the amount and dimensionality of the data, feature subset selection is an effective way which selects features that correlate well with the target class. The high dimensional linked data from social media web sites lacks the availability of label information. So feature selection for linked data remains a challenging task. By using the link information feature relevance assessment is done. In this paper, we propose the unsupervised feature selection from linked data, UFSLD algorithm. The UFSLD algorithm works in three steps. In the first step, the interdependency among the linked data is exploited and the relevant features are selected. In the second step, the features from first step are classified to form the clusters by using graph-theoretic clustering method. In the third step, the most representative feature from each cluster is selected to form a subset of features. MST clustering method is used to ensure the efficiency of this algorithm. Experiments are conducted to compare UFSLD with one unsupervised and another supervised feature selection algorithm and the effectiveness of this algorithm is evaluated.","PeriodicalId":355706,"journal":{"name":"International Conference on Recent Advances and Innovations in Engineering (ICRAIE-2014)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Recent Advances and Innovations in Engineering (ICRAIE-2014)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRAIE.2014.6909131","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The widespread use of social media web sites gives high dimensional linked data. For limiting the amount and dimensionality of the data, feature subset selection is an effective way which selects features that correlate well with the target class. The high dimensional linked data from social media web sites lacks the availability of label information. So feature selection for linked data remains a challenging task. By using the link information feature relevance assessment is done. In this paper, we propose the unsupervised feature selection from linked data, UFSLD algorithm. The UFSLD algorithm works in three steps. In the first step, the interdependency among the linked data is exploited and the relevant features are selected. In the second step, the features from first step are classified to form the clusters by using graph-theoretic clustering method. In the third step, the most representative feature from each cluster is selected to form a subset of features. MST clustering method is used to ensure the efficiency of this algorithm. Experiments are conducted to compare UFSLD with one unsupervised and another supervised feature selection algorithm and the effectiveness of this algorithm is evaluated.
链接数据的无监督特征选择
社交媒体网站的广泛使用提供了高维关联数据。为了限制数据的数量和维数,特征子集选择是一种有效的方法,它选择与目标类相关的特征。社交媒体网站的高维关联数据缺乏标签信息的可用性。因此,关联数据的特征选择仍然是一项具有挑战性的任务。利用链接信息特征进行相关性评估。本文提出了一种基于关联数据的无监督特征选择(UFSLD)算法。UFSLD算法分为三个步骤。第一步,利用关联数据之间的相互依赖性,选择相关特征。第二步,利用图论聚类方法对第一步得到的特征进行分类,形成聚类。第三步,从每个聚类中选择最具代表性的特征组成特征子集。为了保证算法的有效性,采用了MST聚类方法。通过实验将UFSLD与一种无监督和一种有监督的特征选择算法进行了比较,并对该算法的有效性进行了评价。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信