Dimensionality Reduction with a Composite-Selective Strategy in Documents with a Hybrid Content

S. Raheel
{"title":"Dimensionality Reduction with a Composite-Selective Strategy in Documents with a Hybrid Content","authors":"S. Raheel","doi":"10.1109/AIMS.2015.28","DOIUrl":null,"url":null,"abstract":"Feature selection is the process of choosing a subset of the available features or attributes from a certain dataset in order to render the process of building a predictive model more efficient and accurate. The selection of attributes is, in most of the times, done sequentially. In this paper we propose a new filtering strategy that selects the attributes in a composite way rather than sequential. The advantage of this approach is that it allows for an important number of features that are highly relevant to their classes but statistically insignificant to participate in the learning process of the classifier. Results show that this new approach is promising and as good as the traditional one. Higher accuracy is reached when the number of the infrequent features increases. This approach is useful when we need for the infrequent features to be part of the predictive model since this, in turn, enforces the subjectivity of the decision made by the classifier.","PeriodicalId":121874,"journal":{"name":"2015 3rd International Conference on Artificial Intelligence, Modelling and Simulation (AIMS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 3rd International Conference on Artificial Intelligence, Modelling and Simulation (AIMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIMS.2015.28","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Feature selection is the process of choosing a subset of the available features or attributes from a certain dataset in order to render the process of building a predictive model more efficient and accurate. The selection of attributes is, in most of the times, done sequentially. In this paper we propose a new filtering strategy that selects the attributes in a composite way rather than sequential. The advantage of this approach is that it allows for an important number of features that are highly relevant to their classes but statistically insignificant to participate in the learning process of the classifier. Results show that this new approach is promising and as good as the traditional one. Higher accuracy is reached when the number of the infrequent features increases. This approach is useful when we need for the infrequent features to be part of the predictive model since this, in turn, enforces the subjectivity of the decision made by the classifier.
混合内容文档的复合选择降维策略
特征选择是从某个数据集中选择可用特征或属性的子集的过程,以使构建预测模型的过程更加高效和准确。在大多数情况下,属性的选择是顺序完成的。在本文中,我们提出了一种新的过滤策略,以复合的方式而不是顺序的方式选择属性。这种方法的优点是,它允许大量与其类高度相关但在统计上不显著的特征参与分类器的学习过程。结果表明,该方法具有良好的应用前景,与传统方法的效果相当。当非频繁特征的数量增加时,达到更高的精度。当我们需要将不常见的特征作为预测模型的一部分时,这种方法很有用,因为这反过来又加强了分类器做出决策的主观性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信