A Bi-Search Evolutionary Algorithm for High-Dimensional Bi-Objective Feature Selection

IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Hang Xu;Bing Xue;Mengjie Zhang
{"title":"A Bi-Search Evolutionary Algorithm for High-Dimensional Bi-Objective Feature Selection","authors":"Hang Xu;Bing Xue;Mengjie Zhang","doi":"10.1109/TETCI.2024.3393388","DOIUrl":null,"url":null,"abstract":"High dimensionality often challenges the efficiency and accuracy of a classifier, while evolutionary feature selection is an effective method for data preprocessing and dimensionality reduction. However, with the exponential expansion of search space along with the increase of features, traditional evolutionary feature selection methods could still find it difficult to search for optimal or near optimal solutions in the large-scale search space. To overcome the above issue, in this paper, we propose a bi-search evolutionary algorithm (termed BSEA) for tackling high-dimensional feature selection in classification, with two contradictory optimizing objectives (i.e., minimizing both selected features and classification errors). In BSEA, a bi-search evolutionary mode combining the forward and backward searching tasks is adopted to enhance the search ability in the large-scale search space; in addition, an adaptive feature analysis mechanism is also designed to the explore promising features for efficiently reproducing more diverse offspring. In the experiments, BSEA is comprehensively compared with 9 most recent or classic state-of-the-art MOEAs on a series of 11 high-dimensional datasets with no less than 2000 features. The empirical results suggest that BSEA generally performs the best on most of the datasets in terms of all performance metrics, along with high computational efficiency, while each of its essential components can take positive effect on boosting the search ability and together make the best contribution.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"8 5","pages":"3489-3502"},"PeriodicalIF":5.3000,"publicationDate":"2024-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Emerging Topics in Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10510502/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

High dimensionality often challenges the efficiency and accuracy of a classifier, while evolutionary feature selection is an effective method for data preprocessing and dimensionality reduction. However, with the exponential expansion of search space along with the increase of features, traditional evolutionary feature selection methods could still find it difficult to search for optimal or near optimal solutions in the large-scale search space. To overcome the above issue, in this paper, we propose a bi-search evolutionary algorithm (termed BSEA) for tackling high-dimensional feature selection in classification, with two contradictory optimizing objectives (i.e., minimizing both selected features and classification errors). In BSEA, a bi-search evolutionary mode combining the forward and backward searching tasks is adopted to enhance the search ability in the large-scale search space; in addition, an adaptive feature analysis mechanism is also designed to the explore promising features for efficiently reproducing more diverse offspring. In the experiments, BSEA is comprehensively compared with 9 most recent or classic state-of-the-art MOEAs on a series of 11 high-dimensional datasets with no less than 2000 features. The empirical results suggest that BSEA generally performs the best on most of the datasets in terms of all performance metrics, along with high computational efficiency, while each of its essential components can take positive effect on boosting the search ability and together make the best contribution.
用于高维双目标特征选择的双搜索进化算法
高维度往往对分类器的效率和准确性提出挑战,而进化特征选择是一种有效的数据预处理和降维方法。然而,随着特征的增加,搜索空间也呈指数级扩展,传统的进化特征选择方法仍然难以在大规模搜索空间中搜索到最优或接近最优的解。为了克服上述问题,本文提出了一种双搜索进化算法(称为 BSEA),用于解决分类中的高维特征选择问题,该算法具有两个相互矛盾的优化目标(即同时最小化所选特征和分类误差)。BSEA 采用前向搜索和后向搜索相结合的双搜索进化模式,以增强在大规模搜索空间中的搜索能力;此外,还设计了自适应特征分析机制,以发掘有潜力的特征,从而有效地繁殖出更多样化的后代。在实验中,BSEA 与 9 种最新或最经典的 MOEA 在 11 个高维数据集上进行了综合比较,这些数据集包含不少于 2000 个特征。实证结果表明,在大多数数据集上,BSEA 在所有性能指标方面的表现都是最好的,同时计算效率也很高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
10.30
自引率
7.50%
发文量
147
期刊介绍: The IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI) publishes original articles on emerging aspects of computational intelligence, including theory, applications, and surveys. TETCI is an electronics only publication. TETCI publishes six issues per year. Authors are encouraged to submit manuscripts in any emerging topic in computational intelligence, especially nature-inspired computing topics not covered by other IEEE Computational Intelligence Society journals. A few such illustrative examples are glial cell networks, computational neuroscience, Brain Computer Interface, ambient intelligence, non-fuzzy computing with words, artificial life, cultural learning, artificial endocrine networks, social reasoning, artificial hormone networks, computational intelligence for the IoT and Smart-X technologies.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信