鲁棒特征选择的联合不确定性模型和度量:一种双层分布考虑和特征评估方法

IF 2.7 1区 数学 Q2 COMPUTER SCIENCE, THEORY & METHODS
Jihong Wan , Xiaoping Li , Jie Zhao , Min Li , Zhixuan Deng , Hongmei Chen
{"title":"鲁棒特征选择的联合不确定性模型和度量:一种双层分布考虑和特征评估方法","authors":"Jihong Wan ,&nbsp;Xiaoping Li ,&nbsp;Jie Zhao ,&nbsp;Min Li ,&nbsp;Zhixuan Deng ,&nbsp;Hongmei Chen","doi":"10.1016/j.fss.2025.109615","DOIUrl":null,"url":null,"abstract":"<div><div>Excavating discriminative features in uncertain data environment is essential and challenging. Randomness, fuzziness, and inconsistency are the main characteristics of uncertain data. For the uncertainty in data, fuzzy rough set (FRS) is considered as a powerful analytical model. However, the classical FRS model is susceptible to noisy information. Moreover, fewer existing works utilize robust evaluation metrics to explore complex feature correlations in high-dimensional spaces, which leads to the loss of some important discriminative information and the selection results are not robust. Therefore, this paper proposes a joint uncertainty model and metric for robust feature selection method with bi-level distribution consideration and feature evaluation (JBRFS). The robust FRS model and the robust uncertainty metric are firstly constructed by twofold consideration of data distribution characteristics (i.e., the density distribution of heterogeneous samples and the statistical distribution of different features). Then, the bi-level feature evaluation (i.e., one-order individual feature relevance and two-order pairwise feature correlation) is devised by jointing the robust uncertainty model and metric. Finally, a novel feature selection strategy of max-fuzzy relative dependency, max-significance and interaction is proposed. The experimental results show that JBRFS can select features with discriminative ability, and it is verified to have significant advantages in most cases by nine comparison algorithms and the ablation experiment.</div></div>","PeriodicalId":55130,"journal":{"name":"Fuzzy Sets and Systems","volume":"523 ","pages":"Article 109615"},"PeriodicalIF":2.7000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Joint uncertainty model and metric for robust feature selection: A bi-level distribution consideration and feature evaluation approach\",\"authors\":\"Jihong Wan ,&nbsp;Xiaoping Li ,&nbsp;Jie Zhao ,&nbsp;Min Li ,&nbsp;Zhixuan Deng ,&nbsp;Hongmei Chen\",\"doi\":\"10.1016/j.fss.2025.109615\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Excavating discriminative features in uncertain data environment is essential and challenging. Randomness, fuzziness, and inconsistency are the main characteristics of uncertain data. For the uncertainty in data, fuzzy rough set (FRS) is considered as a powerful analytical model. However, the classical FRS model is susceptible to noisy information. Moreover, fewer existing works utilize robust evaluation metrics to explore complex feature correlations in high-dimensional spaces, which leads to the loss of some important discriminative information and the selection results are not robust. Therefore, this paper proposes a joint uncertainty model and metric for robust feature selection method with bi-level distribution consideration and feature evaluation (JBRFS). The robust FRS model and the robust uncertainty metric are firstly constructed by twofold consideration of data distribution characteristics (i.e., the density distribution of heterogeneous samples and the statistical distribution of different features). Then, the bi-level feature evaluation (i.e., one-order individual feature relevance and two-order pairwise feature correlation) is devised by jointing the robust uncertainty model and metric. Finally, a novel feature selection strategy of max-fuzzy relative dependency, max-significance and interaction is proposed. The experimental results show that JBRFS can select features with discriminative ability, and it is verified to have significant advantages in most cases by nine comparison algorithms and the ablation experiment.</div></div>\",\"PeriodicalId\":55130,\"journal\":{\"name\":\"Fuzzy Sets and Systems\",\"volume\":\"523 \",\"pages\":\"Article 109615\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2025-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Fuzzy Sets and Systems\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0165011425003549\",\"RegionNum\":1,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fuzzy Sets and Systems","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165011425003549","RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0

摘要

在不确定的数据环境中挖掘识别特征是非常必要的,也是具有挑战性的。随机性、模糊性和不一致性是不确定数据的主要特征。对于数据的不确定性,模糊粗糙集(FRS)被认为是一种强大的分析模型。然而,经典的FRS模型容易受到噪声信息的影响。此外,利用鲁棒性评价指标来探索高维空间中复杂特征相关性的研究较少,导致一些重要的判别信息丢失,选择结果不够鲁棒。因此,本文提出了一种联合不确定性模型和度量的双水平分布考虑和特征评价鲁棒特征选择方法(JBRFS)。首先从数据分布特征(即异质样本的密度分布和不同特征的统计分布)两方面考虑构建鲁棒FRS模型和鲁棒不确定性度量。然后,结合鲁棒不确定性模型和度量,设计了一阶个体特征关联和二阶两两特征关联的双层特征评价方法;最后,提出了一种基于最大模糊相对依赖、最大显著性和交互的特征选择策略。实验结果表明,JBRFS可以选择具有判别能力的特征,并通过9种比较算法和烧蚀实验验证了JBRFS在大多数情况下具有显著的优势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Joint uncertainty model and metric for robust feature selection: A bi-level distribution consideration and feature evaluation approach
Excavating discriminative features in uncertain data environment is essential and challenging. Randomness, fuzziness, and inconsistency are the main characteristics of uncertain data. For the uncertainty in data, fuzzy rough set (FRS) is considered as a powerful analytical model. However, the classical FRS model is susceptible to noisy information. Moreover, fewer existing works utilize robust evaluation metrics to explore complex feature correlations in high-dimensional spaces, which leads to the loss of some important discriminative information and the selection results are not robust. Therefore, this paper proposes a joint uncertainty model and metric for robust feature selection method with bi-level distribution consideration and feature evaluation (JBRFS). The robust FRS model and the robust uncertainty metric are firstly constructed by twofold consideration of data distribution characteristics (i.e., the density distribution of heterogeneous samples and the statistical distribution of different features). Then, the bi-level feature evaluation (i.e., one-order individual feature relevance and two-order pairwise feature correlation) is devised by jointing the robust uncertainty model and metric. Finally, a novel feature selection strategy of max-fuzzy relative dependency, max-significance and interaction is proposed. The experimental results show that JBRFS can select features with discriminative ability, and it is verified to have significant advantages in most cases by nine comparison algorithms and the ablation experiment.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Fuzzy Sets and Systems
Fuzzy Sets and Systems 数学-计算机:理论方法
CiteScore
6.50
自引率
17.90%
发文量
321
审稿时长
6.1 months
期刊介绍: Since its launching in 1978, the journal Fuzzy Sets and Systems has been devoted to the international advancement of the theory and application of fuzzy sets and systems. The theory of fuzzy sets now encompasses a well organized corpus of basic notions including (and not restricted to) aggregation operations, a generalized theory of relations, specific measures of information content, a calculus of fuzzy numbers. Fuzzy sets are also the cornerstone of a non-additive uncertainty theory, namely possibility theory, and of a versatile tool for both linguistic and numerical modeling: fuzzy rule-based systems. Numerous works now combine fuzzy concepts with other scientific disciplines as well as modern technologies. In mathematics fuzzy sets have triggered new research topics in connection with category theory, topology, algebra, analysis. Fuzzy sets are also part of a recent trend in the study of generalized measures and integrals, and are combined with statistical methods. Furthermore, fuzzy sets have strong logical underpinnings in the tradition of many-valued logics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信