Yue Yang , Jihong Wan , Xiaoping Li , Xiaoling Yang , Hongmei Chen , Kay Chen Tan , Chris Cornelis
{"title":"Rational linear kernelized weighted fuzzy rough attribute selection with class separability","authors":"Yue Yang , Jihong Wan , Xiaoping Li , Xiaoling Yang , Hongmei Chen , Kay Chen Tan , Chris Cornelis","doi":"10.1016/j.fss.2025.109600","DOIUrl":null,"url":null,"abstract":"<div><div>Attribute selection is widely used in data mining to reduce dimensionality and computational overhead, which improves the generalization and efficiency of machine learning models. However, it is a challenging task to select the most representative subset of attributes from data characterized by uncertainty, sparsity, and heterogeneity. Most existing fuzzy rough set-based attribute selection methods focus on improving specific characteristics without a comprehensive consideration, limiting their effectiveness. Moreover, these methods overlook the class distribution information in data, which leads to poor representation of the selected attribute subset. Motivated by these issues, a Rational Linear kernelized weighted fuzzy rough attribute selection method with class separability (RLWAS-CS) is proposed in this paper. A Rational Linear (RL) kernelized fuzzy similarity relation, derived from the Rational Quadratic kernel and mixed attribute distance measure, is first defined to accurately capture sample similarities in sparse and heterogeneous space. On this basis, an RL kernelized weighted fuzzy rough set model (RLWFRS) is proposed. In this model, the weight comprehensively reflects the membership between the sample and each class in the complete attribute space, which improves the discriminability of fuzzy approximation membership and enhances its ability to handle uncertainty. Additionally, an attribute evaluation function is designed based on the RLWFRS model. It integrates class separability and captures both the class distribution characteristics and the dynamic relation between attribute separability and redundancy. Finally, the RLWAS-CS algorithm is presented for the attribute selection. Experimental results show that RLWAS-CS outperforms baseline methods in effectiveness.</div></div>","PeriodicalId":55130,"journal":{"name":"Fuzzy Sets and Systems","volume":"521 ","pages":"Article 109600"},"PeriodicalIF":2.7000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fuzzy Sets and Systems","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165011425003392","RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Attribute selection is widely used in data mining to reduce dimensionality and computational overhead, which improves the generalization and efficiency of machine learning models. However, it is a challenging task to select the most representative subset of attributes from data characterized by uncertainty, sparsity, and heterogeneity. Most existing fuzzy rough set-based attribute selection methods focus on improving specific characteristics without a comprehensive consideration, limiting their effectiveness. Moreover, these methods overlook the class distribution information in data, which leads to poor representation of the selected attribute subset. Motivated by these issues, a Rational Linear kernelized weighted fuzzy rough attribute selection method with class separability (RLWAS-CS) is proposed in this paper. A Rational Linear (RL) kernelized fuzzy similarity relation, derived from the Rational Quadratic kernel and mixed attribute distance measure, is first defined to accurately capture sample similarities in sparse and heterogeneous space. On this basis, an RL kernelized weighted fuzzy rough set model (RLWFRS) is proposed. In this model, the weight comprehensively reflects the membership between the sample and each class in the complete attribute space, which improves the discriminability of fuzzy approximation membership and enhances its ability to handle uncertainty. Additionally, an attribute evaluation function is designed based on the RLWFRS model. It integrates class separability and captures both the class distribution characteristics and the dynamic relation between attribute separability and redundancy. Finally, the RLWAS-CS algorithm is presented for the attribute selection. Experimental results show that RLWAS-CS outperforms baseline methods in effectiveness.
期刊介绍:
Since its launching in 1978, the journal Fuzzy Sets and Systems has been devoted to the international advancement of the theory and application of fuzzy sets and systems. The theory of fuzzy sets now encompasses a well organized corpus of basic notions including (and not restricted to) aggregation operations, a generalized theory of relations, specific measures of information content, a calculus of fuzzy numbers. Fuzzy sets are also the cornerstone of a non-additive uncertainty theory, namely possibility theory, and of a versatile tool for both linguistic and numerical modeling: fuzzy rule-based systems. Numerous works now combine fuzzy concepts with other scientific disciplines as well as modern technologies.
In mathematics fuzzy sets have triggered new research topics in connection with category theory, topology, algebra, analysis. Fuzzy sets are also part of a recent trend in the study of generalized measures and integrals, and are combined with statistical methods. Furthermore, fuzzy sets have strong logical underpinnings in the tradition of many-valued logics.