基于模糊基尼熵的双向互补在线流特征选择

IF 10.7 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Chucai Zhang;Zhengxiang Lu;Yongkang Zhang;Jianhua Dai
{"title":"基于模糊基尼熵的双向互补在线流特征选择","authors":"Chucai Zhang;Zhengxiang Lu;Yongkang Zhang;Jianhua Dai","doi":"10.1109/TFUZZ.2025.3529466","DOIUrl":null,"url":null,"abstract":"Online streaming feature selection has garnered widespread attention due to its efficiency and adaptability in dynamic data environments. However, existing methods primarily focus on the correlation and redundancy among features, often overlooking the complementarity between candidate and selected features. In this article, we address this gap by introducing three key innovations. First, we construct a novel metric, fuzzy Gini entropy (FGE), to measure feature uncertainty within datasets. Unlike traditional information entropy, fuzzy Gini entropy inherits the advantages of the Gini index, effectively measuring the impurity of datasets, while also being capable of handling common fuzzy environments. Accordingly, related metrics such as fuzzy joint Gini entropy, fuzzy conditional Gini entropy, and fuzzy mutual Gini information are developed. Second, we innovatively propose the concept of the bidirectional complementarity ratio, which captures the relationship between candidate features and previously selected features in online streaming feature selection. This mitigates the unfairness associated with the late arrival of features, ensuring that candidate features with a bidirectional complementary effect that outweighs their redundancy effect with the selected features are chosen. Third, we design an online streaming feature selection method named FGE-OSFS. The method evaluates streaming features through three steps: Online relevance analysis, online bidirectional complementarity analysis, and online redundancy analysis. Finally, we compare the proposed method with five state-of-the-art online streaming feature selection methods, demonstrating the effectiveness of our new approach.","PeriodicalId":13212,"journal":{"name":"IEEE Transactions on Fuzzy Systems","volume":"33 5","pages":"1592-1604"},"PeriodicalIF":10.7000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Online Streaming Feature Selection Using Bidirectional Complementarity Based on Fuzzy Gini Entropy\",\"authors\":\"Chucai Zhang;Zhengxiang Lu;Yongkang Zhang;Jianhua Dai\",\"doi\":\"10.1109/TFUZZ.2025.3529466\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Online streaming feature selection has garnered widespread attention due to its efficiency and adaptability in dynamic data environments. However, existing methods primarily focus on the correlation and redundancy among features, often overlooking the complementarity between candidate and selected features. In this article, we address this gap by introducing three key innovations. First, we construct a novel metric, fuzzy Gini entropy (FGE), to measure feature uncertainty within datasets. Unlike traditional information entropy, fuzzy Gini entropy inherits the advantages of the Gini index, effectively measuring the impurity of datasets, while also being capable of handling common fuzzy environments. Accordingly, related metrics such as fuzzy joint Gini entropy, fuzzy conditional Gini entropy, and fuzzy mutual Gini information are developed. Second, we innovatively propose the concept of the bidirectional complementarity ratio, which captures the relationship between candidate features and previously selected features in online streaming feature selection. This mitigates the unfairness associated with the late arrival of features, ensuring that candidate features with a bidirectional complementary effect that outweighs their redundancy effect with the selected features are chosen. Third, we design an online streaming feature selection method named FGE-OSFS. The method evaluates streaming features through three steps: Online relevance analysis, online bidirectional complementarity analysis, and online redundancy analysis. Finally, we compare the proposed method with five state-of-the-art online streaming feature selection methods, demonstrating the effectiveness of our new approach.\",\"PeriodicalId\":13212,\"journal\":{\"name\":\"IEEE Transactions on Fuzzy Systems\",\"volume\":\"33 5\",\"pages\":\"1592-1604\"},\"PeriodicalIF\":10.7000,\"publicationDate\":\"2025-01-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Fuzzy Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10839572/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Fuzzy Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10839572/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

在线流媒体特征选择因其在动态数据环境中的高效性和适应性而受到广泛关注。然而,现有的方法主要关注特征之间的相关性和冗余性,往往忽略了候选特征和选定特征之间的互补性。在本文中,我们通过引入三个关键创新来解决这一差距。首先,我们构建了一个新的度量,模糊基尼熵(FGE),以衡量数据集中的特征不确定性。与传统的信息熵不同,模糊基尼熵继承了基尼指数的优点,可以有效地衡量数据集的不纯性,同时也能够处理常见的模糊环境。据此,提出了模糊联合基尼熵、模糊条件基尼熵、模糊互基尼信息等相关测度。其次,我们创新地提出了双向互补比的概念,该概念捕获了在线流媒体特征选择中候选特征与先前选择特征之间的关系。这减轻了与特征延迟到达相关的不公平,确保具有双向互补效应的候选特征被选中,而这种互补效应超过了它们与所选特征的冗余效应。第三,设计了一种在线流媒体特征选择方法FGE-OSFS。该方法通过在线相关性分析、在线双向互补性分析和在线冗余分析三个步骤对流特征进行评估。最后,我们将所提出的方法与五种最先进的在线流媒体特征选择方法进行了比较,证明了我们的新方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Online Streaming Feature Selection Using Bidirectional Complementarity Based on Fuzzy Gini Entropy
Online streaming feature selection has garnered widespread attention due to its efficiency and adaptability in dynamic data environments. However, existing methods primarily focus on the correlation and redundancy among features, often overlooking the complementarity between candidate and selected features. In this article, we address this gap by introducing three key innovations. First, we construct a novel metric, fuzzy Gini entropy (FGE), to measure feature uncertainty within datasets. Unlike traditional information entropy, fuzzy Gini entropy inherits the advantages of the Gini index, effectively measuring the impurity of datasets, while also being capable of handling common fuzzy environments. Accordingly, related metrics such as fuzzy joint Gini entropy, fuzzy conditional Gini entropy, and fuzzy mutual Gini information are developed. Second, we innovatively propose the concept of the bidirectional complementarity ratio, which captures the relationship between candidate features and previously selected features in online streaming feature selection. This mitigates the unfairness associated with the late arrival of features, ensuring that candidate features with a bidirectional complementary effect that outweighs their redundancy effect with the selected features are chosen. Third, we design an online streaming feature selection method named FGE-OSFS. The method evaluates streaming features through three steps: Online relevance analysis, online bidirectional complementarity analysis, and online redundancy analysis. Finally, we compare the proposed method with five state-of-the-art online streaming feature selection methods, demonstrating the effectiveness of our new approach.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Transactions on Fuzzy Systems
IEEE Transactions on Fuzzy Systems 工程技术-工程:电子与电气
CiteScore
20.50
自引率
13.40%
发文量
517
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Fuzzy Systems is a scholarly journal that focuses on the theory, design, and application of fuzzy systems. It aims to publish high-quality technical papers that contribute significant technical knowledge and exploratory developments in the field of fuzzy systems. The journal particularly emphasizes engineering systems and scientific applications. In addition to research articles, the Transactions also includes a letters section featuring current information, comments, and rebuttals related to published papers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信