Fuzzy-Rough Bireducts With Supervised Multiscale Granulation

IF 11.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Zhihong Wang;Hongmei Chen;Huming Liao;Tengyu Yin;Biao Xiang;Shi-Jinn Horng;Tianrui Li
{"title":"Fuzzy-Rough Bireducts With Supervised Multiscale Granulation","authors":"Zhihong Wang;Hongmei Chen;Huming Liao;Tengyu Yin;Biao Xiang;Shi-Jinn Horng;Tianrui Li","doi":"10.1109/TFUZZ.2024.3518473","DOIUrl":null,"url":null,"abstract":"The inherent characteristics involved in data can be mined from multi-scale information systems by extracting information from different value levels of features. In real applications, noise data and irrelevant or redundant features affect the generality of learning models. Therefore, keeping meaningful features and avoiding the effect of noise is essential for feature selection in a multi-scale information system. In bireduct, multi-scale granulation can be used to characterize the importance and correlation of features at different scales. However, little work has taken the distribution of multi-scale data into account when granulating it. In addition, these approaches focus on solving the task of multi-scale data reduction only from the dimension perspective. To this end, a fuzzy-rough bireduct with supervised multi-scale granulation (FrBSmg) is proposed. First, the supervised multi-scale fuzzy granulation based on data distribution is constructed. Then, scaled uncertainty measures are defined to describe the fuzzy relevance of each feature. Furthermore, the global and local distributions of a sample are characterized simultaneously based on the positive region, which can reflect the degree of a sample belonging to some class, and the supervised fuzzy similarity relation can describe the degree of a sample belonging to its class. A strategy of Feature-Correlated Selection and Sample-Noisy Removal is devised for bireduct. Finally, the experimental results on twenty-one public datasets show the effectiveness of FrBSmg.","PeriodicalId":13212,"journal":{"name":"IEEE Transactions on Fuzzy Systems","volume":"33 4","pages":"1253-1264"},"PeriodicalIF":11.9000,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Fuzzy Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10803953/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

The inherent characteristics involved in data can be mined from multi-scale information systems by extracting information from different value levels of features. In real applications, noise data and irrelevant or redundant features affect the generality of learning models. Therefore, keeping meaningful features and avoiding the effect of noise is essential for feature selection in a multi-scale information system. In bireduct, multi-scale granulation can be used to characterize the importance and correlation of features at different scales. However, little work has taken the distribution of multi-scale data into account when granulating it. In addition, these approaches focus on solving the task of multi-scale data reduction only from the dimension perspective. To this end, a fuzzy-rough bireduct with supervised multi-scale granulation (FrBSmg) is proposed. First, the supervised multi-scale fuzzy granulation based on data distribution is constructed. Then, scaled uncertainty measures are defined to describe the fuzzy relevance of each feature. Furthermore, the global and local distributions of a sample are characterized simultaneously based on the positive region, which can reflect the degree of a sample belonging to some class, and the supervised fuzzy similarity relation can describe the degree of a sample belonging to its class. A strategy of Feature-Correlated Selection and Sample-Noisy Removal is devised for bireduct. Finally, the experimental results on twenty-one public datasets show the effectiveness of FrBSmg.
有监督的多尺度粒化模糊粗糙双环产品
在多尺度信息系统中,通过对特征的不同值层次进行信息提取,可以挖掘数据所涉及的内在特征。在实际应用中,噪声数据和不相关或冗余的特征会影响学习模型的通用性。因此,保留有意义的特征并避免噪声的影响是多尺度信息系统特征选择的关键。在二分法中,多尺度颗粒化可以用来表征不同尺度特征的重要性和相关性。然而,在对多尺度数据进行颗粒化时,很少考虑到数据的分布情况。此外,这些方法主要是从维数角度解决多尺度数据约简问题。为此,提出了一种带有监督多尺度粒化的模糊粗糙二叉算法(FrBSmg)。首先,构造基于数据分布的监督多尺度模糊粒化;然后,定义尺度不确定性测度来描述各特征之间的模糊关联。此外,基于正域同时表征样本的全局分布和局部分布,正域可以反映样本属于某一类的程度,监督模糊相似关系可以描述样本属于某一类的程度。提出了一种特征相关选择和样本噪声去除策略。最后,在21个公共数据集上进行了实验,验证了FrBSmg的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Transactions on Fuzzy Systems
IEEE Transactions on Fuzzy Systems 工程技术-工程:电子与电气
CiteScore
20.50
自引率
13.40%
发文量
517
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Fuzzy Systems is a scholarly journal that focuses on the theory, design, and application of fuzzy systems. It aims to publish high-quality technical papers that contribute significant technical knowledge and exploratory developments in the field of fuzzy systems. The journal particularly emphasizes engineering systems and scientific applications. In addition to research articles, the Transactions also includes a letters section featuring current information, comments, and rebuttals related to published papers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信