{"title":"Prototype-based fuzzy rough sets for outlier detection","authors":"Mingjie Cai , Dongying Qi , Chaoqun Huang , Jiaxin Zhan","doi":"10.1016/j.fss.2025.109460","DOIUrl":null,"url":null,"abstract":"<div><div>Outlier detection is a crucial task for ensuring the reliability of data analysis, aiming at identifying objects that markedly differ from typical patterns in the dataset. Recently, many methods based on fuzzy rough sets have shown promising performance. However, these methods overlook the influence of redundant attributes and include the relationships between outliers in the analysis. Targeting these important problems, we propose a novel approach called Prototype-based Fuzzy Rough Sets (PFRS), which performs outlier detection via prototype learning based on the selection of separable attributes. Specifically, information entropy is applied to select significant attributes, enhancing the separability of the feature space. More importantly, by employing prototype learning to acquire representative objects, PFRS effectively eliminates the impact of relationships among outliers. In addition, the fuzzy similarity between prototypes and objects is evaluated to reconstruct the traditional fuzzy upper and lower approximations. Finally, more reliable outlier scores are derived from PFRS. Extensive experiments using widely adopted algorithms verify the effectiveness of PFRS, confirming its outstanding performance.</div></div>","PeriodicalId":55130,"journal":{"name":"Fuzzy Sets and Systems","volume":"517 ","pages":"Article 109460"},"PeriodicalIF":3.2000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fuzzy Sets and Systems","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016501142500199X","RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Outlier detection is a crucial task for ensuring the reliability of data analysis, aiming at identifying objects that markedly differ from typical patterns in the dataset. Recently, many methods based on fuzzy rough sets have shown promising performance. However, these methods overlook the influence of redundant attributes and include the relationships between outliers in the analysis. Targeting these important problems, we propose a novel approach called Prototype-based Fuzzy Rough Sets (PFRS), which performs outlier detection via prototype learning based on the selection of separable attributes. Specifically, information entropy is applied to select significant attributes, enhancing the separability of the feature space. More importantly, by employing prototype learning to acquire representative objects, PFRS effectively eliminates the impact of relationships among outliers. In addition, the fuzzy similarity between prototypes and objects is evaluated to reconstruct the traditional fuzzy upper and lower approximations. Finally, more reliable outlier scores are derived from PFRS. Extensive experiments using widely adopted algorithms verify the effectiveness of PFRS, confirming its outstanding performance.
期刊介绍:
Since its launching in 1978, the journal Fuzzy Sets and Systems has been devoted to the international advancement of the theory and application of fuzzy sets and systems. The theory of fuzzy sets now encompasses a well organized corpus of basic notions including (and not restricted to) aggregation operations, a generalized theory of relations, specific measures of information content, a calculus of fuzzy numbers. Fuzzy sets are also the cornerstone of a non-additive uncertainty theory, namely possibility theory, and of a versatile tool for both linguistic and numerical modeling: fuzzy rule-based systems. Numerous works now combine fuzzy concepts with other scientific disciplines as well as modern technologies.
In mathematics fuzzy sets have triggered new research topics in connection with category theory, topology, algebra, analysis. Fuzzy sets are also part of a recent trend in the study of generalized measures and integrals, and are combined with statistical methods. Furthermore, fuzzy sets have strong logical underpinnings in the tradition of many-valued logics.