Pengfei Zhang;Zhaoxuan He;Dexian Wang;Tao Jiang;Baolin Li;Jia Liu;Wei Huang;Tianrui Li
{"title":"ODMGIS:基于多粒度信息集的离群点检测方法","authors":"Pengfei Zhang;Zhaoxuan He;Dexian Wang;Tao Jiang;Baolin Li;Jia Liu;Wei Huang;Tianrui Li","doi":"10.1109/TFUZZ.2025.3550749","DOIUrl":null,"url":null,"abstract":"In the realm of data mining, outlier detection has emerged as a pivotal research focus, aimed at uncovering anomalies within datasets to extract meaningful and valuable insights. The objective is to leverage data mining methodologies to pinpoint anomalies within datasets, thereby revealing crucial and enlightening information. Herein, we introduce a groundbreaking outlier detection methodology, ODMGIS, that seamlessly integrates multigranularity representation and information set concepts to devise the multigranularity information set (MGIS) model. This model adeptly characterizes the distribution patterns of data points. First, we employ entropy function and their complementary function as measurement tools to accurately quantify the inherent uncertainty in data with different distributions, and considers the sum of the two as a comprehensive representation of the overall uncertainty of the information source. Subsequently, an outlier score model is constructed based on MGIS, which can deeply characterize the degree of outlierness of samples, thereby effectively identifying abnormal points in the dataset. During validation, ODMGIS was rigorously tested on practical datasets from medicine and bioinformatics, and its performance was benchmarked against both traditional and the state-of-the-art algorithms, showcasing substantial benefits. This research not only contributes a fresh perspective to outlier detection, but also sparks innovative avenues for exploring and advancing the granular computing theory.","PeriodicalId":13212,"journal":{"name":"IEEE Transactions on Fuzzy Systems","volume":"33 7","pages":"2050-2061"},"PeriodicalIF":10.7000,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ODMGIS: An Outlier Detection Method Based on Multigranularity Information Sets\",\"authors\":\"Pengfei Zhang;Zhaoxuan He;Dexian Wang;Tao Jiang;Baolin Li;Jia Liu;Wei Huang;Tianrui Li\",\"doi\":\"10.1109/TFUZZ.2025.3550749\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the realm of data mining, outlier detection has emerged as a pivotal research focus, aimed at uncovering anomalies within datasets to extract meaningful and valuable insights. The objective is to leverage data mining methodologies to pinpoint anomalies within datasets, thereby revealing crucial and enlightening information. Herein, we introduce a groundbreaking outlier detection methodology, ODMGIS, that seamlessly integrates multigranularity representation and information set concepts to devise the multigranularity information set (MGIS) model. This model adeptly characterizes the distribution patterns of data points. First, we employ entropy function and their complementary function as measurement tools to accurately quantify the inherent uncertainty in data with different distributions, and considers the sum of the two as a comprehensive representation of the overall uncertainty of the information source. Subsequently, an outlier score model is constructed based on MGIS, which can deeply characterize the degree of outlierness of samples, thereby effectively identifying abnormal points in the dataset. During validation, ODMGIS was rigorously tested on practical datasets from medicine and bioinformatics, and its performance was benchmarked against both traditional and the state-of-the-art algorithms, showcasing substantial benefits. This research not only contributes a fresh perspective to outlier detection, but also sparks innovative avenues for exploring and advancing the granular computing theory.\",\"PeriodicalId\":13212,\"journal\":{\"name\":\"IEEE Transactions on Fuzzy Systems\",\"volume\":\"33 7\",\"pages\":\"2050-2061\"},\"PeriodicalIF\":10.7000,\"publicationDate\":\"2025-03-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Fuzzy Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10923722/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Fuzzy Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10923722/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
ODMGIS: An Outlier Detection Method Based on Multigranularity Information Sets
In the realm of data mining, outlier detection has emerged as a pivotal research focus, aimed at uncovering anomalies within datasets to extract meaningful and valuable insights. The objective is to leverage data mining methodologies to pinpoint anomalies within datasets, thereby revealing crucial and enlightening information. Herein, we introduce a groundbreaking outlier detection methodology, ODMGIS, that seamlessly integrates multigranularity representation and information set concepts to devise the multigranularity information set (MGIS) model. This model adeptly characterizes the distribution patterns of data points. First, we employ entropy function and their complementary function as measurement tools to accurately quantify the inherent uncertainty in data with different distributions, and considers the sum of the two as a comprehensive representation of the overall uncertainty of the information source. Subsequently, an outlier score model is constructed based on MGIS, which can deeply characterize the degree of outlierness of samples, thereby effectively identifying abnormal points in the dataset. During validation, ODMGIS was rigorously tested on practical datasets from medicine and bioinformatics, and its performance was benchmarked against both traditional and the state-of-the-art algorithms, showcasing substantial benefits. This research not only contributes a fresh perspective to outlier detection, but also sparks innovative avenues for exploring and advancing the granular computing theory.
期刊介绍:
The IEEE Transactions on Fuzzy Systems is a scholarly journal that focuses on the theory, design, and application of fuzzy systems. It aims to publish high-quality technical papers that contribute significant technical knowledge and exploratory developments in the field of fuzzy systems. The journal particularly emphasizes engineering systems and scientific applications. In addition to research articles, the Transactions also includes a letters section featuring current information, comments, and rebuttals related to published papers.