{"title":"Double fuzzy relaxation local information C-Means clustering","authors":"Yunlong Gao, Xingshen Zheng, Qinting Wu, Jiahao Zhang, Chao Cao, Jinyan Pan","doi":"10.1007/s10489-024-06078-6","DOIUrl":null,"url":null,"abstract":"<div><p>Fuzzy c-means clustering (FCM) has gained widespread application because of its ability to capture uncertain information in data effectively. However, attributed to the prior assumption of identical distribution, traditional FCM is sensitive to noise and cluster size. Modified methods incorporating local spatial information can enhance the robustness to noise. However, they tend to balance cluster sizes, resulting in poor performance when dealing with imbalanced data. Modified methods learning the statistical characteristics of data are feasible to handle imbalanced data. However, they are often sensitive to noise due to the ignorance of local information. Aiming at the lack of method that can simultaneously alleviate the sensitivity to noise and cluster size, a double fuzzy relaxation local information c-means clustering algorithm (DFRLICM) is proposed in this paper. Firstly, sample relaxation is introduced to explore potential clustering results and enhance inter-class separability. Secondly, to cooperate with the relaxation, we design fuzzy weights to record the imbalance situation of data clusters, enhancing the capability of algorithm in dealing with imbalanced data. Thirdly, we introduce fuzzy factor to account for the preservation of local structures in data and improve the robustness of algorithm. Finally, we integrate the three elements into a unified model framework to achieve the combination optimization of robustness to noise and insensitivity to cluster size simultaneously. Extensive experiments are conducted and the results demonstrate that the proposed algorithm indeed achieves a balance between robustness to noise and insensitivity to cluster size.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 2","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-024-06078-6","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Fuzzy c-means clustering (FCM) has gained widespread application because of its ability to capture uncertain information in data effectively. However, attributed to the prior assumption of identical distribution, traditional FCM is sensitive to noise and cluster size. Modified methods incorporating local spatial information can enhance the robustness to noise. However, they tend to balance cluster sizes, resulting in poor performance when dealing with imbalanced data. Modified methods learning the statistical characteristics of data are feasible to handle imbalanced data. However, they are often sensitive to noise due to the ignorance of local information. Aiming at the lack of method that can simultaneously alleviate the sensitivity to noise and cluster size, a double fuzzy relaxation local information c-means clustering algorithm (DFRLICM) is proposed in this paper. Firstly, sample relaxation is introduced to explore potential clustering results and enhance inter-class separability. Secondly, to cooperate with the relaxation, we design fuzzy weights to record the imbalance situation of data clusters, enhancing the capability of algorithm in dealing with imbalanced data. Thirdly, we introduce fuzzy factor to account for the preservation of local structures in data and improve the robustness of algorithm. Finally, we integrate the three elements into a unified model framework to achieve the combination optimization of robustness to noise and insensitivity to cluster size simultaneously. Extensive experiments are conducted and the results demonstrate that the proposed algorithm indeed achieves a balance between robustness to noise and insensitivity to cluster size.
期刊介绍:
With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance.
The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.