Anisotropic-Gaussian-Kernel-Based Fuzzy Clustering Algorithm for Feature Selection

IF 11.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Jun Liu;Mengyuan Wu;Mingwei Lin;Zeshui Xu
{"title":"Anisotropic-Gaussian-Kernel-Based Fuzzy Clustering Algorithm for Feature Selection","authors":"Jun Liu;Mengyuan Wu;Mingwei Lin;Zeshui Xu","doi":"10.1109/TFUZZ.2025.3581918","DOIUrl":null,"url":null,"abstract":"Soft clustering algorithms based on fuzzy C-means (FCM) have been extensively applied to complex data analysis. However, existing FCM variants still encounter key limitations: a large number of iterations due to slow convergence on high-dimensional data, equal weights of all samples, which increases sensitivity to noise, and strong dependence on empirically chosen fuzzy parameters, often resulting in suboptimal solutions. In order to address these challenges, in this work, we propose an adaptive FCM clustering algorithm based on anisotropic Gaussian kernel function, which facilitates the process of feature selection for multidimensional data with fewer iterations after feature reduction based on updating the kernel width vector. In order to enhance the classification accuracy, we assign adaptive weights to each sample and adjust these weights to mitigate the impact of outliers on classification accuracy. Unlike traditional fuzzy clustering algorithms, we employ the particle swarm optimization algorithm with time-varying acceleration coefficients to determine the global optimal parameters, thus reducing the computational resources and enhancing the algorithm’s efficiency. The experimental results based on 16 publicly available datasets validate that the proposed algorithm achieves higher classification accuracy with minimal number of iterations.","PeriodicalId":13212,"journal":{"name":"IEEE Transactions on Fuzzy Systems","volume":"33 9","pages":"3061-3075"},"PeriodicalIF":11.9000,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Fuzzy Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11045817/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Soft clustering algorithms based on fuzzy C-means (FCM) have been extensively applied to complex data analysis. However, existing FCM variants still encounter key limitations: a large number of iterations due to slow convergence on high-dimensional data, equal weights of all samples, which increases sensitivity to noise, and strong dependence on empirically chosen fuzzy parameters, often resulting in suboptimal solutions. In order to address these challenges, in this work, we propose an adaptive FCM clustering algorithm based on anisotropic Gaussian kernel function, which facilitates the process of feature selection for multidimensional data with fewer iterations after feature reduction based on updating the kernel width vector. In order to enhance the classification accuracy, we assign adaptive weights to each sample and adjust these weights to mitigate the impact of outliers on classification accuracy. Unlike traditional fuzzy clustering algorithms, we employ the particle swarm optimization algorithm with time-varying acceleration coefficients to determine the global optimal parameters, thus reducing the computational resources and enhancing the algorithm’s efficiency. The experimental results based on 16 publicly available datasets validate that the proposed algorithm achieves higher classification accuracy with minimal number of iterations.
基于各向异性高斯核的模糊聚类特征选择算法
基于模糊c均值(FCM)的软聚类算法已广泛应用于复杂数据分析。然而,现有的FCM变体仍然遇到关键的局限性:由于在高维数据上收敛缓慢而导致的大量迭代,所有样本的权重相等,这增加了对噪声的敏感性,以及对经验选择的模糊参数的强烈依赖,往往导致次优解。为了解决这些问题,本文提出了一种基于各向异性高斯核函数的自适应FCM聚类算法,该算法基于核宽度向量的更新进行特征约简后,能够以更少的迭代次数对多维数据进行特征选择。为了提高分类精度,我们为每个样本分配自适应权值,并调整这些权值以减轻异常值对分类精度的影响。与传统的模糊聚类算法不同,我们采用具有时变加速度系数的粒子群优化算法来确定全局最优参数,从而减少了计算资源,提高了算法效率。基于16个公开数据集的实验结果验证了该算法以最少的迭代次数实现了较高的分类精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Transactions on Fuzzy Systems
IEEE Transactions on Fuzzy Systems 工程技术-工程:电子与电气
CiteScore
20.50
自引率
13.40%
发文量
517
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Fuzzy Systems is a scholarly journal that focuses on the theory, design, and application of fuzzy systems. It aims to publish high-quality technical papers that contribute significant technical knowledge and exploratory developments in the field of fuzzy systems. The journal particularly emphasizes engineering systems and scientific applications. In addition to research articles, the Transactions also includes a letters section featuring current information, comments, and rebuttals related to published papers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信