Anisotropic-Gaussian-Kernel-Based Fuzzy Clustering Algorithm for Feature Selection

IF 11.9 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE Transactions on Fuzzy Systems Pub Date : 2025-06-20 DOI:10.1109/TFUZZ.2025.3581918

Jun Liu;Mengyuan Wu;Mingwei Lin;Zeshui Xu

{"title":"Anisotropic-Gaussian-Kernel-Based Fuzzy Clustering Algorithm for Feature Selection","authors":"Jun Liu;Mengyuan Wu;Mingwei Lin;Zeshui Xu","doi":"10.1109/TFUZZ.2025.3581918","DOIUrl":null,"url":null,"abstract":"Soft clustering algorithms based on fuzzy C-means (FCM) have been extensively applied to complex data analysis. However, existing FCM variants still encounter key limitations: a large number of iterations due to slow convergence on high-dimensional data, equal weights of all samples, which increases sensitivity to noise, and strong dependence on empirically chosen fuzzy parameters, often resulting in suboptimal solutions. In order to address these challenges, in this work, we propose an adaptive FCM clustering algorithm based on anisotropic Gaussian kernel function, which facilitates the process of feature selection for multidimensional data with fewer iterations after feature reduction based on updating the kernel width vector. In order to enhance the classification accuracy, we assign adaptive weights to each sample and adjust these weights to mitigate the impact of outliers on classification accuracy. Unlike traditional fuzzy clustering algorithms, we employ the particle swarm optimization algorithm with time-varying acceleration coefficients to determine the global optimal parameters, thus reducing the computational resources and enhancing the algorithm’s efficiency. The experimental results based on 16 publicly available datasets validate that the proposed algorithm achieves higher classification accuracy with minimal number of iterations.","PeriodicalId":13212,"journal":{"name":"IEEE Transactions on Fuzzy Systems","volume":"33 9","pages":"3061-3075"},"PeriodicalIF":11.9000,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Fuzzy Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11045817/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Soft clustering algorithms based on fuzzy C-means (FCM) have been extensively applied to complex data analysis. However, existing FCM variants still encounter key limitations: a large number of iterations due to slow convergence on high-dimensional data, equal weights of all samples, which increases sensitivity to noise, and strong dependence on empirically chosen fuzzy parameters, often resulting in suboptimal solutions. In order to address these challenges, in this work, we propose an adaptive FCM clustering algorithm based on anisotropic Gaussian kernel function, which facilitates the process of feature selection for multidimensional data with fewer iterations after feature reduction based on updating the kernel width vector. In order to enhance the classification accuracy, we assign adaptive weights to each sample and adjust these weights to mitigate the impact of outliers on classification accuracy. Unlike traditional fuzzy clustering algorithms, we employ the particle swarm optimization algorithm with time-varying acceleration coefficients to determine the global optimal parameters, thus reducing the computational resources and enhancing the algorithm’s efficiency. The experimental results based on 16 publicly available datasets validate that the proposed algorithm achieves higher classification accuracy with minimal number of iterations.

查看原文本刊更多论文

基于各向异性高斯核的模糊聚类特征选择算法

基于模糊c均值（FCM）的软聚类算法已广泛应用于复杂数据分析。然而，现有的FCM变体仍然遇到关键的局限性：由于在高维数据上收敛缓慢而导致的大量迭代，所有样本的权重相等，这增加了对噪声的敏感性，以及对经验选择的模糊参数的强烈依赖，往往导致次优解。为了解决这些问题，本文提出了一种基于各向异性高斯核函数的自适应FCM聚类算法，该算法基于核宽度向量的更新进行特征约简后，能够以更少的迭代次数对多维数据进行特征选择。为了提高分类精度，我们为每个样本分配自适应权值，并调整这些权值以减轻异常值对分类精度的影响。与传统的模糊聚类算法不同，我们采用具有时变加速度系数的粒子群优化算法来确定全局最优参数，从而减少了计算资源，提高了算法效率。基于16个公开数据集的实验结果验证了该算法以最少的迭代次数实现了较高的分类精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Fuzzy Systems 工程技术-工程：电子与电气

CiteScore

20.50

自引率

13.40%

发文量

517

审稿时长

3.0 months

期刊介绍： The IEEE Transactions on Fuzzy Systems is a scholarly journal that focuses on the theory, design, and application of fuzzy systems. It aims to publish high-quality technical papers that contribute significant technical knowledge and exploratory developments in the field of fuzzy systems. The journal particularly emphasizes engineering systems and scientific applications. In addition to research articles, the Transactions also includes a letters section featuring current information, comments, and rebuttals related to published papers.