Wenjun Wu, Lingling Zhang, Yiwei Chen, Xuan Luo, Bifan Wei, Jun Liu
{"title":"Fuzzy c-Means Clustering with Discriminative Projection","authors":"Wenjun Wu, Lingling Zhang, Yiwei Chen, Xuan Luo, Bifan Wei, Jun Liu","doi":"10.1109/ICKG52313.2021.00062","DOIUrl":null,"url":null,"abstract":"The clustering technique plays an important role in data mining and machine learning fields. Clustering for high-dimensional data, such as texts, images, and videos, remains a challenging task due to the existence of many noise features. The widely used methods for this issue focus on mining a effective pattern in high-dimensional data using some dimensionality reduction techniques before clustering. This strategy slightly mitigates the effects of irrelevant and redundant features, but cannot significantly improve the clustering performance because the captured pattern by dimensionality reduction is not directly related to the clustering task. In this paper, we propose a unified framework to achieve discriminative dimensionality reduction and fuzzy clustering for high-dimensional data simultaneously. The proposed framework not only utilizes the clustering results to directly guide or supervise the process of discriminative dimensionality reduction, but also controls the clustering fuzziness more easily by a $F$ -norm regularization term. An efficient optimization algorithm is exploited to address the objective function of our method, which is proved to converge to the local optimal solution in theory. We evaluate the proposed method on three large-scale fine-grained image datasets, including Birds, Flowers, and Cars, for clustering and retrieval two tasks. The experimental results on metrics ACC, NMI, ARI and Recall@K indicate that our method achieves the comparable performance over the state-of-the-art methods.","PeriodicalId":174126,"journal":{"name":"2021 IEEE International Conference on Big Knowledge (ICBK)","volume":"31 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Big Knowledge (ICBK)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICKG52313.2021.00062","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The clustering technique plays an important role in data mining and machine learning fields. Clustering for high-dimensional data, such as texts, images, and videos, remains a challenging task due to the existence of many noise features. The widely used methods for this issue focus on mining a effective pattern in high-dimensional data using some dimensionality reduction techniques before clustering. This strategy slightly mitigates the effects of irrelevant and redundant features, but cannot significantly improve the clustering performance because the captured pattern by dimensionality reduction is not directly related to the clustering task. In this paper, we propose a unified framework to achieve discriminative dimensionality reduction and fuzzy clustering for high-dimensional data simultaneously. The proposed framework not only utilizes the clustering results to directly guide or supervise the process of discriminative dimensionality reduction, but also controls the clustering fuzziness more easily by a $F$ -norm regularization term. An efficient optimization algorithm is exploited to address the objective function of our method, which is proved to converge to the local optimal solution in theory. We evaluate the proposed method on three large-scale fine-grained image datasets, including Birds, Flowers, and Cars, for clustering and retrieval two tasks. The experimental results on metrics ACC, NMI, ARI and Recall@K indicate that our method achieves the comparable performance over the state-of-the-art methods.