A new multi-prototype based clustering algorithm

2021 11th International Conference on Information Science and Technology (ICIST). Pub Date: 2021-05-21. DOI: 10.1109/ICIST52614.2021.9440589

Lu Wang, Huidong Wang, Chuanzheng Bai


Citations: 2

Abstract



K-means is a well-known prototype-based clustering algorithm, valued for its simplicity and efficiency. However, most k-means methods assume that each class is represented by a single prototype, which limits their applicability. Recently, multi-prototype clustering methods have been proposed to tackle this problem; they consist of two stages: a split stage and a merge stage. For multi-prototype algorithms, the number of prototypes plays a vital role in performance, and it is generally chosen by the user through trial and error. In this paper, a new incremental k-means clustering algorithm is designed to determine an appropriate number of prototypes automatically. First, a new indicator is presented to judge, in the split stage, whether the number of prototypes is appropriate. Second, a new merge indicator is defined in the merge stage, based on the formula for the distance from a data point to a hyperplane. Finally, simulation results on 8 datasets illustrate the effectiveness and superiority of the proposed algorithm.
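The abstract does not define the merge indicator itself, only that it builds on the distance from a data point to a hyperplane. As a minimal sketch of that building block, the distance from a point x to the hyperplane w·x + b = 0 is |w·x + b| / ‖w‖. The function names below are hypothetical, and using the perpendicular bisector between two prototypes as the hyperplane is an assumption made purely for illustration, not something the abstract states.

```python
import math


def point_to_hyperplane_distance(x, w, b):
    """Distance from point x to the hyperplane {z : w.z + b = 0}."""
    norm_w = math.sqrt(sum(wi * wi for wi in w))
    if norm_w == 0.0:
        raise ValueError("w must be a non-zero normal vector")
    return abs(sum(wi * xi for wi, xi in zip(w, x)) + b) / norm_w


def bisector_hyperplane(p, q):
    """Hyperplane of points equidistant from prototypes p and q.

    ASSUMPTION: the paper's hyperplane is not specified in the abstract;
    the perpendicular bisector is used here only as an example.
    Returns (w, b) with normal w = q - p, and b set so that the midpoint
    of p and q lies on the hyperplane.
    """
    w = [qi - pi for pi, qi in zip(p, q)]
    mid = [(pi + qi) / 2.0 for pi, qi in zip(p, q)]
    b = -sum(wi * mi for wi, mi in zip(w, mid))
    return w, b


# Two hypothetical prototypes on the x-axis; their bisector is the line x = 2.
w, b = bisector_hyperplane([0.0, 0.0], [4.0, 0.0])
d = point_to_hyperplane_distance([1.0, 3.0], w, b)  # point lies 1 unit from x = 2
```

A merge rule could, for instance, look at how many points of two neighboring sub-clusters fall close to such a boundary, but the actual indicator would have to be taken from the full paper.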

