A new multi-prototype based clustering algorithm

Lu Wang, Huidong Wang, Chuanzheng Bai
Published in: 2021 11th International Conference on Information Science and Technology (ICIST)
Publication date: 2021-05-21
DOI: 10.1109/ICIST52614.2021.9440589
URL: https://doi.org/10.1109/ICIST52614.2021.9440589
Citations: 2

Abstract

K-means is a well-known prototype-based clustering algorithm, valued for its simplicity and efficiency. However, most k-means methods assume that each class is represented by a single prototype, which limits their applicability. Multi-prototype clustering methods have recently been proposed to address this problem; they consist of two stages, a split stage and a merge stage. For multi-prototype algorithms, the number of prototypes plays a vital role in performance, and it is generally set by the user through trial and error. In this paper, a new incremental k-means clustering algorithm is designed to determine an appropriate number of prototypes automatically. First, a new indicator is presented to judge whether the number of prototypes is appropriate in the split stage. Second, a new merge indicator is defined in the merge stage, based on the formula for the distance from a data point to a hyperplane. Finally, simulation results on 8 datasets illustrate the effectiveness and superiority of the proposed algorithm.
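The abstract refers to the distance from a data point to a hyperplane but does not give the merge indicator itself. As a minimal sketch of that one ingredient only, the standard formula |w·x + b| / ||w|| can be written as below. The function names `distance_to_hyperplane` and `bisector` are hypothetical, and using the perpendicular bisector of two prototypes as the hyperplane is an assumption made for illustration, not something the abstract states.

```python
import math


def distance_to_hyperplane(x, w, b):
    """Unsigned distance from point x to the hyperplane w . x + b = 0."""
    norm = math.sqrt(sum(wi * wi for wi in w))
    if norm == 0.0:
        raise ValueError("w must be a non-zero normal vector")
    return abs(sum(wi * xi for wi, xi in zip(w, x)) + b) / norm


def bisector(p_a, p_b):
    """Hyperplane (w, b) equidistant from prototypes p_a and p_b.

    Assumption for illustration: the abstract does not say which
    hyperplane the merge indicator uses.
    """
    w = [a - c for a, c in zip(p_a, p_b)]            # normal vector p_a - p_b
    mid = [(a + c) / 2.0 for a, c in zip(p_a, p_b)]  # midpoint lies on the plane
    b = -sum(wi * mi for wi, mi in zip(w, mid))
    return w, b


# Two prototypes on the x-axis; their bisector is the line x = 1.
w, b = bisector([0.0, 0.0], [2.0, 0.0])
on_boundary = distance_to_hyperplane([1.0, 5.0], w, b)  # point on the bisector
at_prototype = distance_to_hyperplane([0.0, 0.0], w, b)  # the first prototype
```

Under this assumption, points with a small distance lie near the boundary between two sub-clusters. How such distances would be combined into a merge decision is not specified in the abstract and is left open here.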