Improved K-Means Clustering Algorithm Based on Dynamic Clustering

Li-Guo Zheng
{"title":"Improved K-Means Clustering Algorithm Based on Dynamic Clustering","authors":"Li-Guo Zheng","doi":"10.21742/ijarbms.2020.4.1.02","DOIUrl":null,"url":null,"abstract":"Cluster analysis can not only find potential and valuable structured information in the data set, but also provide pre-processing functions for other data mining algorithms, and then can refine the processing results to improve the accuracy of the algorithm. Therefore, cluster analysis has become one of the hot research topics in the field of data mining. K-means algorithm, as a clustering algorithm based on the partitioning idea, can compare the differences between the data set classes and classes. We can use the K-means algorithm to mine the clustering results and further discover the potentially valuable knowledge in the data set. Help people make more accurate decisions. This paper summarizes and analyzes the traditional K-means algorithm, summarizes the improvement direction of the K-means algorithm, fully considers the dynamic change of information in the K-means clustering process, and reduces the standard setting value for the termination condition of the algorithm to reduce The number of iterations of the algorithm reduces the learning time; the redundant information generated by the dynamic change of information is deleted to reduce the interference in the dynamic clustering process, so that the algorithm achieves a more accurate and efficient clustering effect. Experimental results show that when the amount of data is large, compared with the traditional K-means algorithm, the improved K-means algorithm has a greater improvement in accuracy and execution efficiency. 1","PeriodicalId":377126,"journal":{"name":"International Journal of Advanced Research in Big Data Management System","volume":"5 11","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Advanced Research in Big Data Management System","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21742/ijarbms.2020.4.1.02","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Cluster analysis can not only find potential and valuable structured information in the data set, but also provide pre-processing functions for other data mining algorithms, and then can refine the processing results to improve the accuracy of the algorithm. Therefore, cluster analysis has become one of the hot research topics in the field of data mining. K-means algorithm, as a clustering algorithm based on the partitioning idea, can compare the differences between the data set classes and classes. We can use the K-means algorithm to mine the clustering results and further discover the potentially valuable knowledge in the data set. Help people make more accurate decisions. This paper summarizes and analyzes the traditional K-means algorithm, summarizes the improvement direction of the K-means algorithm, fully considers the dynamic change of information in the K-means clustering process, and reduces the standard setting value for the termination condition of the algorithm to reduce The number of iterations of the algorithm reduces the learning time; the redundant information generated by the dynamic change of information is deleted to reduce the interference in the dynamic clustering process, so that the algorithm achieves a more accurate and efficient clustering effect. Experimental results show that when the amount of data is large, compared with the traditional K-means algorithm, the improved K-means algorithm has a greater improvement in accuracy and execution efficiency. 1
基于动态聚类的改进 K-Means 聚类算法
聚类分析不仅可以发现数据集中潜在的、有价值的结构化信息,还可以为其他数据挖掘算法提供预处理功能,进而可以完善处理结果,提高算法的准确性。因此,聚类分析已成为数据挖掘领域的热门研究课题之一。K-means 算法作为一种基于划分思想的聚类算法,可以比较数据集类与类之间的差异。我们可以利用 K-means 算法挖掘聚类结果,进一步发现数据集中潜在的有价值知识。帮助人们做出更准确的决策。本文对传统的 K-means 算法进行了归纳和分析,总结了 K-means 算法的改进方向,充分考虑了 K-means 聚类过程中信息的动态变化,降低了算法终止条件的标准设定值,减少了算法的迭代次数,缩短了学习时间;删除了信息动态变化产生的冗余信息,减少了动态聚类过程中的干扰,使算法达到了更准确、更高效的聚类效果。实验结果表明,当数据量较大时,与传统的K-means算法相比,改进的K-means算法在准确性和执行效率上都有较大的提高。1
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信