Improved K-Means Algorithm and its Implementation Based on Mean Shift
Authors: Yang Chen, Pengfei Hu, Weilan Wang
Published in: 2018 11th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI), 2018-10-01
DOI: https://doi.org/10.1109/CISP-BMEI.2018.8633100
The traditional K-means algorithm is sensitive to the choice of initial cluster centers: different random initializations lead to different clustering results. This paper proposes an improved K-means algorithm based on Mean Shift clustering to address this problem. The algorithm uses Mean Shift to obtain a high-density migration vector set MP, and then selects, from the high-density region of MP, the k points that are farthest from one another as the initial cluster centers. The proposed algorithm is verified on the iris and wine data sets from the UCI repository, and on 150 text images of vowels located above the baseline, taken from the text analysis of ancient Tibetan books in the Ujin script (this real-world sample is called the Tibetan dataset). The experimental results show that the algorithm achieves better clustering results, with higher accuracy and greater stability.
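The initialization idea described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the flat kernel, the `bandwidth` parameter, the greedy farthest-first selection, the toy data, and all function names are assumptions introduced here, since the abstract does not specify these details.

```python
import math

def mean_shift_points(points, bandwidth, iters=30):
    """Shift each point toward the mean of the data points within `bandwidth`
    (flat kernel; an assumed stand-in for the paper's set MP)."""
    shifted = [tuple(p) for p in points]
    for _ in range(iters):
        moved = []
        for p in shifted:
            nbrs = [q for q in points if math.dist(p, q) <= bandwidth]
            # Guard against an empty neighbourhood caused by rounding.
            moved.append(tuple(sum(c) / len(nbrs) for c in zip(*nbrs)) if nbrs else p)
        if moved == shifted:
            break
        shifted = moved
    return shifted

def pick_initial_centers(candidates, points, k, bandwidth):
    """Start from the densest candidate, then greedily add the candidate
    farthest from those already chosen (assumed selection rule)."""
    density = [sum(math.dist(c, q) <= bandwidth for q in points) for c in candidates]
    first = max(range(len(candidates)), key=density.__getitem__)
    chosen = [candidates[first]]
    while len(chosen) < k:
        chosen.append(max(candidates, key=lambda c: min(math.dist(c, s) for s in chosen)))
    return chosen

def kmeans(points, centers, iters=100):
    """Standard Lloyd iterations starting from the given centers."""
    centers = [tuple(c) for c in centers]
    labels = []
    for _ in range(iters):
        labels = [min(range(len(centers)), key=lambda j: math.dist(p, centers[j]))
                  for p in points]
        updated = []
        for j, c in enumerate(centers):
            members = [p for p, lab in zip(points, labels) if lab == j]
            # Keep the old center if a cluster ends up empty.
            updated.append(tuple(sum(x) / len(members) for x in zip(*members)) if members else c)
        if updated == centers:
            break
        centers = updated
    return centers, labels

# Toy data (hypothetical): three well-separated groups of four 2-D points.
data = [(0, 0), (0, 1), (1, 0), (1, 1),
        (10, 10), (10, 11), (11, 10), (11, 11),
        (0, 10), (0, 11), (1, 10), (1, 11)]
mp = mean_shift_points(data, bandwidth=2.0)
init = pick_initial_centers(mp, data, k=3, bandwidth=2.0)
centers, labels = kmeans(data, init)
print(centers, labels)
```

Because the initial centers are derived deterministically from the data rather than drawn at random, repeated runs of this sketch on the same input give the same clustering, which is the kind of stability the abstract refers to. The result still depends on the assumed `bandwidth` value.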