{"title":"基于k-均值聚类算法的研究与改进方法","authors":"Guohua Zhang, Kangting Zhao, Yi Li","doi":"10.5539/cis.v12n1p49","DOIUrl":null,"url":null,"abstract":"In view of the sensitivity of the traditional mean algorithm to outliers and noise points, an improved mean algorithm is proposed in this paper, which is based on the density of the distribution of objects in space. In the measurement of density, the sensitivity of clustering effect to initial parameters is reduced. The improved algorithm can filter the \"noise\" data and discover the clustering of arbitrary shapes, which is obviously superior to the standard mean algorithm.","PeriodicalId":14676,"journal":{"name":"J. Chem. Inf. Comput. Sci.","volume":"61 1","pages":"49-52"},"PeriodicalIF":0.0000,"publicationDate":"2019-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Research and Improvement Method Based on k-mean Clustering Algorithm\",\"authors\":\"Guohua Zhang, Kangting Zhao, Yi Li\",\"doi\":\"10.5539/cis.v12n1p49\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In view of the sensitivity of the traditional mean algorithm to outliers and noise points, an improved mean algorithm is proposed in this paper, which is based on the density of the distribution of objects in space. In the measurement of density, the sensitivity of clustering effect to initial parameters is reduced. The improved algorithm can filter the \\\"noise\\\" data and discover the clustering of arbitrary shapes, which is obviously superior to the standard mean algorithm.\",\"PeriodicalId\":14676,\"journal\":{\"name\":\"J. Chem. Inf. Comput. Sci.\",\"volume\":\"61 1\",\"pages\":\"49-52\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-01-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"J. Chem. Inf. Comput. Sci.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5539/cis.v12n1p49\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Chem. Inf. Comput. Sci.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5539/cis.v12n1p49","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research and Improvement Method Based on k-mean Clustering Algorithm
In view of the sensitivity of the traditional mean algorithm to outliers and noise points, an improved mean algorithm is proposed in this paper, which is based on the density of the distribution of objects in space. In the measurement of density, the sensitivity of clustering effect to initial parameters is reduced. The improved algorithm can filter the "noise" data and discover the clustering of arbitrary shapes, which is obviously superior to the standard mean algorithm.