{"title":"通过亲和度量提高模糊c均值聚类唯一性","authors":"A. Banumathi, A. Pethalakshmi","doi":"10.1109/ICPRIME.2012.6208282","DOIUrl":null,"url":null,"abstract":"Clustering is a widely used technique in data mining application for discovering patterns in large dataset. In this paper the Fuzzy C-Means algorithm is analyzed and found that quality of the resultant cluster is based on the initial seed where it is selected either sequentially or randomly. Fuzzy C-Means uses K-Means clustering approach for the initial operation of clustering and then degree of membership is calculated. Fuzzy C-Means is very similar to the K-Means algorithm and hence in this paper K-Means is outlined and proved how the drawback of K-Means algorithm is rectified through UCAM (Unique Clustering with Affinity Measure) clustering algorithm and then UCAM is refined to give a new view namely Fuzzy-UCAM. Fuzzy C-Means algorithm should be initiated with the number of cluster C and initial seeds. For real time large database it's difficult to predict the number of cluster and initial seeds accurately. In order to overcome this drawback the current paper focused on developing the Fuzzy-UCAM algorithm for clustering without giving initial seed and number of clusters for Fuzzy C-Means. Unique clustering is obtained with the help of affinity measures.","PeriodicalId":148511,"journal":{"name":"International Conference on Pattern Recognition, Informatics and Medical Engineering (PRIME-2012)","volume":"341 4","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Increasing cluster uniqueness in Fuzzy C-Means through affinity measure\",\"authors\":\"A. Banumathi, A. Pethalakshmi\",\"doi\":\"10.1109/ICPRIME.2012.6208282\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clustering is a widely used technique in data mining application for discovering patterns in large dataset. In this paper the Fuzzy C-Means algorithm is analyzed and found that quality of the resultant cluster is based on the initial seed where it is selected either sequentially or randomly. Fuzzy C-Means uses K-Means clustering approach for the initial operation of clustering and then degree of membership is calculated. Fuzzy C-Means is very similar to the K-Means algorithm and hence in this paper K-Means is outlined and proved how the drawback of K-Means algorithm is rectified through UCAM (Unique Clustering with Affinity Measure) clustering algorithm and then UCAM is refined to give a new view namely Fuzzy-UCAM. Fuzzy C-Means algorithm should be initiated with the number of cluster C and initial seeds. For real time large database it's difficult to predict the number of cluster and initial seeds accurately. In order to overcome this drawback the current paper focused on developing the Fuzzy-UCAM algorithm for clustering without giving initial seed and number of clusters for Fuzzy C-Means. Unique clustering is obtained with the help of affinity measures.\",\"PeriodicalId\":148511,\"journal\":{\"name\":\"International Conference on Pattern Recognition, Informatics and Medical Engineering (PRIME-2012)\",\"volume\":\"341 4\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-03-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Pattern Recognition, Informatics and Medical Engineering (PRIME-2012)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICPRIME.2012.6208282\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Pattern Recognition, Informatics and Medical Engineering (PRIME-2012)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPRIME.2012.6208282","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
摘要
聚类是一种广泛应用于数据挖掘的技术,用于在大数据集中发现模式。本文对模糊c均值算法进行了分析,发现聚类结果的质量取决于初始种子,初始种子的选择可以是顺序的,也可以是随机的。模糊C-Means采用K-Means聚类方法进行聚类的初始操作,然后计算隶属度。模糊C-Means与K-Means算法非常相似,因此本文概述了K-Means算法,并证明了如何通过UCAM (Unique Clustering with Affinity Measure)聚类算法纠正K-Means算法的缺点,然后对UCAM进行改进,给出了一种新的观点,即Fuzzy-UCAM。模糊C-均值算法的初始化需要有聚类C的个数和初始种子的个数。对于实时的大型数据库,很难准确地预测聚类和初始种子的数量。为了克服这一缺点,本文重点研究了不给出模糊c均值初始种子和簇数的Fuzzy- ucam聚类算法。利用亲和度量获得了唯一的聚类。
Increasing cluster uniqueness in Fuzzy C-Means through affinity measure
Clustering is a widely used technique in data mining application for discovering patterns in large dataset. In this paper the Fuzzy C-Means algorithm is analyzed and found that quality of the resultant cluster is based on the initial seed where it is selected either sequentially or randomly. Fuzzy C-Means uses K-Means clustering approach for the initial operation of clustering and then degree of membership is calculated. Fuzzy C-Means is very similar to the K-Means algorithm and hence in this paper K-Means is outlined and proved how the drawback of K-Means algorithm is rectified through UCAM (Unique Clustering with Affinity Measure) clustering algorithm and then UCAM is refined to give a new view namely Fuzzy-UCAM. Fuzzy C-Means algorithm should be initiated with the number of cluster C and initial seeds. For real time large database it's difficult to predict the number of cluster and initial seeds accurately. In order to overcome this drawback the current paper focused on developing the Fuzzy-UCAM algorithm for clustering without giving initial seed and number of clusters for Fuzzy C-Means. Unique clustering is obtained with the help of affinity measures.