Ludmila Himmelspach, Daniel Hommers, Stefan Conrad
{"title":"Cluster Tendency Assessment for Fuzzy Clustering of Incomplete Data","authors":"Ludmila Himmelspach, Daniel Hommers, Stefan Conrad","doi":"10.2991/eusflat.2011.136","DOIUrl":null,"url":null,"abstract":"The quality of results for partitioning clustering algorithms depends on the assumption made on the number of clusters presented in the data set. Applying clustering methods on real data missing values turn out to be an additional challenging problem for clustering algorithms. Fuzzy clustering approaches adapted to incomplete data perform well for a given number of clusters. In this study, we analyse dierent cluster validity functions in terms of applicability on incomplete data on the one hand. On the other hand we analyse in experiments on several data sets to what extent the clustering results produced by fuzzy clustering methods for incomplete data reect the distribution structure of data.","PeriodicalId":403191,"journal":{"name":"EUSFLAT Conf.","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EUSFLAT Conf.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2991/eusflat.2011.136","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
The quality of results for partitioning clustering algorithms depends on the assumption made on the number of clusters presented in the data set. Applying clustering methods on real data missing values turn out to be an additional challenging problem for clustering algorithms. Fuzzy clustering approaches adapted to incomplete data perform well for a given number of clusters. In this study, we analyse dierent cluster validity functions in terms of applicability on incomplete data on the one hand. On the other hand we analyse in experiments on several data sets to what extent the clustering results produced by fuzzy clustering methods for incomplete data reect the distribution structure of data.