{"title":"一种新的聚类倾向集成方法用于数据聚类","authors":"Van Nha Pham, L. Ngo, L. T. Pham, Pham Van Hai","doi":"10.1145/3287921.3287927","DOIUrl":null,"url":null,"abstract":"The ensemble is an universal machine learning method that is based on the divide-and-conquer principle. The ensemble aims to improve performance of system in terms of processing speed and quality. The assessment of cluster tendency is a method determining whether a considering data-set contains meaningful clusters. Recently, a silhouette-based assessment of cluster tendency method (SACT) has been proposed to simultaneously determine the appropriate number of data clusters and the prototypes. The advantages of SACT are accuracy and less the parameter, while there are limitations in data size and processing speed. In this paper, we proposed an improved SACT method for data clustering. We call eSACT algorithm. Experiments were conducted on synthetic data-sets and color image images. The proposed algorithm exhibited high performance, reliability and accuracy compared to previous proposed algorithms in the assessment of cluster tendency.","PeriodicalId":448008,"journal":{"name":"Proceedings of the 9th International Symposium on Information and Communication Technology","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A New Assessment of Cluster Tendency Ensemble approach for Data Clustering\",\"authors\":\"Van Nha Pham, L. Ngo, L. T. Pham, Pham Van Hai\",\"doi\":\"10.1145/3287921.3287927\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The ensemble is an universal machine learning method that is based on the divide-and-conquer principle. The ensemble aims to improve performance of system in terms of processing speed and quality. The assessment of cluster tendency is a method determining whether a considering data-set contains meaningful clusters. Recently, a silhouette-based assessment of cluster tendency method (SACT) has been proposed to simultaneously determine the appropriate number of data clusters and the prototypes. The advantages of SACT are accuracy and less the parameter, while there are limitations in data size and processing speed. In this paper, we proposed an improved SACT method for data clustering. We call eSACT algorithm. Experiments were conducted on synthetic data-sets and color image images. The proposed algorithm exhibited high performance, reliability and accuracy compared to previous proposed algorithms in the assessment of cluster tendency.\",\"PeriodicalId\":448008,\"journal\":{\"name\":\"Proceedings of the 9th International Symposium on Information and Communication Technology\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 9th International Symposium on Information and Communication Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3287921.3287927\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 9th International Symposium on Information and Communication Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3287921.3287927","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A New Assessment of Cluster Tendency Ensemble approach for Data Clustering
The ensemble is an universal machine learning method that is based on the divide-and-conquer principle. The ensemble aims to improve performance of system in terms of processing speed and quality. The assessment of cluster tendency is a method determining whether a considering data-set contains meaningful clusters. Recently, a silhouette-based assessment of cluster tendency method (SACT) has been proposed to simultaneously determine the appropriate number of data clusters and the prototypes. The advantages of SACT are accuracy and less the parameter, while there are limitations in data size and processing speed. In this paper, we proposed an improved SACT method for data clustering. We call eSACT algorithm. Experiments were conducted on synthetic data-sets and color image images. The proposed algorithm exhibited high performance, reliability and accuracy compared to previous proposed algorithms in the assessment of cluster tendency.