Oussama Chabih, Sara Sbai, Hicham Behja, Mohammed Reda Chbihi Louhdi, E. Zemmouri, B. Trousse
{"title":"确定无监督分类中最优簇数K的新方法","authors":"Oussama Chabih, Sara Sbai, Hicham Behja, Mohammed Reda Chbihi Louhdi, E. Zemmouri, B. Trousse","doi":"10.1109/CiSt49399.2021.9357249","DOIUrl":null,"url":null,"abstract":"One of the most common problems in the field of information processing is the definition of the optimal number of clusters when using unsupervised classification. The wrong choice of the number of clusters leads to insignificant and inconsistent results. In this paper, we present a simple automatic technique for estimating the optimal number of clusters when using a classification algorithm based on k-means. The proposed algorithm is based on detecting the immutable clusters during the different experiments and takes them as a reference in order to specify the best classification.","PeriodicalId":253233,"journal":{"name":"2020 6th IEEE Congress on Information Science and Technology (CiSt)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"New approach to determine the optimal number of clusters K in unsupervised classification\",\"authors\":\"Oussama Chabih, Sara Sbai, Hicham Behja, Mohammed Reda Chbihi Louhdi, E. Zemmouri, B. Trousse\",\"doi\":\"10.1109/CiSt49399.2021.9357249\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"One of the most common problems in the field of information processing is the definition of the optimal number of clusters when using unsupervised classification. The wrong choice of the number of clusters leads to insignificant and inconsistent results. In this paper, we present a simple automatic technique for estimating the optimal number of clusters when using a classification algorithm based on k-means. The proposed algorithm is based on detecting the immutable clusters during the different experiments and takes them as a reference in order to specify the best classification.\",\"PeriodicalId\":253233,\"journal\":{\"name\":\"2020 6th IEEE Congress on Information Science and Technology (CiSt)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 6th IEEE Congress on Information Science and Technology (CiSt)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CiSt49399.2021.9357249\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 6th IEEE Congress on Information Science and Technology (CiSt)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CiSt49399.2021.9357249","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
New approach to determine the optimal number of clusters K in unsupervised classification
One of the most common problems in the field of information processing is the definition of the optimal number of clusters when using unsupervised classification. The wrong choice of the number of clusters leads to insignificant and inconsistent results. In this paper, we present a simple automatic technique for estimating the optimal number of clusters when using a classification algorithm based on k-means. The proposed algorithm is based on detecting the immutable clusters during the different experiments and takes them as a reference in order to specify the best classification.