{"title":"博客挖掘的最优k均值聚类算法","authors":"Vipin Jain, K. Kashyap","doi":"10.1109/CSNT51715.2021.9509644","DOIUrl":null,"url":null,"abstract":"World wide web (WWW) generates a huge number of unstructured data and information. The information is stored in weblog file. Weblogs information can be analyzed and visualized by various clustering algorithm. In this work, the k-means clustering algorithm is applied for grouping of the users with similar interest based on accessing of similer information. The optimal value of k is also determined by Elbow method to obtained optimal numbers of clusters. Clustering results are analyzed by various values of k. Comparative analysis ofvarious methods used for selecting the optimal number of clusters are also analyzed.","PeriodicalId":122176,"journal":{"name":"2021 10th IEEE International Conference on Communication Systems and Network Technologies (CSNT)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimal K-Means Clustering Algorithm for Weblog Mining\",\"authors\":\"Vipin Jain, K. Kashyap\",\"doi\":\"10.1109/CSNT51715.2021.9509644\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"World wide web (WWW) generates a huge number of unstructured data and information. The information is stored in weblog file. Weblogs information can be analyzed and visualized by various clustering algorithm. In this work, the k-means clustering algorithm is applied for grouping of the users with similar interest based on accessing of similer information. The optimal value of k is also determined by Elbow method to obtained optimal numbers of clusters. Clustering results are analyzed by various values of k. Comparative analysis ofvarious methods used for selecting the optimal number of clusters are also analyzed.\",\"PeriodicalId\":122176,\"journal\":{\"name\":\"2021 10th IEEE International Conference on Communication Systems and Network Technologies (CSNT)\",\"volume\":\"91 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 10th IEEE International Conference on Communication Systems and Network Technologies (CSNT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSNT51715.2021.9509644\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 10th IEEE International Conference on Communication Systems and Network Technologies (CSNT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSNT51715.2021.9509644","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Optimal K-Means Clustering Algorithm for Weblog Mining
World wide web (WWW) generates a huge number of unstructured data and information. The information is stored in weblog file. Weblogs information can be analyzed and visualized by various clustering algorithm. In this work, the k-means clustering algorithm is applied for grouping of the users with similar interest based on accessing of similer information. The optimal value of k is also determined by Elbow method to obtained optimal numbers of clusters. Clustering results are analyzed by various values of k. Comparative analysis ofvarious methods used for selecting the optimal number of clusters are also analyzed.