{"title":"Optimal K-Means Clustering Algorithm for Weblog Mining","authors":"Vipin Jain, K. Kashyap","doi":"10.1109/CSNT51715.2021.9509644","DOIUrl":null,"url":null,"abstract":"World wide web (WWW) generates a huge number of unstructured data and information. The information is stored in weblog file. Weblogs information can be analyzed and visualized by various clustering algorithm. In this work, the k-means clustering algorithm is applied for grouping of the users with similar interest based on accessing of similer information. The optimal value of k is also determined by Elbow method to obtained optimal numbers of clusters. Clustering results are analyzed by various values of k. Comparative analysis ofvarious methods used for selecting the optimal number of clusters are also analyzed.","PeriodicalId":122176,"journal":{"name":"2021 10th IEEE International Conference on Communication Systems and Network Technologies (CSNT)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 10th IEEE International Conference on Communication Systems and Network Technologies (CSNT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSNT51715.2021.9509644","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
World wide web (WWW) generates a huge number of unstructured data and information. The information is stored in weblog file. Weblogs information can be analyzed and visualized by various clustering algorithm. In this work, the k-means clustering algorithm is applied for grouping of the users with similar interest based on accessing of similer information. The optimal value of k is also determined by Elbow method to obtained optimal numbers of clusters. Clustering results are analyzed by various values of k. Comparative analysis ofvarious methods used for selecting the optimal number of clusters are also analyzed.