{"title":"An Effective Data Clustering System using Weighted K-Means and Firefly Optimization Algorithms","authors":"Keerthi Shetty, CV Aravinda","doi":"10.1109/DISCOVER52564.2021.9663710","DOIUrl":null,"url":null,"abstract":"Clustering of data is a standard way used for analyzing the data in several applications such as, data mining, image analysis, pattern recognition, etc. The weighted K-means clustering is one amongst the various data mining techniques used for clustering of the data. The key advantages of weighted k-means clustering are efficient in managing huge amount of data, easy to implement, scalable, simple and easily modifiable. In contrast, the major disadvantage of weighted K-means clustering is the problem with choosing the initial centroids. This clustering technique chooses the initial centroids randomly that leads to a local optimum solution. To address this concern, an effective naturally-inspired optimization algorithm: fire-fly optimization is combined with weighted k-means clustering for obtaining the global optimum solution. In this research paper, weighted k-means clustering along with fire-fly optimization algorithm was developed for enhancing the performance of information sharing and searching efficiency among the population. Here, the proposed system was experimented on dissimilar medical datasets such as, heart disease (original), heart disease (stat-log), liver disease and Indian liver patients. In the practical study, the proposed method enhances the performance up to 0.02-0.4 (label value) as compared to the existing systems by using the concept of precision, recall, and FB-cubed.","PeriodicalId":413789,"journal":{"name":"2021 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DISCOVER52564.2021.9663710","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Clustering of data is a standard way used for analyzing the data in several applications such as, data mining, image analysis, pattern recognition, etc. The weighted K-means clustering is one amongst the various data mining techniques used for clustering of the data. The key advantages of weighted k-means clustering are efficient in managing huge amount of data, easy to implement, scalable, simple and easily modifiable. In contrast, the major disadvantage of weighted K-means clustering is the problem with choosing the initial centroids. This clustering technique chooses the initial centroids randomly that leads to a local optimum solution. To address this concern, an effective naturally-inspired optimization algorithm: fire-fly optimization is combined with weighted k-means clustering for obtaining the global optimum solution. In this research paper, weighted k-means clustering along with fire-fly optimization algorithm was developed for enhancing the performance of information sharing and searching efficiency among the population. Here, the proposed system was experimented on dissimilar medical datasets such as, heart disease (original), heart disease (stat-log), liver disease and Indian liver patients. In the practical study, the proposed method enhances the performance up to 0.02-0.4 (label value) as compared to the existing systems by using the concept of precision, recall, and FB-cubed.