基于加权k均值和萤火虫优化算法的有效数据聚类系统

Keerthi Shetty, CV Aravinda
{"title":"基于加权k均值和萤火虫优化算法的有效数据聚类系统","authors":"Keerthi Shetty, CV Aravinda","doi":"10.1109/DISCOVER52564.2021.9663710","DOIUrl":null,"url":null,"abstract":"Clustering of data is a standard way used for analyzing the data in several applications such as, data mining, image analysis, pattern recognition, etc. The weighted K-means clustering is one amongst the various data mining techniques used for clustering of the data. The key advantages of weighted k-means clustering are efficient in managing huge amount of data, easy to implement, scalable, simple and easily modifiable. In contrast, the major disadvantage of weighted K-means clustering is the problem with choosing the initial centroids. This clustering technique chooses the initial centroids randomly that leads to a local optimum solution. To address this concern, an effective naturally-inspired optimization algorithm: fire-fly optimization is combined with weighted k-means clustering for obtaining the global optimum solution. In this research paper, weighted k-means clustering along with fire-fly optimization algorithm was developed for enhancing the performance of information sharing and searching efficiency among the population. Here, the proposed system was experimented on dissimilar medical datasets such as, heart disease (original), heart disease (stat-log), liver disease and Indian liver patients. In the practical study, the proposed method enhances the performance up to 0.02-0.4 (label value) as compared to the existing systems by using the concept of precision, recall, and FB-cubed.","PeriodicalId":413789,"journal":{"name":"2021 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"An Effective Data Clustering System using Weighted K-Means and Firefly Optimization Algorithms\",\"authors\":\"Keerthi Shetty, CV Aravinda\",\"doi\":\"10.1109/DISCOVER52564.2021.9663710\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clustering of data is a standard way used for analyzing the data in several applications such as, data mining, image analysis, pattern recognition, etc. The weighted K-means clustering is one amongst the various data mining techniques used for clustering of the data. The key advantages of weighted k-means clustering are efficient in managing huge amount of data, easy to implement, scalable, simple and easily modifiable. In contrast, the major disadvantage of weighted K-means clustering is the problem with choosing the initial centroids. This clustering technique chooses the initial centroids randomly that leads to a local optimum solution. To address this concern, an effective naturally-inspired optimization algorithm: fire-fly optimization is combined with weighted k-means clustering for obtaining the global optimum solution. In this research paper, weighted k-means clustering along with fire-fly optimization algorithm was developed for enhancing the performance of information sharing and searching efficiency among the population. Here, the proposed system was experimented on dissimilar medical datasets such as, heart disease (original), heart disease (stat-log), liver disease and Indian liver patients. In the practical study, the proposed method enhances the performance up to 0.02-0.4 (label value) as compared to the existing systems by using the concept of precision, recall, and FB-cubed.\",\"PeriodicalId\":413789,\"journal\":{\"name\":\"2021 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER)\",\"volume\":\"47 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DISCOVER52564.2021.9663710\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DISCOVER52564.2021.9663710","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

数据聚类是数据分析的一种标准方法,在数据挖掘、图像分析、模式识别等应用中得到广泛应用。加权k均值聚类是用于数据聚类的各种数据挖掘技术中的一种。加权k-均值聚类的主要优点是管理大量数据的效率高、易于实现、可扩展、简单且易于修改。相比之下,加权k均值聚类的主要缺点是初始质心的选择问题。该聚类技术随机选择初始质心,从而得到一个局部最优解。为了解决这个问题,一种有效的自然启发优化算法:萤火虫优化与加权k-means聚类相结合,以获得全局最优解。为了提高种群间的信息共享性能和搜索效率,本文提出了加权k-means聚类和萤火虫优化算法。在这里,所提出的系统在不同的医疗数据集上进行了实验,如心脏病(原始)、心脏病(统计日志)、肝病和印度肝病患者。在实际研究中,与现有系统相比,该方法通过使用精度、召回率和fb立方的概念,将性能提高了0.02-0.4(标签值)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An Effective Data Clustering System using Weighted K-Means and Firefly Optimization Algorithms
Clustering of data is a standard way used for analyzing the data in several applications such as, data mining, image analysis, pattern recognition, etc. The weighted K-means clustering is one amongst the various data mining techniques used for clustering of the data. The key advantages of weighted k-means clustering are efficient in managing huge amount of data, easy to implement, scalable, simple and easily modifiable. In contrast, the major disadvantage of weighted K-means clustering is the problem with choosing the initial centroids. This clustering technique chooses the initial centroids randomly that leads to a local optimum solution. To address this concern, an effective naturally-inspired optimization algorithm: fire-fly optimization is combined with weighted k-means clustering for obtaining the global optimum solution. In this research paper, weighted k-means clustering along with fire-fly optimization algorithm was developed for enhancing the performance of information sharing and searching efficiency among the population. Here, the proposed system was experimented on dissimilar medical datasets such as, heart disease (original), heart disease (stat-log), liver disease and Indian liver patients. In the practical study, the proposed method enhances the performance up to 0.02-0.4 (label value) as compared to the existing systems by using the concept of precision, recall, and FB-cubed.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信