数据挖掘中的聚类技术:方法、挑战和应用概览

Tasnim Alasali, Yasin Ortakci
{"title":"数据挖掘中的聚类技术:方法、挑战和应用概览","authors":"Tasnim Alasali, Yasin Ortakci","doi":"10.53070/bbd.1421527","DOIUrl":null,"url":null,"abstract":"Clustering is a crucial technique in both research and practical applications of data mining. It has traditionally functioned as a pivotal analytical technique, facilitating the organization of unlabeled data to extract meaningful insights. The inherent complexity of clustering challenges has led to the development of a variety of clustering algorithms. Each of these algorithms is tailored to address specific data clustering scenarios. In this context, this paper provides a thorough analysis of clustering techniques in data mining, including their challenges and applications in various domains. It also undertakes an extensive exploration of the strengths and limitations characterizing distinct clustering methodologies, encompassing distance-based, hierarchical, grid-based, and density-based algorithms. Additionally, it explains numerous examples of clustering algorithms and their empirical results in various domains, including but not limited to healthcare, image processing, text and document clustering, and the field of big data analytics.","PeriodicalId":503380,"journal":{"name":"Computer Science","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Veri Madenciliğinde Kümeleme Teknikleri: Yöntemler, Zorluklar ve Uygulamalar Üzerine Bir Araştırma\",\"authors\":\"Tasnim Alasali, Yasin Ortakci\",\"doi\":\"10.53070/bbd.1421527\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clustering is a crucial technique in both research and practical applications of data mining. It has traditionally functioned as a pivotal analytical technique, facilitating the organization of unlabeled data to extract meaningful insights. The inherent complexity of clustering challenges has led to the development of a variety of clustering algorithms. Each of these algorithms is tailored to address specific data clustering scenarios. In this context, this paper provides a thorough analysis of clustering techniques in data mining, including their challenges and applications in various domains. It also undertakes an extensive exploration of the strengths and limitations characterizing distinct clustering methodologies, encompassing distance-based, hierarchical, grid-based, and density-based algorithms. Additionally, it explains numerous examples of clustering algorithms and their empirical results in various domains, including but not limited to healthcare, image processing, text and document clustering, and the field of big data analytics.\",\"PeriodicalId\":503380,\"journal\":{\"name\":\"Computer Science\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.53070/bbd.1421527\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.53070/bbd.1421527","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

聚类是数据挖掘研究和实际应用中的一项重要技术。传统上,它是一种关键的分析技术,有助于组织无标记数据,以提取有意义的见解。聚类挑战固有的复杂性促使人们开发了各种聚类算法。这些算法中的每一种都是针对特定数据聚类场景量身定制的。在此背景下,本文对数据挖掘中的聚类技术进行了深入分析,包括它们在不同领域中面临的挑战和应用。本文还广泛探讨了不同聚类方法的优势和局限性,包括基于距离、层次、网格和密度的算法。此外,它还解释了聚类算法的大量实例及其在各个领域的经验结果,包括但不限于医疗保健、图像处理、文本和文档聚类以及大数据分析领域。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Veri Madenciliğinde Kümeleme Teknikleri: Yöntemler, Zorluklar ve Uygulamalar Üzerine Bir Araştırma
Clustering is a crucial technique in both research and practical applications of data mining. It has traditionally functioned as a pivotal analytical technique, facilitating the organization of unlabeled data to extract meaningful insights. The inherent complexity of clustering challenges has led to the development of a variety of clustering algorithms. Each of these algorithms is tailored to address specific data clustering scenarios. In this context, this paper provides a thorough analysis of clustering techniques in data mining, including their challenges and applications in various domains. It also undertakes an extensive exploration of the strengths and limitations characterizing distinct clustering methodologies, encompassing distance-based, hierarchical, grid-based, and density-based algorithms. Additionally, it explains numerous examples of clustering algorithms and their empirical results in various domains, including but not limited to healthcare, image processing, text and document clustering, and the field of big data analytics.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信