数据挖掘中的聚类技术：方法、挑战和应用概览

Computer Science Pub Date : 2024-03-05 DOI:10.53070/bbd.1421527

Tasnim Alasali, Yasin Ortakci

{"title":"数据挖掘中的聚类技术：方法、挑战和应用概览","authors":"Tasnim Alasali, Yasin Ortakci","doi":"10.53070/bbd.1421527","DOIUrl":null,"url":null,"abstract":"Clustering is a crucial technique in both research and practical applications of data mining. It has traditionally functioned as a pivotal analytical technique, facilitating the organization of unlabeled data to extract meaningful insights. The inherent complexity of clustering challenges has led to the development of a variety of clustering algorithms. Each of these algorithms is tailored to address specific data clustering scenarios. In this context, this paper provides a thorough analysis of clustering techniques in data mining, including their challenges and applications in various domains. It also undertakes an extensive exploration of the strengths and limitations characterizing distinct clustering methodologies, encompassing distance-based, hierarchical, grid-based, and density-based algorithms. Additionally, it explains numerous examples of clustering algorithms and their empirical results in various domains, including but not limited to healthcare, image processing, text and document clustering, and the field of big data analytics.","PeriodicalId":503380,"journal":{"name":"Computer Science","volume":"13 6","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Veri Madenciliğinde Kümeleme Teknikleri: Yöntemler, Zorluklar ve Uygulamalar Üzerine Bir Araştırma\",\"authors\":\"Tasnim Alasali, Yasin Ortakci\",\"doi\":\"10.53070/bbd.1421527\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clustering is a crucial technique in both research and practical applications of data mining. It has traditionally functioned as a pivotal analytical technique, facilitating the organization of unlabeled data to extract meaningful insights. The inherent complexity of clustering challenges has led to the development of a variety of clustering algorithms. Each of these algorithms is tailored to address specific data clustering scenarios. In this context, this paper provides a thorough analysis of clustering techniques in data mining, including their challenges and applications in various domains. It also undertakes an extensive exploration of the strengths and limitations characterizing distinct clustering methodologies, encompassing distance-based, hierarchical, grid-based, and density-based algorithms. Additionally, it explains numerous examples of clustering algorithms and their empirical results in various domains, including but not limited to healthcare, image processing, text and document clustering, and the field of big data analytics.\",\"PeriodicalId\":503380,\"journal\":{\"name\":\"Computer Science\",\"volume\":\"13 6\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.53070/bbd.1421527\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.53070/bbd.1421527","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

聚类是数据挖掘研究和实际应用中的一项重要技术。传统上，它是一种关键的分析技术，有助于组织无标记数据，以提取有意义的见解。聚类挑战固有的复杂性促使人们开发了各种聚类算法。这些算法中的每一种都是针对特定数据聚类场景量身定制的。在此背景下，本文对数据挖掘中的聚类技术进行了深入分析，包括它们在不同领域中面临的挑战和应用。本文还广泛探讨了不同聚类方法的优势和局限性，包括基于距离、层次、网格和密度的算法。此外，它还解释了聚类算法的大量实例及其在各个领域的经验结果，包括但不限于医疗保健、图像处理、文本和文档聚类以及大数据分析领域。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Veri Madenciliğinde Kümeleme Teknikleri: Yöntemler, Zorluklar ve Uygulamalar Üzerine Bir Araştırma

Clustering is a crucial technique in both research and practical applications of data mining. It has traditionally functioned as a pivotal analytical technique, facilitating the organization of unlabeled data to extract meaningful insights. The inherent complexity of clustering challenges has led to the development of a variety of clustering algorithms. Each of these algorithms is tailored to address specific data clustering scenarios. In this context, this paper provides a thorough analysis of clustering techniques in data mining, including their challenges and applications in various domains. It also undertakes an extensive exploration of the strengths and limitations characterizing distinct clustering methodologies, encompassing distance-based, hierarchical, grid-based, and density-based algorithms. Additionally, it explains numerous examples of clustering algorithms and their empirical results in various domains, including but not limited to healthcare, image processing, text and document clustering, and the field of big data analytics.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computer Science

自引率

0.00%

发文量