A new clustering algorithm of large datasets with O(N) computational complexity

5th International Conference on Intelligent Systems Design and Applications (ISDA'05) Pub Date : 2005-09-08 DOI:10.1109/ISDA.2005.12

Nuannuan Zong, Feng Gui, M. Adjouadi

{"title":"A new clustering algorithm of large datasets with O(N) computational complexity","authors":"Nuannuan Zong, Feng Gui, M. Adjouadi","doi":"10.1109/ISDA.2005.12","DOIUrl":null,"url":null,"abstract":"In fields such as bioinformatics, cytometry, geographic information systems, just to name a few, huge amount of data, often multidimensional in nature, has more than ever highlighted the need for new algorithms to reduce the computational requirements needed for data analysis and interpretation. In this study, we present a new unsupervised clustering algorithm /sub e/nsity-based adaptive window clustering algorithm, which reduces the computational load to /spl sim/ O(N) number of computations, making it more attractive and faster than current hierarchical algorithms. This method relies on weighting a dataset to grid points on a mesh, and identifies the density peaks by reducing low density points, ranking and correlation calculation. The adaptive windows used are a modification of the recently proposed k-windows clustering algorithm to shape the desired clusters. The new algorithm makes it easier for users to observe and analyze data for enhanced interpretation and improved real-world applications, especially in clinical practices.","PeriodicalId":345842,"journal":{"name":"5th International Conference on Intelligent Systems Design and Applications (ISDA'05)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"5th International Conference on Intelligent Systems Design and Applications (ISDA'05)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISDA.2005.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

In fields such as bioinformatics, cytometry, geographic information systems, just to name a few, huge amount of data, often multidimensional in nature, has more than ever highlighted the need for new algorithms to reduce the computational requirements needed for data analysis and interpretation. In this study, we present a new unsupervised clustering algorithm /sub e/nsity-based adaptive window clustering algorithm, which reduces the computational load to /spl sim/ O(N) number of computations, making it more attractive and faster than current hierarchical algorithms. This method relies on weighting a dataset to grid points on a mesh, and identifies the density peaks by reducing low density points, ranking and correlation calculation. The adaptive windows used are a modification of the recently proposed k-windows clustering algorithm to shape the desired clusters. The new algorithm makes it easier for users to observe and analyze data for enhanced interpretation and improved real-world applications, especially in clinical practices.

查看原文本刊更多论文

一种计算复杂度为0 (N)的大型数据集聚类算法

在生物信息学、细胞术、地理信息系统等领域，仅举几例，大量的数据，通常是多维的，比以往任何时候都更加突出了对新算法的需求，以减少数据分析和解释所需的计算需求。在本研究中，我们提出了一种新的无监督聚类算法/基于子/密度的自适应窗口聚类算法，该算法将计算负荷降低到/spl sim/ O(N)次，使其比现有的分层算法更具吸引力和更快。该方法依赖于将数据集加权到网格上的网格点，并通过减少低密度点、排序和关联计算来识别密度峰值。使用的自适应窗口是对最近提出的k窗口聚类算法的修改，以形成所需的聚类。新算法使用户更容易观察和分析数据，以增强解释和改进现实世界的应用，特别是在临床实践中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

5th International Conference on Intelligent Systems Design and Applications (ISDA'05)

自引率

0.00%

发文量