A new clustering algorithm of large datasets with O(N) computational complexity

Nuannuan Zong, Feng Gui, M. Adjouadi
{"title":"A new clustering algorithm of large datasets with O(N) computational complexity","authors":"Nuannuan Zong, Feng Gui, M. Adjouadi","doi":"10.1109/ISDA.2005.12","DOIUrl":null,"url":null,"abstract":"In fields such as bioinformatics, cytometry, geographic information systems, just to name a few, huge amount of data, often multidimensional in nature, has more than ever highlighted the need for new algorithms to reduce the computational requirements needed for data analysis and interpretation. In this study, we present a new unsupervised clustering algorithm /sub e/nsity-based adaptive window clustering algorithm, which reduces the computational load to /spl sim/ O(N) number of computations, making it more attractive and faster than current hierarchical algorithms. This method relies on weighting a dataset to grid points on a mesh, and identifies the density peaks by reducing low density points, ranking and correlation calculation. The adaptive windows used are a modification of the recently proposed k-windows clustering algorithm to shape the desired clusters. The new algorithm makes it easier for users to observe and analyze data for enhanced interpretation and improved real-world applications, especially in clinical practices.","PeriodicalId":345842,"journal":{"name":"5th International Conference on Intelligent Systems Design and Applications (ISDA'05)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"5th International Conference on Intelligent Systems Design and Applications (ISDA'05)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISDA.2005.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

In fields such as bioinformatics, cytometry, geographic information systems, just to name a few, huge amount of data, often multidimensional in nature, has more than ever highlighted the need for new algorithms to reduce the computational requirements needed for data analysis and interpretation. In this study, we present a new unsupervised clustering algorithm /sub e/nsity-based adaptive window clustering algorithm, which reduces the computational load to /spl sim/ O(N) number of computations, making it more attractive and faster than current hierarchical algorithms. This method relies on weighting a dataset to grid points on a mesh, and identifies the density peaks by reducing low density points, ranking and correlation calculation. The adaptive windows used are a modification of the recently proposed k-windows clustering algorithm to shape the desired clusters. The new algorithm makes it easier for users to observe and analyze data for enhanced interpretation and improved real-world applications, especially in clinical practices.
一种计算复杂度为0 (N)的大型数据集聚类算法
在生物信息学、细胞术、地理信息系统等领域,仅举几例,大量的数据,通常是多维的,比以往任何时候都更加突出了对新算法的需求,以减少数据分析和解释所需的计算需求。在本研究中,我们提出了一种新的无监督聚类算法/基于子/密度的自适应窗口聚类算法,该算法将计算负荷降低到/spl sim/ O(N)次,使其比现有的分层算法更具吸引力和更快。该方法依赖于将数据集加权到网格上的网格点,并通过减少低密度点、排序和关联计算来识别密度峰值。使用的自适应窗口是对最近提出的k窗口聚类算法的修改,以形成所需的聚类。新算法使用户更容易观察和分析数据,以增强解释和改进现实世界的应用,特别是在临床实践中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信