{"title":"A new clustering algorithm of large datasets with O(N) computational complexity","authors":"Nuannuan Zong, Feng Gui, M. Adjouadi","doi":"10.1109/ISDA.2005.12","DOIUrl":null,"url":null,"abstract":"In fields such as bioinformatics, cytometry, geographic information systems, just to name a few, huge amount of data, often multidimensional in nature, has more than ever highlighted the need for new algorithms to reduce the computational requirements needed for data analysis and interpretation. In this study, we present a new unsupervised clustering algorithm /sub e/nsity-based adaptive window clustering algorithm, which reduces the computational load to /spl sim/ O(N) number of computations, making it more attractive and faster than current hierarchical algorithms. This method relies on weighting a dataset to grid points on a mesh, and identifies the density peaks by reducing low density points, ranking and correlation calculation. The adaptive windows used are a modification of the recently proposed k-windows clustering algorithm to shape the desired clusters. The new algorithm makes it easier for users to observe and analyze data for enhanced interpretation and improved real-world applications, especially in clinical practices.","PeriodicalId":345842,"journal":{"name":"5th International Conference on Intelligent Systems Design and Applications (ISDA'05)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"5th International Conference on Intelligent Systems Design and Applications (ISDA'05)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISDA.2005.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
In fields such as bioinformatics, cytometry, geographic information systems, just to name a few, huge amount of data, often multidimensional in nature, has more than ever highlighted the need for new algorithms to reduce the computational requirements needed for data analysis and interpretation. In this study, we present a new unsupervised clustering algorithm /sub e/nsity-based adaptive window clustering algorithm, which reduces the computational load to /spl sim/ O(N) number of computations, making it more attractive and faster than current hierarchical algorithms. This method relies on weighting a dataset to grid points on a mesh, and identifies the density peaks by reducing low density points, ranking and correlation calculation. The adaptive windows used are a modification of the recently proposed k-windows clustering algorithm to shape the desired clusters. The new algorithm makes it easier for users to observe and analyze data for enhanced interpretation and improved real-world applications, especially in clinical practices.