{"title":"CABGD: An Improved Clustering Algorithm Based on Grid-Density","authors":"Lili Meng, Jiadong Ren, Changzhen Hu","doi":"10.1109/ICICIC.2009.131","DOIUrl":null,"url":null,"abstract":"In data mining fields, clustering is an important issue. Compared with other algorithms, grid-based algorithms generally have a fast processing time. However, since the size of a cell is determined by users, the large size cell may contain data points of different clusters and leads to low clustering quality. In this paper, we propose an improved clustering algorithm based on grid-density (CABGD). The concept of center intensity of grid cell is presented and is applied to identify the distribution of data points in a grid and to decide whether or not to split the grid. Then all density-connected grids are assigned to a cluster. Experimental results on synthetic datasets show that the algorithm has higher clustering accuracy and lower sensitivity to parameters.","PeriodicalId":240226,"journal":{"name":"2009 Fourth International Conference on Innovative Computing, Information and Control (ICICIC)","volume":"5 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Fourth International Conference on Innovative Computing, Information and Control (ICICIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICIC.2009.131","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
In data mining fields, clustering is an important issue. Compared with other algorithms, grid-based algorithms generally have a fast processing time. However, since the size of a cell is determined by users, the large size cell may contain data points of different clusters and leads to low clustering quality. In this paper, we propose an improved clustering algorithm based on grid-density (CABGD). The concept of center intensity of grid cell is presented and is applied to identify the distribution of data points in a grid and to decide whether or not to split the grid. Then all density-connected grids are assigned to a cluster. Experimental results on synthetic datasets show that the algorithm has higher clustering accuracy and lower sensitivity to parameters.