{"title":"An Improved DBSCAN Clustering Algorithm for Multi-density Datasets","authors":"Tang Cheng","doi":"10.1145/3144789.3144808","DOIUrl":null,"url":null,"abstract":"In this paper, we proposed a DBSCAN-based clustering algorithm called NNDD-DBSCAN with the main focus of handling multi-density datasets and reducing parameter sensitivity. The NNDD-DBSCAN used a new distance measuring method called nearest neighbor density distance (NNDD) which makes the new algorithm can clustering properly in multi-density datasets. By analyzing the relationship between the threshold of nearest neighbor density distance and the threshold of nearest neighborcollection, we give a heuristic method to find the appropriate nearest neighbor density distance threshold and reducing parameter sensitivity. Experimental results show that the NNDD-DBSCAN has a good robustadaptation and can get the ideal clustering result both in single density datasets and multi-density datasets.","PeriodicalId":254163,"journal":{"name":"Proceedings of the 2nd International Conference on Intelligent Information Processing","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2nd International Conference on Intelligent Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3144789.3144808","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
In this paper, we proposed a DBSCAN-based clustering algorithm called NNDD-DBSCAN with the main focus of handling multi-density datasets and reducing parameter sensitivity. The NNDD-DBSCAN used a new distance measuring method called nearest neighbor density distance (NNDD) which makes the new algorithm can clustering properly in multi-density datasets. By analyzing the relationship between the threshold of nearest neighbor density distance and the threshold of nearest neighborcollection, we give a heuristic method to find the appropriate nearest neighbor density distance threshold and reducing parameter sensitivity. Experimental results show that the NNDD-DBSCAN has a good robustadaptation and can get the ideal clustering result both in single density datasets and multi-density datasets.