{"title":"DBSCAN算法确定Epsilon参数的分析","authors":"Herwin Simanjutak, Sawaluddin, M. Zarlis","doi":"10.5220/0008552002200222","DOIUrl":null,"url":null,"abstract":"DBSCAN algorithm (Density-Based Spatial Clustering of Applications with Noise) is one of the numerical based clustering algorithms, numerical data is used as the test for this algorithm. The DBSCAN algorithm has the disadvantage of being difficult to determine the appropriate Epsilon value in order to obtain good clustering results. In the DBSCAN algorithm, the value of epsilon is calculated based on a lot of data from the entire data that is captured. In this study a modification of the DBSCAN (Density -Based Spatial Clustering of Applications with Noise) algorithm was carried out by determining the value of epsilon, the results obtained in the study of Euclidean Distance obtained better than the results obtained from the DBSCAN.","PeriodicalId":414686,"journal":{"name":"Proceedings of the International Conference on Natural Resources and Technology","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Analysis of DBSCAN Algorithm in Determining Epsilon Parameters Numerical Data Clustering\",\"authors\":\"Herwin Simanjutak, Sawaluddin, M. Zarlis\",\"doi\":\"10.5220/0008552002200222\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"DBSCAN algorithm (Density-Based Spatial Clustering of Applications with Noise) is one of the numerical based clustering algorithms, numerical data is used as the test for this algorithm. The DBSCAN algorithm has the disadvantage of being difficult to determine the appropriate Epsilon value in order to obtain good clustering results. In the DBSCAN algorithm, the value of epsilon is calculated based on a lot of data from the entire data that is captured. In this study a modification of the DBSCAN (Density -Based Spatial Clustering of Applications with Noise) algorithm was carried out by determining the value of epsilon, the results obtained in the study of Euclidean Distance obtained better than the results obtained from the DBSCAN.\",\"PeriodicalId\":414686,\"journal\":{\"name\":\"Proceedings of the International Conference on Natural Resources and Technology\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the International Conference on Natural Resources and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5220/0008552002200222\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Conference on Natural Resources and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5220/0008552002200222","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
DBSCAN算法(Density-Based Spatial Clustering of Applications with Noise)是一种基于数值的聚类算法,用数值数据作为该算法的测试。DBSCAN算法的缺点是难以确定合适的Epsilon值以获得良好的聚类结果。在DBSCAN算法中,epsilon的值是根据捕获的整个数据中的大量数据计算的。本文通过确定epsilon的值,对DBSCAN (Density -Based Spatial Clustering of Applications with Noise)算法进行了改进,得到了比DBSCAN更好的欧氏距离研究结果。
Analysis of DBSCAN Algorithm in Determining Epsilon Parameters Numerical Data Clustering
DBSCAN algorithm (Density-Based Spatial Clustering of Applications with Noise) is one of the numerical based clustering algorithms, numerical data is used as the test for this algorithm. The DBSCAN algorithm has the disadvantage of being difficult to determine the appropriate Epsilon value in order to obtain good clustering results. In the DBSCAN algorithm, the value of epsilon is calculated based on a lot of data from the entire data that is captured. In this study a modification of the DBSCAN (Density -Based Spatial Clustering of Applications with Noise) algorithm was carried out by determining the value of epsilon, the results obtained in the study of Euclidean Distance obtained better than the results obtained from the DBSCAN.