Mia Nuranti Putri Pamulang, Mia Nuur Aini, Ultach Enri3
{"title":"Komparasi Distance Measure Pada K-Medoids Clustering untuk Pengelompokkan Penyakit ISPA","authors":"Mia Nuranti Putri Pamulang, Mia Nuur Aini, Ultach Enri3","doi":"10.29408/edumatic.v5i1.3359","DOIUrl":null,"url":null,"abstract":"K-Medoids is an unsupervised algorithm that uses a distance measure to classify data. The distance measure is a method that can help an algorithm classify data based on the similarity of the variables. Several studies have shown that using the right distance measure can improve the performance of the algorithm in clustering. Euclidean and Chebyshev is two of some distance measures that can be used. In 2016, Karawang Health Office stated that 175.891 Karawang citizens were suffering from ISPA. This figure continued to increase in the following year until 2019. The total of Karawang citizens who suffering from ISPA reached 181.945 people. To assist the government in overcoming this problem, a clustering process will be carried out to group the areas where the ISPA is spreading in Karawang District. The area will be divided into three clusters, namely low, medium and high. Comparison of distance measures is carried out to find the best model based on the evaluation of the Davies Bouldin Index (DBI). The use of Euclidean-distance produces a DBI score of 0,088 meanwhile the use of Chebyshev distance resulted in a DBI score of 0,116. The performance of the K-Medoids algorithm with Euclidean-distance is considered to be better than Chebyshev distance because it produces a DBI score that is near to 0.","PeriodicalId":314771,"journal":{"name":"Edumatic: Jurnal Pendidikan Informatika","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Edumatic: Jurnal Pendidikan Informatika","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29408/edumatic.v5i1.3359","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
摘要
k - mediids是一种无监督算法,它使用距离度量来对数据进行分类。距离度量是一种可以帮助算法根据变量的相似性对数据进行分类的方法。一些研究表明,使用合适的距离度量可以提高算法在聚类中的性能。欧几里得和切比雪夫是两种可用的距离度量。2016年,卡拉旺卫生办公室表示,175.891名卡拉旺市民患有ISPA。这一数字在第二年继续增加,直到2019年。患有ISPA的卡拉旺市民总数达到181,945人。为了帮助政府克服这一问题,将进行一个集群过程,将ISPA在卡拉旺地区蔓延的地区分组。该区域将分为三个集群,即低、中、高。在评价Davies Bouldin指数(DBI)的基础上,对距离测度进行比较,寻找最佳模型。使用欧几里得距离的DBI得分为0.088,而使用切比雪夫距离的DBI得分为0.116。具有欧几里得距离的K-Medoids算法的性能被认为比Chebyshev距离更好,因为它产生的DBI分数接近于0。
Komparasi Distance Measure Pada K-Medoids Clustering untuk Pengelompokkan Penyakit ISPA
K-Medoids is an unsupervised algorithm that uses a distance measure to classify data. The distance measure is a method that can help an algorithm classify data based on the similarity of the variables. Several studies have shown that using the right distance measure can improve the performance of the algorithm in clustering. Euclidean and Chebyshev is two of some distance measures that can be used. In 2016, Karawang Health Office stated that 175.891 Karawang citizens were suffering from ISPA. This figure continued to increase in the following year until 2019. The total of Karawang citizens who suffering from ISPA reached 181.945 people. To assist the government in overcoming this problem, a clustering process will be carried out to group the areas where the ISPA is spreading in Karawang District. The area will be divided into three clusters, namely low, medium and high. Comparison of distance measures is carried out to find the best model based on the evaluation of the Davies Bouldin Index (DBI). The use of Euclidean-distance produces a DBI score of 0,088 meanwhile the use of Chebyshev distance resulted in a DBI score of 0,116. The performance of the K-Medoids algorithm with Euclidean-distance is considered to be better than Chebyshev distance because it produces a DBI score that is near to 0.