Dataset transformation using hybrid method of polar-based cartesian and image filtering technique for annual rainfall clustering

2021 International Seminar on Intelligent Technology and Its Applications (ISITIA) Pub Date : 2021-07-21 DOI:10.1109/ISITIA52817.2021.9502202

B. Suprapty, Anggri Sartika Wiguna, Agusma Wajiansyah, R. Malani

{"title":"Dataset transformation using hybrid method of polar-based cartesian and image filtering technique for annual rainfall clustering","authors":"B. Suprapty, Anggri Sartika Wiguna, Agusma Wajiansyah, R. Malani","doi":"10.1109/ISITIA52817.2021.9502202","DOIUrl":null,"url":null,"abstract":"Clustering is categorized as unsupervised learning because there is no class label information available in grouping a dataset. For this reason, an assessment of the quality of the clustering results is critical. In general, two essential clustering parameters are the similarities between cluster members in a cluster and the cluster centers’ separation. Various approaches can be used to improve a clustering algorithm’s performance: raw data pre-processing, cluster center initialization techniques, objective function assignment, modification of specific steps, and others. The subject of this study is the pattern of rainfall every month during the year of the observation period obtained from the clustering process. This study aims to improve the performance of K-Mean Clustering through manipulation of raw data pre-processing into certain datasets. Image filtering technique is used to generate a dataset based on the relationship between neighboring rainfall values. Polar-based Cartesian data space transformation is used to generate a dataset based on a range of rainfall values for each month during the year of the observation period. Four scenarios have been used to test the performance of the proposed method. The study results show that the proposed method produces the highest performance ratio (54.79%) of all scenarios’ total average GOS (Global Optimum Solution). Meanwhile, increasing GOS to the original method also resulted in the highest increase in GOS ratio (63.68%) compared to other methods. Further studies will focus on the application of the proposed methods for improving the performance of SOM and Fuzzy C-Mean Clustering.","PeriodicalId":161240,"journal":{"name":"2021 International Seminar on Intelligent Technology and Its Applications (ISITIA)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Seminar on Intelligent Technology and Its Applications (ISITIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISITIA52817.2021.9502202","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Clustering is categorized as unsupervised learning because there is no class label information available in grouping a dataset. For this reason, an assessment of the quality of the clustering results is critical. In general, two essential clustering parameters are the similarities between cluster members in a cluster and the cluster centers’ separation. Various approaches can be used to improve a clustering algorithm’s performance: raw data pre-processing, cluster center initialization techniques, objective function assignment, modification of specific steps, and others. The subject of this study is the pattern of rainfall every month during the year of the observation period obtained from the clustering process. This study aims to improve the performance of K-Mean Clustering through manipulation of raw data pre-processing into certain datasets. Image filtering technique is used to generate a dataset based on the relationship between neighboring rainfall values. Polar-based Cartesian data space transformation is used to generate a dataset based on a range of rainfall values for each month during the year of the observation period. Four scenarios have been used to test the performance of the proposed method. The study results show that the proposed method produces the highest performance ratio (54.79%) of all scenarios’ total average GOS (Global Optimum Solution). Meanwhile, increasing GOS to the original method also resulted in the highest increase in GOS ratio (63.68%) compared to other methods. Further studies will focus on the application of the proposed methods for improving the performance of SOM and Fuzzy C-Mean Clustering.

查看原文本刊更多论文

基于极坐标和图像滤波技术的数据集变换用于年降水聚类

聚类被归类为无监督学习，因为在分组数据集时没有可用的类标签信息。因此，对聚类结果的质量进行评估是至关重要的。一般来说，两个重要的聚类参数是聚类中聚类成员之间的相似性和聚类中心的间距。可以使用各种方法来提高聚类算法的性能:原始数据预处理、聚类中心初始化技术、目标函数分配、修改特定步骤等等。本研究的对象是通过聚类过程获得的观测期内的年逐月降水格局。本研究旨在通过将原始数据预处理成特定的数据集来提高k -均值聚类的性能。采用图像滤波技术，根据相邻降雨量之间的关系生成数据集。利用基于极坐标的笛卡尔数据空间变换，根据观测期全年每个月的降雨值范围生成数据集。四个场景已经被用来测试所提出的方法的性能。研究结果表明，该方法在所有场景的总平均GOS(全局最优解)中产生最高的性能比(54.79%)。同时，与其他方法相比，在原方法基础上增加GOS也使GOS比值增加最多(63.68%)。进一步的研究将集中在应用所提出的方法来提高SOM和模糊c均值聚类的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 International Seminar on Intelligent Technology and Its Applications (ISITIA)

自引率

0.00%

发文量