Dataset transformation using hybrid method of polar-based cartesian and image filtering technique for annual rainfall clustering

B. Suprapty, Anggri Sartika Wiguna, Agusma Wajiansyah, R. Malani
{"title":"Dataset transformation using hybrid method of polar-based cartesian and image filtering technique for annual rainfall clustering","authors":"B. Suprapty, Anggri Sartika Wiguna, Agusma Wajiansyah, R. Malani","doi":"10.1109/ISITIA52817.2021.9502202","DOIUrl":null,"url":null,"abstract":"Clustering is categorized as unsupervised learning because there is no class label information available in grouping a dataset. For this reason, an assessment of the quality of the clustering results is critical. In general, two essential clustering parameters are the similarities between cluster members in a cluster and the cluster centers’ separation. Various approaches can be used to improve a clustering algorithm’s performance: raw data pre-processing, cluster center initialization techniques, objective function assignment, modification of specific steps, and others. The subject of this study is the pattern of rainfall every month during the year of the observation period obtained from the clustering process. This study aims to improve the performance of K-Mean Clustering through manipulation of raw data pre-processing into certain datasets. Image filtering technique is used to generate a dataset based on the relationship between neighboring rainfall values. Polar-based Cartesian data space transformation is used to generate a dataset based on a range of rainfall values for each month during the year of the observation period. Four scenarios have been used to test the performance of the proposed method. The study results show that the proposed method produces the highest performance ratio (54.79%) of all scenarios’ total average GOS (Global Optimum Solution). Meanwhile, increasing GOS to the original method also resulted in the highest increase in GOS ratio (63.68%) compared to other methods. Further studies will focus on the application of the proposed methods for improving the performance of SOM and Fuzzy C-Mean Clustering.","PeriodicalId":161240,"journal":{"name":"2021 International Seminar on Intelligent Technology and Its Applications (ISITIA)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Seminar on Intelligent Technology and Its Applications (ISITIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISITIA52817.2021.9502202","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Clustering is categorized as unsupervised learning because there is no class label information available in grouping a dataset. For this reason, an assessment of the quality of the clustering results is critical. In general, two essential clustering parameters are the similarities between cluster members in a cluster and the cluster centers’ separation. Various approaches can be used to improve a clustering algorithm’s performance: raw data pre-processing, cluster center initialization techniques, objective function assignment, modification of specific steps, and others. The subject of this study is the pattern of rainfall every month during the year of the observation period obtained from the clustering process. This study aims to improve the performance of K-Mean Clustering through manipulation of raw data pre-processing into certain datasets. Image filtering technique is used to generate a dataset based on the relationship between neighboring rainfall values. Polar-based Cartesian data space transformation is used to generate a dataset based on a range of rainfall values for each month during the year of the observation period. Four scenarios have been used to test the performance of the proposed method. The study results show that the proposed method produces the highest performance ratio (54.79%) of all scenarios’ total average GOS (Global Optimum Solution). Meanwhile, increasing GOS to the original method also resulted in the highest increase in GOS ratio (63.68%) compared to other methods. Further studies will focus on the application of the proposed methods for improving the performance of SOM and Fuzzy C-Mean Clustering.
基于极坐标和图像滤波技术的数据集变换用于年降水聚类
聚类被归类为无监督学习,因为在分组数据集时没有可用的类标签信息。因此,对聚类结果的质量进行评估是至关重要的。一般来说,两个重要的聚类参数是聚类中聚类成员之间的相似性和聚类中心的间距。可以使用各种方法来提高聚类算法的性能:原始数据预处理、聚类中心初始化技术、目标函数分配、修改特定步骤等等。本研究的对象是通过聚类过程获得的观测期内的年逐月降水格局。本研究旨在通过将原始数据预处理成特定的数据集来提高k -均值聚类的性能。采用图像滤波技术,根据相邻降雨量之间的关系生成数据集。利用基于极坐标的笛卡尔数据空间变换,根据观测期全年每个月的降雨值范围生成数据集。四个场景已经被用来测试所提出的方法的性能。研究结果表明,该方法在所有场景的总平均GOS(全局最优解)中产生最高的性能比(54.79%)。同时,与其他方法相比,在原方法基础上增加GOS也使GOS比值增加最多(63.68%)。进一步的研究将集中在应用所提出的方法来提高SOM和模糊c均值聚类的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信