Efficiency of Time Series Clustering Method Based on Distribution of Difference Using Several Distances

2022 19th International Joint Conference on Computer Science and Software Engineering (JCSSE) Pub Date : 2022-06-22 DOI:10.1109/jcsse54890.2022.9836279

Phudit Thanakulkairid, Tanupat Trakulthongchai, Naruesorn Prabpon, Pat Vatiwutipong

引用次数: 0

Abstract

Clustering is a machine learning method widely used in time series analysis. In this work, we cluster time series by applying four distance functions: Euclidean distance, Kullback-Leibler divergence, Wasserstein distance, and dynamic time warping. We consider the distribution of the first-order difference of time series and compare time series using such distributions under each of the four distances. Then, we model each time series as a vertex of a graph and the distance between each pair of time series as a weighted edge. Graph partitioning is performed as a clustering method. The advantages and drawbacks of each method are discussed. The experimental results show that Euclidean distance and Kullback-Leibler divergence perform better and more efficient clustering than the other two.

查看原文本刊更多论文

基于多距离差分分布的时间序列聚类方法的有效性

聚类是一种广泛应用于时间序列分析的机器学习方法。在这项工作中，我们通过应用四个距离函数对时间序列进行聚类:欧几里得距离、Kullback-Leibler散度、Wasserstein距离和动态时间翘曲。我们考虑了时间序列的一阶差分分布，并利用这四种距离下的一阶差分分布对时间序列进行了比较。然后，我们将每个时间序列建模为一个图的顶点，将每对时间序列之间的距离建模为一个加权边。图划分作为一种聚类方法来执行。讨论了每种方法的优缺点。实验结果表明，欧几里得距离和Kullback-Leibler散度的聚类效果优于其他两种聚类方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 19th International Joint Conference on Computer Science and Software Engineering (JCSSE)

自引率

0.00%

发文量