Research on Similarity Search Technique of Variable Long Time Series Data Mining

2021 5th Annual International Conference on Data Science and Business Analytics (ICDSBA) Pub Date : 2021-09-01 DOI:10.1109/ICDSBA53075.2021.00014

Mengru Zhang

{"title":"Research on Similarity Search Technique of Variable Long Time Series Data Mining","authors":"Mengru Zhang","doi":"10.1109/ICDSBA53075.2021.00014","DOIUrl":null,"url":null,"abstract":"Time series similarity search is the main subroutine of time series data mining algorithm. The efficiency of time series similarity research has become an obstacle to the development of time series mining algorithms. The representation of time series and the measurement of similarity are the basis of time series similarity research and play a crucial role in completing the similarity search task of time series. As a method of measuring similarity, the dynamic distortion of time can be effectively dealt with by deforming the time series over time, and it has good stability. However, time series data is usually an ever-increasing data stream, and the direct study of similarity will cause considerable storage space consumption, and may affect the accuracy and reliability of the algorithm. Therefore, it is necessary to determine the time series in advance, express the main self of the original time series in a concise and abstract form, and carry out similarity search on the developed sequence to improve the similarity search efficiency of the sequence.","PeriodicalId":154348,"journal":{"name":"2021 5th Annual International Conference on Data Science and Business Analytics (ICDSBA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 5th Annual International Conference on Data Science and Business Analytics (ICDSBA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSBA53075.2021.00014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Time series similarity search is the main subroutine of time series data mining algorithm. The efficiency of time series similarity research has become an obstacle to the development of time series mining algorithms. The representation of time series and the measurement of similarity are the basis of time series similarity research and play a crucial role in completing the similarity search task of time series. As a method of measuring similarity, the dynamic distortion of time can be effectively dealt with by deforming the time series over time, and it has good stability. However, time series data is usually an ever-increasing data stream, and the direct study of similarity will cause considerable storage space consumption, and may affect the accuracy and reliability of the algorithm. Therefore, it is necessary to determine the time series in advance, express the main self of the original time series in a concise and abstract form, and carry out similarity search on the developed sequence to improve the similarity search efficiency of the sequence.

查看原文本刊更多论文

变量长时间序列数据挖掘的相似度搜索技术研究

时间序列相似度搜索是时间序列数据挖掘算法的主要子程序。时间序列相似性研究的效率问题已经成为时间序列挖掘算法发展的障碍。时间序列的表示和相似度的度量是时间序列相似度研究的基础，对完成时间序列的相似度搜索任务起着至关重要的作用。作为一种度量相似度的方法，通过对时间序列进行随时间的变形，可以有效地处理时间的动态畸变，并且具有良好的稳定性。然而，时间序列数据通常是一个不断增长的数据流，直接研究相似度会消耗相当大的存储空间，并可能影响算法的准确性和可靠性。因此，有必要提前确定时间序列，以简洁抽象的形式表达原始时间序列的主体自我，并对开发的序列进行相似性搜索，以提高序列的相似性搜索效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 5th Annual International Conference on Data Science and Business Analytics (ICDSBA)

自引率

0.00%

发文量