{"title":"Research on Similarity Search Technique of Variable Long Time Series Data Mining","authors":"Mengru Zhang","doi":"10.1109/ICDSBA53075.2021.00014","DOIUrl":null,"url":null,"abstract":"Time series similarity search is the main subroutine of time series data mining algorithm. The efficiency of time series similarity research has become an obstacle to the development of time series mining algorithms. The representation of time series and the measurement of similarity are the basis of time series similarity research and play a crucial role in completing the similarity search task of time series. As a method of measuring similarity, the dynamic distortion of time can be effectively dealt with by deforming the time series over time, and it has good stability. However, time series data is usually an ever-increasing data stream, and the direct study of similarity will cause considerable storage space consumption, and may affect the accuracy and reliability of the algorithm. Therefore, it is necessary to determine the time series in advance, express the main self of the original time series in a concise and abstract form, and carry out similarity search on the developed sequence to improve the similarity search efficiency of the sequence.","PeriodicalId":154348,"journal":{"name":"2021 5th Annual International Conference on Data Science and Business Analytics (ICDSBA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 5th Annual International Conference on Data Science and Business Analytics (ICDSBA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSBA53075.2021.00014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Time series similarity search is the main subroutine of time series data mining algorithm. The efficiency of time series similarity research has become an obstacle to the development of time series mining algorithms. The representation of time series and the measurement of similarity are the basis of time series similarity research and play a crucial role in completing the similarity search task of time series. As a method of measuring similarity, the dynamic distortion of time can be effectively dealt with by deforming the time series over time, and it has good stability. However, time series data is usually an ever-increasing data stream, and the direct study of similarity will cause considerable storage space consumption, and may affect the accuracy and reliability of the algorithm. Therefore, it is necessary to determine the time series in advance, express the main self of the original time series in a concise and abstract form, and carry out similarity search on the developed sequence to improve the similarity search efficiency of the sequence.