趋势基序检测:时间序列基序发现的有效框架

2021 11th International Conference on Information Technology in Medicine and Education (ITME) Pub Date : 2021-11-01 DOI:10.1109/ITME53901.2021.00035

Xiang Chen, Zongwen Fan, Jin Gou

{"title":"趋势基序检测:时间序列基序发现的有效框架","authors":"Xiang Chen, Zongwen Fan, Jin Gou","doi":"10.1109/ITME53901.2021.00035","DOIUrl":null,"url":null,"abstract":"The task of finding similar patterns in a long time series, commonly called motifs, has received continuous and increasing attention from diverse scientific fields. Although numerous approaches have been proposed for motif discovery, they cannot discover the motifs in an exact and efficient manner. Furthermore, domain knowledge is required from the experts for those methods to predefine the pattern length, which is also quite objective. In addiction, it is very time-consuming to extract the exact motifs and sometimes the extracted motif has no specific meanings. Especially in the field of financial and hydrology, many studies are focused on whether there is a fixed pattern including trend information hidden in the data. To address the above problems, we proposed a framework to automatically discovery the trend motifs without predefining the length of patterns. It has four main steps, (1) singular spectrum analysis is first applied to removed noise; (2) segmentation by extracting extreme points is then employed to automatically obtain the unequal length of time series pattern; (3) symbolic aggregate approximation is introduced to discretize the data and transform them into string sequences; (4) finally, the trend motifs are selected by measuring their similarity. Experimental results on the real-world time-series datasets reveal that our framework fit well in different circumstances, indicating our proposed framework is effective for trend motif discovery.","PeriodicalId":6774,"journal":{"name":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","volume":"26 1","pages":"122-126"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Detecting trend motifs: an efficient framework for time series motif discovery\",\"authors\":\"Xiang Chen, Zongwen Fan, Jin Gou\",\"doi\":\"10.1109/ITME53901.2021.00035\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The task of finding similar patterns in a long time series, commonly called motifs, has received continuous and increasing attention from diverse scientific fields. Although numerous approaches have been proposed for motif discovery, they cannot discover the motifs in an exact and efficient manner. Furthermore, domain knowledge is required from the experts for those methods to predefine the pattern length, which is also quite objective. In addiction, it is very time-consuming to extract the exact motifs and sometimes the extracted motif has no specific meanings. Especially in the field of financial and hydrology, many studies are focused on whether there is a fixed pattern including trend information hidden in the data. To address the above problems, we proposed a framework to automatically discovery the trend motifs without predefining the length of patterns. It has four main steps, (1) singular spectrum analysis is first applied to removed noise; (2) segmentation by extracting extreme points is then employed to automatically obtain the unequal length of time series pattern; (3) symbolic aggregate approximation is introduced to discretize the data and transform them into string sequences; (4) finally, the trend motifs are selected by measuring their similarity. Experimental results on the real-world time-series datasets reveal that our framework fit well in different circumstances, indicating our proposed framework is effective for trend motif discovery.\",\"PeriodicalId\":6774,\"journal\":{\"name\":\"2021 11th International Conference on Information Technology in Medicine and Education (ITME)\",\"volume\":\"26 1\",\"pages\":\"122-126\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 11th International Conference on Information Technology in Medicine and Education (ITME)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITME53901.2021.00035\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITME53901.2021.00035","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在长时间序列中寻找相似模式的任务，通常被称为基序，已经受到了各个科学领域不断增加的关注。虽然人们提出了许多发现母题的方法，但它们都不能准确有效地发现母题。此外，这些方法需要专家的领域知识来预先定义模式长度，这也是相当客观的。在成瘾中，提取准确的母题是非常耗时的，有时提取的母题没有特定的意义。特别是在金融和水文领域，许多研究都集中在数据中是否存在包含趋势信息的固定模式。为了解决上述问题，我们提出了一个无需预先定义模式长度即可自动发现趋势主题的框架。主要分为四个步骤:(1)首先应用奇异谱分析去除噪声;(2)提取极值点分割，自动获取不等长时间序列模式;(3)引入符号聚合近似对数据进行离散化，并将其转化为字符串序列;(4)最后，通过相似性度量选择趋势母题。在真实时间序列数据集上的实验结果表明，我们的框架在不同情况下都能很好地适应，表明我们提出的框架对趋势基序发现是有效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Detecting trend motifs: an efficient framework for time series motif discovery

The task of finding similar patterns in a long time series, commonly called motifs, has received continuous and increasing attention from diverse scientific fields. Although numerous approaches have been proposed for motif discovery, they cannot discover the motifs in an exact and efficient manner. Furthermore, domain knowledge is required from the experts for those methods to predefine the pattern length, which is also quite objective. In addiction, it is very time-consuming to extract the exact motifs and sometimes the extracted motif has no specific meanings. Especially in the field of financial and hydrology, many studies are focused on whether there is a fixed pattern including trend information hidden in the data. To address the above problems, we proposed a framework to automatically discovery the trend motifs without predefining the length of patterns. It has four main steps, (1) singular spectrum analysis is first applied to removed noise; (2) segmentation by extracting extreme points is then employed to automatically obtain the unequal length of time series pattern; (3) symbolic aggregate approximation is introduced to discretize the data and transform them into string sequences; (4) finally, the trend motifs are selected by measuring their similarity. Experimental results on the real-world time-series datasets reveal that our framework fit well in different circumstances, indicating our proposed framework is effective for trend motif discovery.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 11th International Conference on Information Technology in Medicine and Education (ITME)

自引率

0.00%

发文量