Detecting trend motifs: an efficient framework for time series motif discovery

Xiang Chen, Zongwen Fan, Jin Gou
{"title":"Detecting trend motifs: an efficient framework for time series motif discovery","authors":"Xiang Chen, Zongwen Fan, Jin Gou","doi":"10.1109/ITME53901.2021.00035","DOIUrl":null,"url":null,"abstract":"The task of finding similar patterns in a long time series, commonly called motifs, has received continuous and increasing attention from diverse scientific fields. Although numerous approaches have been proposed for motif discovery, they cannot discover the motifs in an exact and efficient manner. Furthermore, domain knowledge is required from the experts for those methods to predefine the pattern length, which is also quite objective. In addiction, it is very time-consuming to extract the exact motifs and sometimes the extracted motif has no specific meanings. Especially in the field of financial and hydrology, many studies are focused on whether there is a fixed pattern including trend information hidden in the data. To address the above problems, we proposed a framework to automatically discovery the trend motifs without predefining the length of patterns. It has four main steps, (1) singular spectrum analysis is first applied to removed noise; (2) segmentation by extracting extreme points is then employed to automatically obtain the unequal length of time series pattern; (3) symbolic aggregate approximation is introduced to discretize the data and transform them into string sequences; (4) finally, the trend motifs are selected by measuring their similarity. Experimental results on the real-world time-series datasets reveal that our framework fit well in different circumstances, indicating our proposed framework is effective for trend motif discovery.","PeriodicalId":6774,"journal":{"name":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","volume":"26 1","pages":"122-126"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITME53901.2021.00035","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The task of finding similar patterns in a long time series, commonly called motifs, has received continuous and increasing attention from diverse scientific fields. Although numerous approaches have been proposed for motif discovery, they cannot discover the motifs in an exact and efficient manner. Furthermore, domain knowledge is required from the experts for those methods to predefine the pattern length, which is also quite objective. In addiction, it is very time-consuming to extract the exact motifs and sometimes the extracted motif has no specific meanings. Especially in the field of financial and hydrology, many studies are focused on whether there is a fixed pattern including trend information hidden in the data. To address the above problems, we proposed a framework to automatically discovery the trend motifs without predefining the length of patterns. It has four main steps, (1) singular spectrum analysis is first applied to removed noise; (2) segmentation by extracting extreme points is then employed to automatically obtain the unequal length of time series pattern; (3) symbolic aggregate approximation is introduced to discretize the data and transform them into string sequences; (4) finally, the trend motifs are selected by measuring their similarity. Experimental results on the real-world time-series datasets reveal that our framework fit well in different circumstances, indicating our proposed framework is effective for trend motif discovery.
趋势基序检测:时间序列基序发现的有效框架
在长时间序列中寻找相似模式的任务,通常被称为基序,已经受到了各个科学领域不断增加的关注。虽然人们提出了许多发现母题的方法,但它们都不能准确有效地发现母题。此外,这些方法需要专家的领域知识来预先定义模式长度,这也是相当客观的。在成瘾中,提取准确的母题是非常耗时的,有时提取的母题没有特定的意义。特别是在金融和水文领域,许多研究都集中在数据中是否存在包含趋势信息的固定模式。为了解决上述问题,我们提出了一个无需预先定义模式长度即可自动发现趋势主题的框架。主要分为四个步骤:(1)首先应用奇异谱分析去除噪声;(2)提取极值点分割,自动获取不等长时间序列模式;(3)引入符号聚合近似对数据进行离散化,并将其转化为字符串序列;(4)最后,通过相似性度量选择趋势母题。在真实时间序列数据集上的实验结果表明,我们的框架在不同情况下都能很好地适应,表明我们提出的框架对趋势基序发现是有效的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信