Segmented Representation of Time Series Based on Key Morphological Points

Lei Tu, Hongxin Xu, Yide Di
{"title":"Segmented Representation of Time Series Based on Key Morphological Points","authors":"Lei Tu, Hongxin Xu, Yide Di","doi":"10.1145/3424978.3425099","DOIUrl":null,"url":null,"abstract":"Time series data has the characteristics of high dimension and high noise, data mining directly from the original series is not only time-consuming but also inefficient. Therefore, it is necessary to preprocess the original time series. Among various compression methods, PLR based on feature points has been widely studied because of its simplicity, high efficiency, and easy interpretation. However, most of the feature-based linear representation methods have the problem of low accuracy of the selected segmentation points. Based on the idea of traditional important extreme point extraction method, this paper proposes a time series segment representation algorithm based on key morphological points. The algorithm extracts the key extremum and key stationary value, which can greatly reduce the noise and retain the key points. Compared with IP, SEEP, KP and IEP algorithms, the proposed algorithm is tested in 9 different fields of UCR. The results show that under 98% compression rate, the algorithm has the least fitting error on seven groups of data, and second only to the best case in one group of data, so the algorithm can better fit the original sequence at high compression rate; In addition, the DTW distance calculation results on 9 types of data show that the algorithm can correctly classify 7 types of data, it has the best classification accuracy. The comparison results of difference value of the similarity of different classes show that it has better robustness and can be easily applied to the subsequent application research of the sequence.","PeriodicalId":178822,"journal":{"name":"Proceedings of the 4th International Conference on Computer Science and Application Engineering","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Computer Science and Application Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3424978.3425099","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Time series data has the characteristics of high dimension and high noise, data mining directly from the original series is not only time-consuming but also inefficient. Therefore, it is necessary to preprocess the original time series. Among various compression methods, PLR based on feature points has been widely studied because of its simplicity, high efficiency, and easy interpretation. However, most of the feature-based linear representation methods have the problem of low accuracy of the selected segmentation points. Based on the idea of traditional important extreme point extraction method, this paper proposes a time series segment representation algorithm based on key morphological points. The algorithm extracts the key extremum and key stationary value, which can greatly reduce the noise and retain the key points. Compared with IP, SEEP, KP and IEP algorithms, the proposed algorithm is tested in 9 different fields of UCR. The results show that under 98% compression rate, the algorithm has the least fitting error on seven groups of data, and second only to the best case in one group of data, so the algorithm can better fit the original sequence at high compression rate; In addition, the DTW distance calculation results on 9 types of data show that the algorithm can correctly classify 7 types of data, it has the best classification accuracy. The comparison results of difference value of the similarity of different classes show that it has better robustness and can be easily applied to the subsequent application research of the sequence.
基于关键形态点的时间序列分段表示
时间序列数据具有高维和高噪声的特点,直接从原始序列中挖掘数据不仅耗时而且效率低下。因此,有必要对原始时间序列进行预处理。在各种压缩方法中,基于特征点的PLR以其简单、高效、易解释等优点得到了广泛的研究。然而,大多数基于特征的线性表示方法存在分割点选择精度低的问题。在传统重要极值点提取方法思想的基础上,提出了一种基于关键形态点的时间序列片段表示算法。该算法提取关键极值和关键平稳值,可以大大降低噪声并保留关键点。与IP、SEEP、KP和IEP算法进行比较,在UCR的9个不同领域进行了测试。结果表明:在98%压缩率下,该算法对7组数据的拟合误差最小,仅次于1组数据的最佳情况,表明该算法在高压缩率下能较好地拟合原始序列;此外,对9类数据的DTW距离计算结果表明,该算法可以正确分类7类数据,具有最佳的分类精度。不同类别相似性差值的比较结果表明,该方法具有较好的鲁棒性,易于应用于序列的后续应用研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信