Dual-Side Auto-Encoder for High-Dimensional Time Series Segmentation

Yue Bai, Lichen Wang, Yunyu Liu, Yu Yin, Y. Fu
{"title":"Dual-Side Auto-Encoder for High-Dimensional Time Series Segmentation","authors":"Yue Bai, Lichen Wang, Yunyu Liu, Yu Yin, Y. Fu","doi":"10.1109/ICDM50108.2020.00102","DOIUrl":null,"url":null,"abstract":"High-dimensional time series segmentation aims to segment a long temporal sequence into several short and meaningful subsequences. The high-dimensionality makes it challenging due to the complicated correlations among the sequential features. A large number of labeled data is required in existing supervised methods, and unsupervised methods mainly deploy clustering approaches, which are sensitive to outliers and hard to guarantee high performance. Also, most existing methods mainly rely on hand-craft features to deal with regular time series segmentation and achieve promising results. However, these approaches cannot effectively handle high-dimensional time series and will result in a high computational cost. In our work, we propose a novel unsupervised representation learning framework called Dual-Side Auto-Encoder (DSAE). It mainly focuses on high-dimensional time series segmentation by effectively capturing the temporal correlative patterns. Specifically, a single-to-multiple auto-encoder is designed to capture local sequential information. Besides, a long-shot distance encoding strategy is proposed. It aims to explicitly guide the learning process to obtain distinctive representations for segmentation. Furthermore, the long-short distance strategy is also executed in the decoded feature space, which implicitly directs the representation learning. Substantial experiments on six datasets illustrate the model effectiveness11Code will be released at https://github.com/yueb17/HTSS.","PeriodicalId":202149,"journal":{"name":"2020 IEEE International Conference on Data Mining (ICDM)","volume":"299 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Data Mining (ICDM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDM50108.2020.00102","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

High-dimensional time series segmentation aims to segment a long temporal sequence into several short and meaningful subsequences. The high-dimensionality makes it challenging due to the complicated correlations among the sequential features. A large number of labeled data is required in existing supervised methods, and unsupervised methods mainly deploy clustering approaches, which are sensitive to outliers and hard to guarantee high performance. Also, most existing methods mainly rely on hand-craft features to deal with regular time series segmentation and achieve promising results. However, these approaches cannot effectively handle high-dimensional time series and will result in a high computational cost. In our work, we propose a novel unsupervised representation learning framework called Dual-Side Auto-Encoder (DSAE). It mainly focuses on high-dimensional time series segmentation by effectively capturing the temporal correlative patterns. Specifically, a single-to-multiple auto-encoder is designed to capture local sequential information. Besides, a long-shot distance encoding strategy is proposed. It aims to explicitly guide the learning process to obtain distinctive representations for segmentation. Furthermore, the long-short distance strategy is also executed in the decoded feature space, which implicitly directs the representation learning. Substantial experiments on six datasets illustrate the model effectiveness11Code will be released at https://github.com/yueb17/HTSS.
用于高维时间序列分割的双面自编码器
高维时间序列分割的目的是将一个长时间序列分割成几个短而有意义的子序列。由于序列特征之间存在复杂的相关性,因此高维特征分析具有挑战性。现有的有监督方法需要大量标记数据,无监督方法主要采用聚类方法,对异常值敏感,难以保证高性能。此外,现有的方法主要依靠手工特征来处理规则时间序列分割,并取得了令人满意的效果。然而,这些方法不能有效地处理高维时间序列,并且计算成本高。在我们的工作中,我们提出了一种新的无监督表示学习框架,称为双侧自动编码器(DSAE)。它主要通过有效捕获时间相关模式来实现高维时间序列分割。具体来说,单对多自编码器被设计用于捕获本地顺序信息。此外,还提出了一种长镜头距离编码策略。它旨在明确地指导学习过程,以获得独特的分割表示。此外,在解码的特征空间中还执行了长短距离策略,这隐含地指导了表征学习。在六个数据集上的大量实验证明了模型的有效性11code将在https://github.com/yueb17/HTSS上发布。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信