{"title":"Time series similarity search based on Middle points and Clipping","authors":"Thanh Son Nguyen, T. Duong","doi":"10.1109/DMO.2011.5976498","DOIUrl":null,"url":null,"abstract":"In this paper, we introduce a new time series dimensionality reduction method, MP_C (Middle points and Clipping). This method is performed by dividing time series into segments, some points in each segment being extracted and then these points are transformed into a sequence of bits. In our method, we choose the points in each segment by dividing a segment into sub-segments and the middle points of these sub-segments are selected. We can prove that MP_C satisfies the lower bounding condition and make MP_C indexable by showing that a time series compressed by MP_C can be indexed with the support of Skyline index. Our experiments show that our MP_C method is better than PAA in terms of tightness of lower bound and pruning power, and in similarity search, MP_C with the support of Skyline index performs faster than PAA based on traditional R*-tree.","PeriodicalId":436393,"journal":{"name":"2011 3rd Conference on Data Mining and Optimization (DMO)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 3rd Conference on Data Mining and Optimization (DMO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DMO.2011.5976498","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
In this paper, we introduce a new time series dimensionality reduction method, MP_C (Middle points and Clipping). This method is performed by dividing time series into segments, some points in each segment being extracted and then these points are transformed into a sequence of bits. In our method, we choose the points in each segment by dividing a segment into sub-segments and the middle points of these sub-segments are selected. We can prove that MP_C satisfies the lower bounding condition and make MP_C indexable by showing that a time series compressed by MP_C can be indexed with the support of Skyline index. Our experiments show that our MP_C method is better than PAA in terms of tightness of lower bound and pruning power, and in similarity search, MP_C with the support of Skyline index performs faster than PAA based on traditional R*-tree.
本文提出了一种新的时间序列降维方法MP_C (Middle points and Clipping)。该方法通过将时间序列分割成段,在每段中提取一些点,然后将这些点转换成比特序列来实现。在我们的方法中,我们通过将一个段划分为子段来选择每个段中的点,并选择这些子段的中间点。通过证明MP_C压缩后的时间序列在Skyline索引的支持下可以被索引,可以证明MP_C满足下边界条件,并使MP_C可被索引。实验表明,MP_C方法在下界紧密度和剪枝能力方面优于PAA方法,在相似性搜索方面,支持Skyline索引的MP_C方法比基于传统R*树的PAA方法更快。