基于可变小波变换和极坐标小波的近期偏置时间序列数据库相似性搜索

D. Devi, V. Maheswari, P. Thambidurai
{"title":"基于可变小波变换和极坐标小波的近期偏置时间序列数据库相似性搜索","authors":"D. Devi, V. Maheswari, P. Thambidurai","doi":"10.1109/INTERACT.2010.5706187","DOIUrl":null,"url":null,"abstract":"A time series database is a collection of data that are generated in series as time goes on and constitutes a large portion of data stored in computers like stock-price movements, weather data, bio-medical measurements, video data etc., Two time sequences of same length are said to be similar if the Euclidean distance is less or equal to a given threshold. The main issue of similarity search in time series databases is to improve the search performance since time sequences are usually of high dimension. So it is important to reduce the search space for efficient processing of similarity search in large time series databases. Popular techniques for efficient retrieval of time sequences in time series databases are DFT, DWT, SVD, PAA, PCA, APCA etc., In this paper we explore the feasibility of using Vari-DWT and Polar wavelet with a comprehensive analysis of the two methods as matching functions which can improve the search performance in Recent-Biased time series databases. Vari-DWT is fast to compute and requires little storage for each sequence, It preserves Euclidean distance and recency and also allows good approximation with a subset of coefficients. But it shows poor performance for locally distributed time series data which are clustered around certain values since it uses averages to reduce the dimensionality of data. Polar wavelet uses polar coordinates which are not affected from averages and so can improve the search performance especially in locally distributed time series databases. Moreover, DWT has the limitation that it works best if the length of time series 2n otherwise it approximates the signal by adding 0 to the right side of the series to make the length to be 2n which distorts the original signal. Polar wavelet works with time sequences of any length without distorting the original signal. The effectiveness of Vari-DWT and Polar wavelets are evaluated empirically on real weather data and synthetic datasets.","PeriodicalId":201931,"journal":{"name":"INTERACT-2010","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Similarity search in Recent Biased time series databases using Vari-DWT and Polar wavelets\",\"authors\":\"D. Devi, V. Maheswari, P. Thambidurai\",\"doi\":\"10.1109/INTERACT.2010.5706187\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A time series database is a collection of data that are generated in series as time goes on and constitutes a large portion of data stored in computers like stock-price movements, weather data, bio-medical measurements, video data etc., Two time sequences of same length are said to be similar if the Euclidean distance is less or equal to a given threshold. The main issue of similarity search in time series databases is to improve the search performance since time sequences are usually of high dimension. So it is important to reduce the search space for efficient processing of similarity search in large time series databases. Popular techniques for efficient retrieval of time sequences in time series databases are DFT, DWT, SVD, PAA, PCA, APCA etc., In this paper we explore the feasibility of using Vari-DWT and Polar wavelet with a comprehensive analysis of the two methods as matching functions which can improve the search performance in Recent-Biased time series databases. Vari-DWT is fast to compute and requires little storage for each sequence, It preserves Euclidean distance and recency and also allows good approximation with a subset of coefficients. But it shows poor performance for locally distributed time series data which are clustered around certain values since it uses averages to reduce the dimensionality of data. Polar wavelet uses polar coordinates which are not affected from averages and so can improve the search performance especially in locally distributed time series databases. Moreover, DWT has the limitation that it works best if the length of time series 2n otherwise it approximates the signal by adding 0 to the right side of the series to make the length to be 2n which distorts the original signal. Polar wavelet works with time sequences of any length without distorting the original signal. The effectiveness of Vari-DWT and Polar wavelets are evaluated empirically on real weather data and synthetic datasets.\",\"PeriodicalId\":201931,\"journal\":{\"name\":\"INTERACT-2010\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"INTERACT-2010\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INTERACT.2010.5706187\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"INTERACT-2010","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INTERACT.2010.5706187","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

时间序列数据库是随着时间的推移而产生的一系列数据的集合,它构成了存储在计算机中的大部分数据,如股票价格变动、天气数据、生物医学测量、视频数据等。如果欧几里得距离小于或等于给定的阈值,则两个相同长度的时间序列称为相似。由于时间序列通常是高维的,在时间序列数据库中进行相似度搜索的主要问题是提高搜索性能。因此,减少搜索空间是有效处理大型时间序列数据库中相似度搜索的重要方法。在时间序列数据库中高效检索时间序列的常用技术有DFT、DWT、SVD、PAA、PCA、APCA等,本文探讨了将变DWT和Polar小波作为匹配函数的可行性,并对两种方法进行了综合分析,从而提高了在最近偏差时间序列数据库中的搜索性能。可变dwt计算速度快,每个序列需要很少的存储空间,它保留了欧几里得距离和近似值,并且还允许使用系数子集进行良好的近似。但是对于局部分布的时间序列数据,由于它使用平均值来降低数据的维数,因此表现出较差的性能。极坐标小波使用不受平均值影响的极坐标,可以提高局部分布时间序列数据库的搜索性能。此外,DWT有一个限制,如果时间序列的长度为2n,它的效果最好,否则它会通过在序列的右侧添加0来近似信号,使长度为2n,从而扭曲原始信号。极性小波可以处理任意长度的时间序列而不会使原始信号失真。在实际天气数据和合成数据集上对变小波变换和极波变换的有效性进行了实证评价。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Similarity search in Recent Biased time series databases using Vari-DWT and Polar wavelets
A time series database is a collection of data that are generated in series as time goes on and constitutes a large portion of data stored in computers like stock-price movements, weather data, bio-medical measurements, video data etc., Two time sequences of same length are said to be similar if the Euclidean distance is less or equal to a given threshold. The main issue of similarity search in time series databases is to improve the search performance since time sequences are usually of high dimension. So it is important to reduce the search space for efficient processing of similarity search in large time series databases. Popular techniques for efficient retrieval of time sequences in time series databases are DFT, DWT, SVD, PAA, PCA, APCA etc., In this paper we explore the feasibility of using Vari-DWT and Polar wavelet with a comprehensive analysis of the two methods as matching functions which can improve the search performance in Recent-Biased time series databases. Vari-DWT is fast to compute and requires little storage for each sequence, It preserves Euclidean distance and recency and also allows good approximation with a subset of coefficients. But it shows poor performance for locally distributed time series data which are clustered around certain values since it uses averages to reduce the dimensionality of data. Polar wavelet uses polar coordinates which are not affected from averages and so can improve the search performance especially in locally distributed time series databases. Moreover, DWT has the limitation that it works best if the length of time series 2n otherwise it approximates the signal by adding 0 to the right side of the series to make the length to be 2n which distorts the original signal. Polar wavelet works with time sequences of any length without distorting the original signal. The effectiveness of Vari-DWT and Polar wavelets are evaluated empirically on real weather data and synthetic datasets.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信