基于时间的滑动窗口数据流极端聚合的空间缩减

Weilong Ding, Yanbo Han, Jing Wang, Zhuofeng Zhao
{"title":"基于时间的滑动窗口数据流极端聚合的空间缩减","authors":"Weilong Ding, Yanbo Han, Jing Wang, Zhuofeng Zhao","doi":"10.1109/CLOUD.2012.80","DOIUrl":null,"url":null,"abstract":"Data process in Cloud or IoT (Internet of Things) sometimes implies continuous real-time queries as data streams. In order to acquire extreme value of data stream over time-based sliding window, traditional approaches computed the exact solution through vast space especially under ultra circumstances like high-rate or high-concurrency. In this paper, we design space-bounded synopsis data structure and extreme aggregation algorithm to get approximate solution by finite extreme candidates over time sliding window, whose validity can be theoretically guaranteed. Comprehensive experiments over synthetic and real data set are designed to analyze the tradeoff between accuracy and overhead, which also illustrate the efficiency.","PeriodicalId":214084,"journal":{"name":"2012 IEEE Fifth International Conference on Cloud Computing","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Space Reduction for Extreme Aggregation of Data Stream over Time-Based Sliding Window\",\"authors\":\"Weilong Ding, Yanbo Han, Jing Wang, Zhuofeng Zhao\",\"doi\":\"10.1109/CLOUD.2012.80\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data process in Cloud or IoT (Internet of Things) sometimes implies continuous real-time queries as data streams. In order to acquire extreme value of data stream over time-based sliding window, traditional approaches computed the exact solution through vast space especially under ultra circumstances like high-rate or high-concurrency. In this paper, we design space-bounded synopsis data structure and extreme aggregation algorithm to get approximate solution by finite extreme candidates over time sliding window, whose validity can be theoretically guaranteed. Comprehensive experiments over synthetic and real data set are designed to analyze the tradeoff between accuracy and overhead, which also illustrate the efficiency.\",\"PeriodicalId\":214084,\"journal\":{\"name\":\"2012 IEEE Fifth International Conference on Cloud Computing\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE Fifth International Conference on Cloud Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CLOUD.2012.80\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE Fifth International Conference on Cloud Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CLOUD.2012.80","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

云或物联网中的数据处理有时意味着作为数据流的连续实时查询。传统方法为了在基于时间的滑动窗口中获取数据流的极值,特别是在高速率或高并发等超环境下,需要通过大空间计算精确解。本文设计了空间有界的概要数据结构和极值聚合算法,通过有限的极值候选值随时间滑动窗口得到近似解,从理论上保证了其有效性。在合成数据集和真实数据集上设计了综合实验,分析了精度和开销之间的权衡,并说明了效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Space Reduction for Extreme Aggregation of Data Stream over Time-Based Sliding Window
Data process in Cloud or IoT (Internet of Things) sometimes implies continuous real-time queries as data streams. In order to acquire extreme value of data stream over time-based sliding window, traditional approaches computed the exact solution through vast space especially under ultra circumstances like high-rate or high-concurrency. In this paper, we design space-bounded synopsis data structure and extreme aggregation algorithm to get approximate solution by finite extreme candidates over time sliding window, whose validity can be theoretically guaranteed. Comprehensive experiments over synthetic and real data set are designed to analyze the tradeoff between accuracy and overhead, which also illustrate the efficiency.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信