{"title":"基于时间的滑动窗口数据流极端聚合的空间缩减","authors":"Weilong Ding, Yanbo Han, Jing Wang, Zhuofeng Zhao","doi":"10.1109/CLOUD.2012.80","DOIUrl":null,"url":null,"abstract":"Data process in Cloud or IoT (Internet of Things) sometimes implies continuous real-time queries as data streams. In order to acquire extreme value of data stream over time-based sliding window, traditional approaches computed the exact solution through vast space especially under ultra circumstances like high-rate or high-concurrency. In this paper, we design space-bounded synopsis data structure and extreme aggregation algorithm to get approximate solution by finite extreme candidates over time sliding window, whose validity can be theoretically guaranteed. Comprehensive experiments over synthetic and real data set are designed to analyze the tradeoff between accuracy and overhead, which also illustrate the efficiency.","PeriodicalId":214084,"journal":{"name":"2012 IEEE Fifth International Conference on Cloud Computing","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Space Reduction for Extreme Aggregation of Data Stream over Time-Based Sliding Window\",\"authors\":\"Weilong Ding, Yanbo Han, Jing Wang, Zhuofeng Zhao\",\"doi\":\"10.1109/CLOUD.2012.80\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data process in Cloud or IoT (Internet of Things) sometimes implies continuous real-time queries as data streams. In order to acquire extreme value of data stream over time-based sliding window, traditional approaches computed the exact solution through vast space especially under ultra circumstances like high-rate or high-concurrency. In this paper, we design space-bounded synopsis data structure and extreme aggregation algorithm to get approximate solution by finite extreme candidates over time sliding window, whose validity can be theoretically guaranteed. Comprehensive experiments over synthetic and real data set are designed to analyze the tradeoff between accuracy and overhead, which also illustrate the efficiency.\",\"PeriodicalId\":214084,\"journal\":{\"name\":\"2012 IEEE Fifth International Conference on Cloud Computing\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE Fifth International Conference on Cloud Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CLOUD.2012.80\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE Fifth International Conference on Cloud Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CLOUD.2012.80","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Space Reduction for Extreme Aggregation of Data Stream over Time-Based Sliding Window
Data process in Cloud or IoT (Internet of Things) sometimes implies continuous real-time queries as data streams. In order to acquire extreme value of data stream over time-based sliding window, traditional approaches computed the exact solution through vast space especially under ultra circumstances like high-rate or high-concurrency. In this paper, we design space-bounded synopsis data structure and extreme aggregation algorithm to get approximate solution by finite extreme candidates over time sliding window, whose validity can be theoretically guaranteed. Comprehensive experiments over synthetic and real data set are designed to analyze the tradeoff between accuracy and overhead, which also illustrate the efficiency.