{"title":"Diversified set monitoring over distributed data streams","authors":"Daichi Amagata, T. Hara","doi":"10.1145/2933267.2933298","DOIUrl":null,"url":null,"abstract":"Data monitoring over distributed streams is a fundamental problem, as represented by modern applications, e.g., sensor network and financial data monitoring. Such applications need a technique which continuously monitors user-requiring data and achieves not only time and space efficiencies but also communication efficiency. In addition, result diversification is also required to increase user satisfaction, thus has been receiving significant attention recently. This motivates us to consider a problem of monitoring k-diverse data over distributed streams. Result diversification is well known to be NP-hard, so the natures of NP-hardness and dynamic distributed data bring non-trivial challenges, e.g., impracticably of centralized approaches. In this paper, we propose a novel algorithm that monitors k-diverse data with time, space, and communication efficiencies. The results of our experiments using both real and synthetic data confirm the effectiveness of our algorithm.","PeriodicalId":277061,"journal":{"name":"Proceedings of the 10th ACM International Conference on Distributed and Event-based Systems","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 10th ACM International Conference on Distributed and Event-based Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2933267.2933298","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
Data monitoring over distributed streams is a fundamental problem, as represented by modern applications, e.g., sensor network and financial data monitoring. Such applications need a technique which continuously monitors user-requiring data and achieves not only time and space efficiencies but also communication efficiency. In addition, result diversification is also required to increase user satisfaction, thus has been receiving significant attention recently. This motivates us to consider a problem of monitoring k-diverse data over distributed streams. Result diversification is well known to be NP-hard, so the natures of NP-hardness and dynamic distributed data bring non-trivial challenges, e.g., impracticably of centralized approaches. In this paper, we propose a novel algorithm that monitors k-diverse data with time, space, and communication efficiencies. The results of our experiments using both real and synthetic data confirm the effectiveness of our algorithm.