Distributed Operator Placement and Data Caching in Large-Scale Sensor Networks

IEEE INFOCOM 2008 - The 27th Conference on Computer Communications Pub Date : 2008-04-13 DOI:10.1109/INFOCOM.2008.151

Lei Ying, Zhen Liu, D. Towsley, Cathy H. Xia

{"title":"Distributed Operator Placement and Data Caching in Large-Scale Sensor Networks","authors":"Lei Ying, Zhen Liu, D. Towsley, Cathy H. Xia","doi":"10.1109/INFOCOM.2008.151","DOIUrl":null,"url":null,"abstract":"Recent advances in computer technology and wireless communications have enabled the emergence of stream-based sensor networks. In such sensor networks, real-time data are generated by a large number of distributed sources. Queries are made that may require sophisticated processing and filtering of the data. A query is represented by a query graph. In order to reduce the data transmission and to better utilize resources, it is desirable to place operators of the query graph inside the network, and thus to perform in-network processing. Moreover, given that various queries occur with different frequencies and that only a subset of sensor data may actually be queried, caching intermediate data objects inside the network can help improve query efficiency. In this paper, we consider the problem of placing both operators and intermediate data objects inside the network for a set of queries so as to minimize the total cost of storage, computation, and data transmission. We propose distributed algorithms that achieve optimal solutions for tree-structured query graph topologies and general network topologies. The algorithms converge in Lmax(.HQ + 1) iterations, where Lmax is the order of the diameter of the sensor network, and Hq represents the depth of the query graph, defined as the maximum number of operations needed for a raw data to become a final data. For a regular grid network and complete binary tree query graph, the complexity is 0(radic(N)log2 M), where N is the number of nodes in the sensor network and M is the number of data objects in a query graph. The most attractive features of these algorithms are that they require only information exchanges between neighbors, can be executed asynchronously, are adaptive to cost change and topology change, and are resilient to node or link failures.","PeriodicalId":447520,"journal":{"name":"IEEE INFOCOM 2008 - The 27th Conference on Computer Communications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"62","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE INFOCOM 2008 - The 27th Conference on Computer Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFOCOM.2008.151","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 62

Abstract

Recent advances in computer technology and wireless communications have enabled the emergence of stream-based sensor networks. In such sensor networks, real-time data are generated by a large number of distributed sources. Queries are made that may require sophisticated processing and filtering of the data. A query is represented by a query graph. In order to reduce the data transmission and to better utilize resources, it is desirable to place operators of the query graph inside the network, and thus to perform in-network processing. Moreover, given that various queries occur with different frequencies and that only a subset of sensor data may actually be queried, caching intermediate data objects inside the network can help improve query efficiency. In this paper, we consider the problem of placing both operators and intermediate data objects inside the network for a set of queries so as to minimize the total cost of storage, computation, and data transmission. We propose distributed algorithms that achieve optimal solutions for tree-structured query graph topologies and general network topologies. The algorithms converge in Lmax(.HQ + 1) iterations, where Lmax is the order of the diameter of the sensor network, and Hq represents the depth of the query graph, defined as the maximum number of operations needed for a raw data to become a final data. For a regular grid network and complete binary tree query graph, the complexity is 0(radic(N)log2 M), where N is the number of nodes in the sensor network and M is the number of data objects in a query graph. The most attractive features of these algorithms are that they require only information exchanges between neighbors, can be executed asynchronously, are adaptive to cost change and topology change, and are resilient to node or link failures.

查看原文本刊更多论文

大规模传感器网络中的分布式算子配置和数据缓存

计算机技术和无线通信的最新进展使基于流的传感器网络得以出现。在这种传感器网络中，实时数据是由大量分布的数据源产生的。查询可能需要对数据进行复杂的处理和过滤。查询由查询图表示。为了减少数据传输，更好地利用资源，需要将查询图的运算符放置在网络中，从而进行网络内处理。此外，考虑到各种查询以不同的频率发生，并且实际上可能只查询传感器数据的一个子集，在网络中缓存中间数据对象可以帮助提高查询效率。在本文中，我们考虑在网络中为一组查询放置操作符和中间数据对象的问题，以最小化存储，计算和数据传输的总成本。我们提出了分布式算法，以实现树结构查询图拓扑和一般网络拓扑的最优解。算法收敛于Lmax()。HQ + 1)次迭代，其中Lmax为传感器网络直径的阶数，HQ为查询图的深度，定义为原始数据变为最终数据所需的最大操作次数。对于规则网格网络和完全二叉树查询图，复杂度为0(radic(N) log2m)，其中N为传感器网络的节点数，M为查询图中数据对象的数量。这些算法最吸引人的特点是它们只需要邻居之间的信息交换，可以异步执行，适应成本变化和拓扑变化，并且对节点或链路故障具有弹性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE INFOCOM 2008 - The 27th Conference on Computer Communications

自引率

0.00%

发文量