Learning Data Dependency with Communication Cost

Proceedings of the Eighteenth ACM International Symposium on Mobile Ad Hoc Networking and Computing Pub Date : 2018-04-29 DOI:10.1145/3209582.3209600

Hyeryung Jang, Hyungseok Song, Yung Yi

{"title":"Learning Data Dependency with Communication Cost","authors":"Hyeryung Jang, Hyungseok Song, Yung Yi","doi":"10.1145/3209582.3209600","DOIUrl":null,"url":null,"abstract":"In this paper, we consider the problem of recovering a graph that represents the statistical data dependency among nodes for a set of data samples generated by nodes, which provides the basic structure to perform an inference task, such as MAP (maximum a posteriori). This problem is referred to as structure learning. When nodes are spatially separated in different locations, running an inference algorithm requires a non-negligible amount of message passing, incurring some communication cost. We inevitably have the trade-off between the accuracy of structure learning and the cost we need to pay to perform a given message-passing based inference task because the learnt edge structures of data dependency and physical connectivity graph are often highly different. In this paper, we formalize this trade-off in an optimization problem which outputs the data dependency graph that jointly considers learning accuracy and message-passing cost. We focus on a distributed MAP as the target inference task due to its popularity, and consider two different implementations, ASYNC-MAP and SYNC-MAP that have different message-passing mechanisms and thus different cost structures. In ASYNC-MAP, we propose a polynomial time learning algorithm that is optimal, motivated by the problem of finding a maximum weight spanning tree. In SYNC-MAP, we first prove that it is NP-hard and propose a greedy heuristic. For both implementations, we then quantify how the probability that the resulting data graphs from those learning algorithms differ from the ideal data graph decays as the number of data samples grows, using the large deviation principle, where the decaying rate is characterized by some topological structures of both original data dependency and physical connectivity graphs as well as the degree of the trade-off, which provides some guideline on how many samples are necessary to obtain a certain learning accuracy. We validate our theoretical findings through extensive simulations, which confirm that it has a good match.","PeriodicalId":375932,"journal":{"name":"Proceedings of the Eighteenth ACM International Symposium on Mobile Ad Hoc Networking and Computing","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Eighteenth ACM International Symposium on Mobile Ad Hoc Networking and Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3209582.3209600","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

In this paper, we consider the problem of recovering a graph that represents the statistical data dependency among nodes for a set of data samples generated by nodes, which provides the basic structure to perform an inference task, such as MAP (maximum a posteriori). This problem is referred to as structure learning. When nodes are spatially separated in different locations, running an inference algorithm requires a non-negligible amount of message passing, incurring some communication cost. We inevitably have the trade-off between the accuracy of structure learning and the cost we need to pay to perform a given message-passing based inference task because the learnt edge structures of data dependency and physical connectivity graph are often highly different. In this paper, we formalize this trade-off in an optimization problem which outputs the data dependency graph that jointly considers learning accuracy and message-passing cost. We focus on a distributed MAP as the target inference task due to its popularity, and consider two different implementations, ASYNC-MAP and SYNC-MAP that have different message-passing mechanisms and thus different cost structures. In ASYNC-MAP, we propose a polynomial time learning algorithm that is optimal, motivated by the problem of finding a maximum weight spanning tree. In SYNC-MAP, we first prove that it is NP-hard and propose a greedy heuristic. For both implementations, we then quantify how the probability that the resulting data graphs from those learning algorithms differ from the ideal data graph decays as the number of data samples grows, using the large deviation principle, where the decaying rate is characterized by some topological structures of both original data dependency and physical connectivity graphs as well as the degree of the trade-off, which provides some guideline on how many samples are necessary to obtain a certain learning accuracy. We validate our theoretical findings through extensive simulations, which confirm that it has a good match.

查看原文本刊更多论文

基于通信成本的数据依赖学习

在本文中，我们考虑了从节点生成的一组数据样本中恢复一个表示节点间统计数据依赖关系的图的问题，该图为执行推理任务提供了基本结构，如MAP (maximum a posteriori)。这个问题被称为结构学习。当节点在空间上分散在不同位置时，运行推理算法需要不可忽略的消息传递量，从而产生一定的通信成本。我们不可避免地要在结构学习的准确性和执行给定的基于消息传递的推理任务所需付出的代价之间进行权衡，因为学习到的数据依赖关系和物理连接图的边缘结构通常是非常不同的。在本文中，我们在一个优化问题中形式化了这种权衡，该优化问题输出了联合考虑学习精度和消息传递成本的数据依赖图。由于分布式MAP的流行，我们将重点放在作为目标推理任务的分布式MAP上，并考虑两种不同的实现，ASYNC-MAP和SYNC-MAP，它们具有不同的消息传递机制，因此具有不同的成本结构。在ASYNC-MAP中，我们提出了一个多项式时间学习算法，该算法是最优的，其动机是寻找最大权值生成树的问题。在SYNC-MAP中，我们首先证明了它是np困难的，并提出了一个贪心启发式算法。对于这两种实现，我们使用大偏差原理，量化这些学习算法产生的数据图与理想数据图不同的概率如何随着数据样本数量的增长而衰减，其中衰减率由原始数据依赖和物理连接图的一些拓扑结构以及权衡的程度来描述。这为获得一定的学习精度需要多少样本提供了一些指导。我们通过广泛的模拟验证了我们的理论发现，这证实了它有很好的匹配。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Eighteenth ACM International Symposium on Mobile Ad Hoc Networking and Computing

自引率

0.00%

发文量