Estimating Diffusion Network Structures: Recovery Conditions, Sample Complexity & Soft-thresholding Algorithm.

JMLR workshop and conference proceedings Pub Date : 2014-06-01

Hadi Daneshmand, Manuel Gomez-Rodriguez, Le Song, Bernhard Schölkopf

{"title":"Estimating Diffusion Network Structures: Recovery Conditions, Sample Complexity & Soft-thresholding Algorithm.","authors":"Hadi Daneshmand, Manuel Gomez-Rodriguez, Le Song, Bernhard Schölkopf","doi":"","DOIUrl":null,"url":null,"abstract":"Information spreads across social and technological networks, but often the network structures are hidden from us and we only observe the traces left by the diffusion processes, called cascades. Can we recover the hidden network structures from these observed cascades? What kind of cascades and how many cascades do we need? Are there some network structures which are more difficult than others to recover? Can we design efficient inference algorithms with provable guarantees? Despite the increasing availability of cascade-data and methods for inferring networks from these data, a thorough theoretical understanding of the above questions remains largely unexplored in the literature. In this paper, we investigate the network structure inference problem for a general family of continuous-time diffusion models using an [Formula: see text]-regularized likelihood maximization framework. We show that, as long as the cascade sampling process satisfies a natural incoherence condition, our framework can recover the correct network structure with high probability if we observe O(d3 log N) cascades, where d is the maximum number of parents of a node and N is the total number of nodes. Moreover, we develop a simple and efficient soft-thresholding inference algorithm, which we use to illustrate the consequences of our theoretical results, and show that our framework outperforms other alternatives in practice.","PeriodicalId":89793,"journal":{"name":"JMLR workshop and conference proceedings","volume":"32 2","pages":"793-801"},"PeriodicalIF":0.0000,"publicationDate":"2014-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4412853/pdf/nihms-680553.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMLR workshop and conference proceedings","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Information spreads across social and technological networks, but often the network structures are hidden from us and we only observe the traces left by the diffusion processes, called cascades. Can we recover the hidden network structures from these observed cascades? What kind of cascades and how many cascades do we need? Are there some network structures which are more difficult than others to recover? Can we design efficient inference algorithms with provable guarantees? Despite the increasing availability of cascade-data and methods for inferring networks from these data, a thorough theoretical understanding of the above questions remains largely unexplored in the literature. In this paper, we investigate the network structure inference problem for a general family of continuous-time diffusion models using an [Formula: see text]-regularized likelihood maximization framework. We show that, as long as the cascade sampling process satisfies a natural incoherence condition, our framework can recover the correct network structure with high probability if we observe O(d³ log N) cascades, where d is the maximum number of parents of a node and N is the total number of nodes. Moreover, we develop a simple and efficient soft-thresholding inference algorithm, which we use to illustrate the consequences of our theoretical results, and show that our framework outperforms other alternatives in practice.

Abstract Image

本刊更多论文

估计扩散网络结构:恢复条件，样本复杂度和软阈值算法。

信息在社会和技术网络中传播，但网络结构往往对我们是隐藏的，我们只能观察到扩散过程留下的痕迹，称为级联。我们能从这些观察到的级联中恢复隐藏的网络结构吗?我们需要什么样的级联，需要多少级联?是否存在比其他网络结构更难恢复的网络结构?我们能否设计出具有可证明保证的高效推理算法?尽管级联数据的可用性和从这些数据推断网络的方法越来越多，但对上述问题的全面理论理解在文献中仍未得到充分探讨。本文利用正则化似然最大化框架研究了一类连续时间扩散模型的网络结构推理问题。我们证明，只要级联采样过程满足自然的非相干条件，如果观察O(d3 log N)级联，我们的框架可以高概率地恢复正确的网络结构，其中d为节点的最大父节点数，N为节点总数。此外，我们开发了一个简单而有效的软阈值推理算法，我们用它来说明我们的理论结果的后果，并表明我们的框架在实践中优于其他替代方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

JMLR workshop and conference proceedings

自引率

0.00%

发文量