确定性线性阈值模型下社会网络影响最大化问题的逼近与不逼近

Zaixin Lu, Wei Zhang, Weili Wu, B. Fu, D. Du
{"title":"确定性线性阈值模型下社会网络影响最大化问题的逼近与不逼近","authors":"Zaixin Lu, Wei Zhang, Weili Wu, B. Fu, D. Du","doi":"10.1109/ICDCSW.2011.33","DOIUrl":null,"url":null,"abstract":"Influence Maximization is the problem of finding a certain amount of people in a social network such that their aggregation influence through the network is maximized. In the past this problem has been widely studied under a number of different models. In 2003, Kempe \\emph{et al.} gave a $(1-{1 \\over e})$-approximation algorithm for the \\emph{linear threshold model} and the \\emph{independent cascade model}, which are the two main models in the social network analysis. In addition, Chen \\emph{et al.} proved that the problem of exactly computing the influence given a seed set in the two models is $\\#$P-hard. Both the \\emph{linear threshold model} and the \\emph{independent cascade model} are based on randomized propagation. However such information might be obtained by surveys or data mining techniques, which makes great difference on the properties of the problem. In this paper, we study the Influence Maximization problem in the \\emph{deterministic linear threshold model}. As a contrast, we show that in the \\emph{deterministic linear threshold model}, there is no polynomial time $n^{1-\\epsilon}$-approximation unless P=NP even at the simple case that one person needs at most two active neighbors to become active. This inapproximability result is derived with self-contained proofs without using PCP theorem. In the case that a person can be activated when one of its neighbors become active, there is a polynomial time ${e\\over e-1}$-approximation, and we prove it is the best possible approximation under a reasonable assumption in the complexity theory, $NP \\not\\subset DTIME(n^{\\log\\log n})$. We also show that the exact computation of the final influence given a seed set can be solved in linear time in the \\emph{deterministic linear threshold model}. The Least Seed Set problem, which aims to find a seed set with least number of people to activate all the required people in a given social network, is discussed. Using an analysis framework based on Set Cover, we show a $O($log$n)$-approximation in the case that a people become active when one of its neighbors is activated.","PeriodicalId":133514,"journal":{"name":"2011 31st International Conference on Distributed Computing Systems Workshops","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":"{\"title\":\"Approximation and Inapproximation for the Influence Maximization Problem in Social Networks under Deterministic Linear Threshold Model\",\"authors\":\"Zaixin Lu, Wei Zhang, Weili Wu, B. Fu, D. Du\",\"doi\":\"10.1109/ICDCSW.2011.33\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Influence Maximization is the problem of finding a certain amount of people in a social network such that their aggregation influence through the network is maximized. In the past this problem has been widely studied under a number of different models. In 2003, Kempe \\\\emph{et al.} gave a $(1-{1 \\\\over e})$-approximation algorithm for the \\\\emph{linear threshold model} and the \\\\emph{independent cascade model}, which are the two main models in the social network analysis. In addition, Chen \\\\emph{et al.} proved that the problem of exactly computing the influence given a seed set in the two models is $\\\\#$P-hard. Both the \\\\emph{linear threshold model} and the \\\\emph{independent cascade model} are based on randomized propagation. However such information might be obtained by surveys or data mining techniques, which makes great difference on the properties of the problem. In this paper, we study the Influence Maximization problem in the \\\\emph{deterministic linear threshold model}. As a contrast, we show that in the \\\\emph{deterministic linear threshold model}, there is no polynomial time $n^{1-\\\\epsilon}$-approximation unless P=NP even at the simple case that one person needs at most two active neighbors to become active. This inapproximability result is derived with self-contained proofs without using PCP theorem. In the case that a person can be activated when one of its neighbors become active, there is a polynomial time ${e\\\\over e-1}$-approximation, and we prove it is the best possible approximation under a reasonable assumption in the complexity theory, $NP \\\\not\\\\subset DTIME(n^{\\\\log\\\\log n})$. We also show that the exact computation of the final influence given a seed set can be solved in linear time in the \\\\emph{deterministic linear threshold model}. The Least Seed Set problem, which aims to find a seed set with least number of people to activate all the required people in a given social network, is discussed. Using an analysis framework based on Set Cover, we show a $O($log$n)$-approximation in the case that a people become active when one of its neighbors is activated.\",\"PeriodicalId\":133514,\"journal\":{\"name\":\"2011 31st International Conference on Distributed Computing Systems Workshops\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"25\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 31st International Conference on Distributed Computing Systems Workshops\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDCSW.2011.33\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 31st International Conference on Distributed Computing Systems Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDCSW.2011.33","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 25

摘要

影响力最大化是指在一个社交网络中找到一定数量的人,使他们通过该网络的聚合影响力最大化。在过去,这个问题已经在许多不同的模型下进行了广泛的研究。2003年,Kempe\emph{等人}对社会网络分析中的两种主要模型——\emph{线性阈值模型}和\emph{独立级联模型}给出了$(1-{1 \over e})$ -逼近算法。此外,Chen\emph{等人}证明了在两种模型中精确计算给定种子集的影响的问题是$\#$ P-hard。\emph{线性阈值模型}和\emph{独立级联模型}都是基于随机传播的。然而,这些信息可以通过调查或数据挖掘技术获得,这对问题的性质有很大的不同。本文研究了\emph{确定性线性阈值模型}中的影响最大化问题。作为对比,我们表明,在\emph{确定性线性阈值模型}中,即使在一个人最多需要两个活动邻居才能变得活跃的简单情况下,除非P=NP,否则不存在多项式时间$n^{1-\epsilon}$ -近似。在不使用PCP定理的情况下,用自包含证明得到了这个不逼近结果。在这种情况下,当一个人的邻居被激活时,有一个多项式时间${e\over e-1}$ -近似,我们证明它是在复杂性理论中一个合理的假设下的最佳可能近似,$NP \not\subset DTIME(n^{\log\log n})$。我们还证明了在确定性线性\emph{阈值模型中,给定种子集的最终影响的精确计算可以在线性}时间内解决。讨论了最小种子集问题,该问题的目的是在给定的社会网络中找到一个人数最少的种子集来激活所有需要的人。使用基于Set Cover的分析框架,我们展示了一个$O($ log $n)$ -近似的情况,即当一个人的邻居被激活时,他就会变得活跃。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Approximation and Inapproximation for the Influence Maximization Problem in Social Networks under Deterministic Linear Threshold Model
Influence Maximization is the problem of finding a certain amount of people in a social network such that their aggregation influence through the network is maximized. In the past this problem has been widely studied under a number of different models. In 2003, Kempe \emph{et al.} gave a $(1-{1 \over e})$-approximation algorithm for the \emph{linear threshold model} and the \emph{independent cascade model}, which are the two main models in the social network analysis. In addition, Chen \emph{et al.} proved that the problem of exactly computing the influence given a seed set in the two models is $\#$P-hard. Both the \emph{linear threshold model} and the \emph{independent cascade model} are based on randomized propagation. However such information might be obtained by surveys or data mining techniques, which makes great difference on the properties of the problem. In this paper, we study the Influence Maximization problem in the \emph{deterministic linear threshold model}. As a contrast, we show that in the \emph{deterministic linear threshold model}, there is no polynomial time $n^{1-\epsilon}$-approximation unless P=NP even at the simple case that one person needs at most two active neighbors to become active. This inapproximability result is derived with self-contained proofs without using PCP theorem. In the case that a person can be activated when one of its neighbors become active, there is a polynomial time ${e\over e-1}$-approximation, and we prove it is the best possible approximation under a reasonable assumption in the complexity theory, $NP \not\subset DTIME(n^{\log\log n})$. We also show that the exact computation of the final influence given a seed set can be solved in linear time in the \emph{deterministic linear threshold model}. The Least Seed Set problem, which aims to find a seed set with least number of people to activate all the required people in a given social network, is discussed. Using an analysis framework based on Set Cover, we show a $O($log$n)$-approximation in the case that a people become active when one of its neighbors is activated.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信