A Scalable Algorithm for Fair Influence Maximization With Unbiased Estimator

IF 10.4 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE Transactions on Knowledge and Data Engineering Pub Date : 2025-04-24 DOI:10.1109/TKDE.2025.3564283

Xiaobin Rui;Zhixiao Wang;Hao Peng;Wei Chen;Philip S. Yu

{"title":"A Scalable Algorithm for Fair Influence Maximization With Unbiased Estimator","authors":"Xiaobin Rui;Zhixiao Wang;Hao Peng;Wei Chen;Philip S. Yu","doi":"10.1109/TKDE.2025.3564283","DOIUrl":null,"url":null,"abstract":"This paper studies the fair influence maximization problem with efficient algorithms. In particular, given a graph <inline-formula><tex-math>$G$</tex-math></inline-formula>, a community structure <inline-formula><tex-math>${\\mathcal {C}}$</tex-math></inline-formula> consisting of disjoint communities, and a budget <inline-formula><tex-math>$k$</tex-math></inline-formula>, the problem asks to select a seed set <inline-formula><tex-math>$S$</tex-math></inline-formula> (<inline-formula><tex-math>$|S|=k$</tex-math></inline-formula>) that maximizes the influence spread while narrowing the influence gap between different communities. This problem derives from some significant social scenarios, such as health interventions (e.g. suicide/HIV prevention) where individuals from underrepresented groups or LGBTQ communities may be disproportionately excluded from the benefits of the intervention. To depict the concept of fairness in the context of influence maximization, researchers have proposed various notions of fairness, where the welfare fairness notion that better balances fairness level and influence spread has shown promising effectiveness. However, the lack of efficient algorithms for optimizing the objective function under welfare fairness restricts its application to networks of only a few hundred nodes. In this paper, we modify the objective function of welfare fairness to maximize the exponentially weighted sum and the logarithmically weighted sum over all communities’ influenced fractions (utility). To achieve efficient algorithms with theoretical guarantees, we first introduce two unbiased estimators: one for the fractional power of the arithmetic mean and the other for the logarithm of the arithmetic mean. Then, by adapting the Reverse Influence Sampling (RIS) approach, we convert the optimization problem to a weighted maximum coverage problem. We also analyze the number of reverse reachable sets needed to approximate the fair influence at a high probability. Finally, we present an efficient algorithm that guarantees <inline-formula><tex-math>$1-1/e - \\varepsilon$</tex-math></inline-formula> (positive objective function) or <inline-formula><tex-math>$1+1/e + \\varepsilon$</tex-math></inline-formula> (negative objective function) approximation for any small <inline-formula><tex-math>$\\varepsilon > 0$</tex-math></inline-formula>. Experiments demonstrate that our proposed algorithm could efficiently handle large-scale networks with good performance.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"37 7","pages":"3881-3895"},"PeriodicalIF":10.4000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Knowledge and Data Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10976569/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

This paper studies the fair influence maximization problem with efficient algorithms. In particular, given a graph

$G$

, a community structure

${\mathcal {C}}$

consisting of disjoint communities, and a budget

$k$

, the problem asks to select a seed set

$S$

(

$|S|=k$

) that maximizes the influence spread while narrowing the influence gap between different communities. This problem derives from some significant social scenarios, such as health interventions (e.g. suicide/HIV prevention) where individuals from underrepresented groups or LGBTQ communities may be disproportionately excluded from the benefits of the intervention. To depict the concept of fairness in the context of influence maximization, researchers have proposed various notions of fairness, where the welfare fairness notion that better balances fairness level and influence spread has shown promising effectiveness. However, the lack of efficient algorithms for optimizing the objective function under welfare fairness restricts its application to networks of only a few hundred nodes. In this paper, we modify the objective function of welfare fairness to maximize the exponentially weighted sum and the logarithmically weighted sum over all communities’ influenced fractions (utility). To achieve efficient algorithms with theoretical guarantees, we first introduce two unbiased estimators: one for the fractional power of the arithmetic mean and the other for the logarithm of the arithmetic mean. Then, by adapting the Reverse Influence Sampling (RIS) approach, we convert the optimization problem to a weighted maximum coverage problem. We also analyze the number of reverse reachable sets needed to approximate the fair influence at a high probability. Finally, we present an efficient algorithm that guarantees

$1-1/e - \varepsilon$

(positive objective function) or

$1+1/e + \varepsilon$

(negative objective function) approximation for any small

$\varepsilon > 0$

. Experiments demonstrate that our proposed algorithm could efficiently handle large-scale networks with good performance.

查看原文本刊更多论文

一种具有无偏估计量的可伸缩公平影响最大化算法

本文用高效算法研究了公平影响最大化问题。具体来说，给定一个图$G$，一个由不相交的社区组成的社区结构${\mathcal {C}}$，以及一个预算$k$，问题要求选择一个种子集$S$ ($|S|=k$)，使影响传播最大化，同时缩小不同社区之间的影响差距。这一问题源于一些重要的社会情况，例如卫生干预措施（例如自杀/艾滋病毒预防），其中代表性不足的群体或LGBTQ社区的个人可能不成比例地被排除在干预措施的好处之外。为了描述影响力最大化背景下的公平概念，研究者们提出了各种公平概念，其中福利公平概念更好地平衡了公平水平和影响力传播，显示出了良好的效果。然而，缺乏有效的优化福利公平下目标函数的算法，限制了其在只有几百个节点的网络中的应用。在本文中，我们修改了福利公平的目标函数，以最大化所有社区影响分数（效用）的指数加权和对数加权和。为了实现具有理论保证的高效算法，我们首先引入了两个无偏估计量：一个用于算术平均值的分数次方，另一个用于算术平均值的对数。然后，采用反向影响采样（RIS）方法，将优化问题转化为加权最大覆盖问题。我们还分析了在高概率下近似公平影响所需的反向可达集的数量。最后，我们提出了一个有效的算法，保证$1-1/e - \varepsilon$（正目标函数）或$1+1/e + \varepsilon$（负目标函数）逼近任何小$\varepsilon >；0美元。实验表明，该算法能够有效地处理大规模网络，并具有良好的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Knowledge and Data Engineering 工程技术-工程：电子与电气

CiteScore

11.70

自引率

3.40%

发文量

515

审稿时长

6 months

期刊介绍： The IEEE Transactions on Knowledge and Data Engineering encompasses knowledge and data engineering aspects within computer science, artificial intelligence, electrical engineering, computer engineering, and related fields. It provides an interdisciplinary platform for disseminating new developments in knowledge and data engineering and explores the practicality of these concepts in both hardware and software. Specific areas covered include knowledge-based and expert systems, AI techniques for knowledge and data management, tools, and methodologies, distributed processing, real-time systems, architectures, data management practices, database design, query languages, security, fault tolerance, statistical databases, algorithms, performance evaluation, and applications.