An Efficient and Exact Algorithm for Locally h-Clique Densest Subgraph Discovery

Xiaojia Xu, Haoyu Liu, Xiaowei Lv, Yongcai Wang, Deying Li
{"title":"An Efficient and Exact Algorithm for Locally h-Clique Densest Subgraph Discovery","authors":"Xiaojia Xu, Haoyu Liu, Xiaowei Lv, Yongcai Wang, Deying Li","doi":"arxiv-2408.14022","DOIUrl":null,"url":null,"abstract":"Detecting locally, non-overlapping, near-clique densest subgraphs is a\ncrucial problem for community search in social networks. As a vertex may be\ninvolved in multiple overlapped local cliques, detecting locally densest\nsub-structures considering h-clique density, i.e., locally h-clique densest\nsubgraph (LhCDS) attracts great interests. This paper investigates the LhCDS\ndetection problem and proposes an efficient and exact algorithm to list the\ntop-k non-overlapping, locally h-clique dense, and compact subgraphs. We in\nparticular jointly consider h-clique compact number and LhCDS and design a new\n\"Iterative Propose-Prune-and-Verify\" pipeline (IPPV) for top-k LhCDS detection.\n(1) In the proposal part, we derive initial bounds for h-clique compact\nnumbers; prove the validity, and extend a convex programming method to tighten\nthe bounds for proposing LhCDS candidates without missing any. (2) Then a\ntentative graph decomposition method is proposed to solve the challenging case\nwhere a clique spans multiple subgraphs in graph decomposition. (3) To deal\nwith the verification difficulty, both a basic and a fast verification method\nare proposed, where the fast method constructs a smaller-scale flow network to\nimprove efficiency while preserving the verification correctness. The verified\nLhCDSes are returned, while the candidates that remained unsure reenter the\nIPPV pipeline. (4) We further extend the proposed methods to locally more\ngeneral pattern densest subgraph detection problems. We prove the exactness and\nlow complexity of the proposed algorithm. Extensive experiments on real\ndatasets show the effectiveness and high efficiency of IPPV.","PeriodicalId":501525,"journal":{"name":"arXiv - CS - Data Structures and Algorithms","volume":"20 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Data Structures and Algorithms","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.14022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Detecting locally, non-overlapping, near-clique densest subgraphs is a crucial problem for community search in social networks. As a vertex may be involved in multiple overlapped local cliques, detecting locally densest sub-structures considering h-clique density, i.e., locally h-clique densest subgraph (LhCDS) attracts great interests. This paper investigates the LhCDS detection problem and proposes an efficient and exact algorithm to list the top-k non-overlapping, locally h-clique dense, and compact subgraphs. We in particular jointly consider h-clique compact number and LhCDS and design a new "Iterative Propose-Prune-and-Verify" pipeline (IPPV) for top-k LhCDS detection. (1) In the proposal part, we derive initial bounds for h-clique compact numbers; prove the validity, and extend a convex programming method to tighten the bounds for proposing LhCDS candidates without missing any. (2) Then a tentative graph decomposition method is proposed to solve the challenging case where a clique spans multiple subgraphs in graph decomposition. (3) To deal with the verification difficulty, both a basic and a fast verification method are proposed, where the fast method constructs a smaller-scale flow network to improve efficiency while preserving the verification correctness. The verified LhCDSes are returned, while the candidates that remained unsure reenter the IPPV pipeline. (4) We further extend the proposed methods to locally more general pattern densest subgraph detection problems. We prove the exactness and low complexity of the proposed algorithm. Extensive experiments on real datasets show the effectiveness and high efficiency of IPPV.
局部 h-Clique 最密集子图发现的高效精确算法
在社交网络中,检测局部非重叠的近小集团最密集子图是社区搜索的关键问题。由于一个顶点可能涉及多个重叠的局部小群,因此检测考虑 h 小群密度的局部最密集子结构,即局部 h 小群最密集子图(LhCDS)引起了人们的极大兴趣。本文研究了LhCDS检测问题,并提出了一种高效精确的算法来列出前k个非重叠、局部h-clique密集且紧凑的子图。(1)在提议部分,我们推导出了 h-clique compact number 的初始边界;证明了其有效性,并扩展了一种凸编程方法,以收紧提议 LhCDS 候选者的边界,且不遗漏任何候选者。(2) 然后,我们提出了本质图分解方法,以解决图分解中一个簇跨越多个子图的难题。(3) 针对验证困难的问题,提出了基本验证方法和快速验证方法。经过验证的 LhCDSes 将被返回,而仍不确定的候选者将重新进入 IPPV 流水线。(4) 我们进一步将提出的方法扩展到局部更一般的模式最密子图检测问题。我们证明了所提算法的精确性和低复杂度。在真实数据集上进行的大量实验证明了 IPPV 的有效性和高效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信