Xiaojia Xu, Haoyu Liu, Xiaowei Lv, Yongcai Wang, Deying Li
{"title":"An Efficient and Exact Algorithm for Locally h-Clique Densest Subgraph Discovery","authors":"Xiaojia Xu, Haoyu Liu, Xiaowei Lv, Yongcai Wang, Deying Li","doi":"arxiv-2408.14022","DOIUrl":null,"url":null,"abstract":"Detecting locally, non-overlapping, near-clique densest subgraphs is a\ncrucial problem for community search in social networks. As a vertex may be\ninvolved in multiple overlapped local cliques, detecting locally densest\nsub-structures considering h-clique density, i.e., locally h-clique densest\nsubgraph (LhCDS) attracts great interests. This paper investigates the LhCDS\ndetection problem and proposes an efficient and exact algorithm to list the\ntop-k non-overlapping, locally h-clique dense, and compact subgraphs. We in\nparticular jointly consider h-clique compact number and LhCDS and design a new\n\"Iterative Propose-Prune-and-Verify\" pipeline (IPPV) for top-k LhCDS detection.\n(1) In the proposal part, we derive initial bounds for h-clique compact\nnumbers; prove the validity, and extend a convex programming method to tighten\nthe bounds for proposing LhCDS candidates without missing any. (2) Then a\ntentative graph decomposition method is proposed to solve the challenging case\nwhere a clique spans multiple subgraphs in graph decomposition. (3) To deal\nwith the verification difficulty, both a basic and a fast verification method\nare proposed, where the fast method constructs a smaller-scale flow network to\nimprove efficiency while preserving the verification correctness. The verified\nLhCDSes are returned, while the candidates that remained unsure reenter the\nIPPV pipeline. (4) We further extend the proposed methods to locally more\ngeneral pattern densest subgraph detection problems. We prove the exactness and\nlow complexity of the proposed algorithm. Extensive experiments on real\ndatasets show the effectiveness and high efficiency of IPPV.","PeriodicalId":501525,"journal":{"name":"arXiv - CS - Data Structures and Algorithms","volume":"20 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Data Structures and Algorithms","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.14022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Detecting locally, non-overlapping, near-clique densest subgraphs is a
crucial problem for community search in social networks. As a vertex may be
involved in multiple overlapped local cliques, detecting locally densest
sub-structures considering h-clique density, i.e., locally h-clique densest
subgraph (LhCDS) attracts great interests. This paper investigates the LhCDS
detection problem and proposes an efficient and exact algorithm to list the
top-k non-overlapping, locally h-clique dense, and compact subgraphs. We in
particular jointly consider h-clique compact number and LhCDS and design a new
"Iterative Propose-Prune-and-Verify" pipeline (IPPV) for top-k LhCDS detection.
(1) In the proposal part, we derive initial bounds for h-clique compact
numbers; prove the validity, and extend a convex programming method to tighten
the bounds for proposing LhCDS candidates without missing any. (2) Then a
tentative graph decomposition method is proposed to solve the challenging case
where a clique spans multiple subgraphs in graph decomposition. (3) To deal
with the verification difficulty, both a basic and a fast verification method
are proposed, where the fast method constructs a smaller-scale flow network to
improve efficiency while preserving the verification correctness. The verified
LhCDSes are returned, while the candidates that remained unsure reenter the
IPPV pipeline. (4) We further extend the proposed methods to locally more
general pattern densest subgraph detection problems. We prove the exactness and
low complexity of the proposed algorithm. Extensive experiments on real
datasets show the effectiveness and high efficiency of IPPV.