Yang Liu;Hejiao Huang;Kaiqiang Yu;Shengxin Liu;Cheng Long
{"title":"最小$k$k顶点连通图搜索","authors":"Yang Liu;Hejiao Huang;Kaiqiang Yu;Shengxin Liu;Cheng Long","doi":"10.1109/TKDE.2025.3565844","DOIUrl":null,"url":null,"abstract":"The <inline-formula><tex-math>$k$</tex-math></inline-formula>-vertex connected (<inline-formula><tex-math>$k$</tex-math></inline-formula>-VC) subgraph, which remains connected with fewer than <inline-formula><tex-math>$k$</tex-math></inline-formula> vertices being removed, is an essential structure in graph mining. It has found many applications, such as survivable network design and web search optimization. However, existing studies focus on mining maximal <inline-formula><tex-math>$k$</tex-math></inline-formula>-VCs, which are excessively large yet less cohesive in real applications. In this paper, we study the minimum <i><inline-formula><tex-math>$k$</tex-math><alternatives><mml:math><mml:mi>k</mml:mi></mml:math><inline-graphic></alternatives></inline-formula>-VC search (</i><i><monospace>MinVC</monospace></i><i>)</i> problem, seeking to find a <inline-formula><tex-math>$k$</tex-math></inline-formula>-VC with the minimum number of vertices. We formally prove that this problem is NP-hard and then propose two algorithms to obtain the exact solution. The basic method, called <monospace>Enum</monospace>, follows a branch-and-bound framework with some pruning rules, which directly enumerates all possible vertex sets. Nonetheless, it suffers from the efficiency issues due to the non-hereditary property of the <inline-formula><tex-math>$k$</tex-math></inline-formula>-VC model. To address this challenge, we propose an advanced method, called <monospace>VCtoB</monospace>, which divides the <monospace>MinVC</monospace> problem into several new sub-problems, called the <i>fixed-size <inline-formula><tex-math>$k$</tex-math><alternatives><mml:math><mml:mi>k</mml:mi></mml:math><inline-graphic></alternatives></inline-formula>-VC</i> problems. Each of them can be solved efficiently by exploiting the hereditary property of the <inline-formula><tex-math>$s$</tex-math></inline-formula>-bundle model. Finally, our empirical experiments on 139 real-world networks demonstrate that <monospace>VCtoB</monospace> achieves performance improvement of up to six orders of magnitude over the baseline.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"37 7","pages":"4159-4165"},"PeriodicalIF":10.4000,"publicationDate":"2025-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Minimum $k$k-Vertex Connected Graph Search\",\"authors\":\"Yang Liu;Hejiao Huang;Kaiqiang Yu;Shengxin Liu;Cheng Long\",\"doi\":\"10.1109/TKDE.2025.3565844\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The <inline-formula><tex-math>$k$</tex-math></inline-formula>-vertex connected (<inline-formula><tex-math>$k$</tex-math></inline-formula>-VC) subgraph, which remains connected with fewer than <inline-formula><tex-math>$k$</tex-math></inline-formula> vertices being removed, is an essential structure in graph mining. It has found many applications, such as survivable network design and web search optimization. However, existing studies focus on mining maximal <inline-formula><tex-math>$k$</tex-math></inline-formula>-VCs, which are excessively large yet less cohesive in real applications. In this paper, we study the minimum <i><inline-formula><tex-math>$k$</tex-math><alternatives><mml:math><mml:mi>k</mml:mi></mml:math><inline-graphic></alternatives></inline-formula>-VC search (</i><i><monospace>MinVC</monospace></i><i>)</i> problem, seeking to find a <inline-formula><tex-math>$k$</tex-math></inline-formula>-VC with the minimum number of vertices. We formally prove that this problem is NP-hard and then propose two algorithms to obtain the exact solution. The basic method, called <monospace>Enum</monospace>, follows a branch-and-bound framework with some pruning rules, which directly enumerates all possible vertex sets. Nonetheless, it suffers from the efficiency issues due to the non-hereditary property of the <inline-formula><tex-math>$k$</tex-math></inline-formula>-VC model. To address this challenge, we propose an advanced method, called <monospace>VCtoB</monospace>, which divides the <monospace>MinVC</monospace> problem into several new sub-problems, called the <i>fixed-size <inline-formula><tex-math>$k$</tex-math><alternatives><mml:math><mml:mi>k</mml:mi></mml:math><inline-graphic></alternatives></inline-formula>-VC</i> problems. Each of them can be solved efficiently by exploiting the hereditary property of the <inline-formula><tex-math>$s$</tex-math></inline-formula>-bundle model. Finally, our empirical experiments on 139 real-world networks demonstrate that <monospace>VCtoB</monospace> achieves performance improvement of up to six orders of magnitude over the baseline.\",\"PeriodicalId\":13496,\"journal\":{\"name\":\"IEEE Transactions on Knowledge and Data Engineering\",\"volume\":\"37 7\",\"pages\":\"4159-4165\"},\"PeriodicalIF\":10.4000,\"publicationDate\":\"2025-03-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Knowledge and Data Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10982036/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Knowledge and Data Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10982036/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
The $k$-vertex connected ($k$-VC) subgraph, which remains connected with fewer than $k$ vertices being removed, is an essential structure in graph mining. It has found many applications, such as survivable network design and web search optimization. However, existing studies focus on mining maximal $k$-VCs, which are excessively large yet less cohesive in real applications. In this paper, we study the minimum $k$k-VC search (MinVC) problem, seeking to find a $k$-VC with the minimum number of vertices. We formally prove that this problem is NP-hard and then propose two algorithms to obtain the exact solution. The basic method, called Enum, follows a branch-and-bound framework with some pruning rules, which directly enumerates all possible vertex sets. Nonetheless, it suffers from the efficiency issues due to the non-hereditary property of the $k$-VC model. To address this challenge, we propose an advanced method, called VCtoB, which divides the MinVC problem into several new sub-problems, called the fixed-size $k$k-VC problems. Each of them can be solved efficiently by exploiting the hereditary property of the $s$-bundle model. Finally, our empirical experiments on 139 real-world networks demonstrate that VCtoB achieves performance improvement of up to six orders of magnitude over the baseline.
期刊介绍:
The IEEE Transactions on Knowledge and Data Engineering encompasses knowledge and data engineering aspects within computer science, artificial intelligence, electrical engineering, computer engineering, and related fields. It provides an interdisciplinary platform for disseminating new developments in knowledge and data engineering and explores the practicality of these concepts in both hardware and software. Specific areas covered include knowledge-based and expert systems, AI techniques for knowledge and data management, tools, and methodologies, distributed processing, real-time systems, architectures, data management practices, database design, query languages, security, fault tolerance, statistical databases, algorithms, performance evaluation, and applications.