Fei Ye , Jingsong Xiao , Weidong Ma , Yulai Miao , Ying Yang
{"title":"Consistent community detection approach in the nonparametric weighted stochastic blockmodel with unspecified number of communities","authors":"Fei Ye , Jingsong Xiao , Weidong Ma , Yulai Miao , Ying Yang","doi":"10.1016/j.jspi.2025.106339","DOIUrl":null,"url":null,"abstract":"<div><div>The stochastic blockmodel (SBM) is a widely used model for representing graphs. Numerous approaches have been applied to the SBM to detect latent community structures in graphs, typically using two types of consistency (strong and weak) to evaluate their performance. Most of these methods have been studied and shown to be consistent under the SBM framework. However, the consistency of the weighted SBM, an important extension of the SBM, has been largely overlooked. Moreover, few approaches are capable of detecting communities when the number of communities is unknown. In this paper, we propose a nonparametric method for effective community detection under the assortative, nonparametric weighted SBM with an unknown number of communities, and we establish the consistency of our approach. We introduce a novel concept, “consistency in relationship”, as a more practical criterion to assess the performance of community detection algorithms. Since solving the optimization problem in our approach becomes intractable for large sample sizes, we propose an efficient algorithm to approximate it. Simulations demonstrate that our community detection method is both efficient and robust, particularly for unbalanced networks. We illustrate the effectiveness of our approach on three real-world networks.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"242 ","pages":"Article 106339"},"PeriodicalIF":0.8000,"publicationDate":"2025-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Statistical Planning and Inference","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378375825000771","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
The stochastic blockmodel (SBM) is a widely used model for representing graphs. Numerous approaches have been applied to the SBM to detect latent community structures in graphs, typically using two types of consistency (strong and weak) to evaluate their performance. Most of these methods have been studied and shown to be consistent under the SBM framework. However, the consistency of the weighted SBM, an important extension of the SBM, has been largely overlooked. Moreover, few approaches are capable of detecting communities when the number of communities is unknown. In this paper, we propose a nonparametric method for effective community detection under the assortative, nonparametric weighted SBM with an unknown number of communities, and we establish the consistency of our approach. We introduce a novel concept, “consistency in relationship”, as a more practical criterion to assess the performance of community detection algorithms. Since solving the optimization problem in our approach becomes intractable for large sample sizes, we propose an efficient algorithm to approximate it. Simulations demonstrate that our community detection method is both efficient and robust, particularly for unbalanced networks. We illustrate the effectiveness of our approach on three real-world networks.
期刊介绍:
The Journal of Statistical Planning and Inference offers itself as a multifaceted and all-inclusive bridge between classical aspects of statistics and probability, and the emerging interdisciplinary aspects that have a potential of revolutionizing the subject. While we maintain our traditional strength in statistical inference, design, classical probability, and large sample methods, we also have a far more inclusive and broadened scope to keep up with the new problems that confront us as statisticians, mathematicians, and scientists.
We publish high quality articles in all branches of statistics, probability, discrete mathematics, machine learning, and bioinformatics. We also especially welcome well written and up to date review articles on fundamental themes of statistics, probability, machine learning, and general biostatistics. Thoughtful letters to the editors, interesting problems in need of a solution, and short notes carrying an element of elegance or beauty are equally welcome.