{"title":"通过基于共识的方法增强群落探测的稳定性并评估其不确定性","authors":"Fabio Morea, Domenico De Stefano","doi":"arxiv-2408.02959","DOIUrl":null,"url":null,"abstract":"Complex data in social and natural sciences find effective representation\nthrough networks, wherein quantitative and categorical information can be\nassociated with nodes and connecting edges. The internal structure of networks\ncan be explored using unsupervised machine learning methods known as community\ndetection algorithms. The process of community detection is inherently subject\nto uncertainty as algorithms utilize heuristic approaches and randomised\nprocedures to explore vast solution spaces, resulting in non-deterministic\noutcomes and variability in detected communities across multiple runs.\nMoreover, many algorithms are not designed to identify outliers and may fail to\ntake into account that a network is an unordered mathematical entity. The main\naim of our work is to address these issues through a consensus-based approach\nby introducing a new framework called Consensus Community Detection (CCD). Our\nmethod can be applied to different community detection algorithms, allowing the\nquantification of uncertainty for the whole network as well as for each node,\nand providing three strategies for dealing with outliers: incorporate,\nhighlight, or group. The effectiveness of our approach is evaluated on\nartificial benchmark networks.","PeriodicalId":501172,"journal":{"name":"arXiv - STAT - Applications","volume":"26 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing Stability and Assessing Uncertainty in Community Detection through a Consensus-based Approach\",\"authors\":\"Fabio Morea, Domenico De Stefano\",\"doi\":\"arxiv-2408.02959\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Complex data in social and natural sciences find effective representation\\nthrough networks, wherein quantitative and categorical information can be\\nassociated with nodes and connecting edges. The internal structure of networks\\ncan be explored using unsupervised machine learning methods known as community\\ndetection algorithms. The process of community detection is inherently subject\\nto uncertainty as algorithms utilize heuristic approaches and randomised\\nprocedures to explore vast solution spaces, resulting in non-deterministic\\noutcomes and variability in detected communities across multiple runs.\\nMoreover, many algorithms are not designed to identify outliers and may fail to\\ntake into account that a network is an unordered mathematical entity. The main\\naim of our work is to address these issues through a consensus-based approach\\nby introducing a new framework called Consensus Community Detection (CCD). Our\\nmethod can be applied to different community detection algorithms, allowing the\\nquantification of uncertainty for the whole network as well as for each node,\\nand providing three strategies for dealing with outliers: incorporate,\\nhighlight, or group. The effectiveness of our approach is evaluated on\\nartificial benchmark networks.\",\"PeriodicalId\":501172,\"journal\":{\"name\":\"arXiv - STAT - Applications\",\"volume\":\"26 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.02959\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.02959","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Enhancing Stability and Assessing Uncertainty in Community Detection through a Consensus-based Approach
Complex data in social and natural sciences find effective representation
through networks, wherein quantitative and categorical information can be
associated with nodes and connecting edges. The internal structure of networks
can be explored using unsupervised machine learning methods known as community
detection algorithms. The process of community detection is inherently subject
to uncertainty as algorithms utilize heuristic approaches and randomised
procedures to explore vast solution spaces, resulting in non-deterministic
outcomes and variability in detected communities across multiple runs.
Moreover, many algorithms are not designed to identify outliers and may fail to
take into account that a network is an unordered mathematical entity. The main
aim of our work is to address these issues through a consensus-based approach
by introducing a new framework called Consensus Community Detection (CCD). Our
method can be applied to different community detection algorithms, allowing the
quantification of uncertainty for the whole network as well as for each node,
and providing three strategies for dealing with outliers: incorporate,
highlight, or group. The effectiveness of our approach is evaluated on
artificial benchmark networks.