Yasamin Tabatabaee, Eleanor Wedell, Minhyuk Park, Tandy Warnow
{"title":"FastEnsemble: A new scalable ensemble clustering method","authors":"Yasamin Tabatabaee, Eleanor Wedell, Minhyuk Park, Tandy Warnow","doi":"arxiv-2409.02077","DOIUrl":null,"url":null,"abstract":"Many community detection algorithms are stochastic in nature, and their\noutput can vary based on different input parameters and random seeds. Consensus\nclustering methods, such as FastConsensus and ECG, combine clusterings from\nmultiple runs of the same clustering algorithm, in order to improve stability\nand accuracy. In this study we present a new consensus clustering method,\nFastEnsemble, and show that it provides advantages over both FastConsensus and\nECG. Furthermore, FastEnsemble is designed for use with any clustering method,\nand we show results using \\ourmethod with Leiden optimizing modularity or the\nConstant Potts model. FastEnsemble is available in Github at\nhttps://github.com/ytabatabaee/fast-ensemble","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Social and Information Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.02077","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Many community detection algorithms are stochastic in nature, and their
output can vary based on different input parameters and random seeds. Consensus
clustering methods, such as FastConsensus and ECG, combine clusterings from
multiple runs of the same clustering algorithm, in order to improve stability
and accuracy. In this study we present a new consensus clustering method,
FastEnsemble, and show that it provides advantages over both FastConsensus and
ECG. Furthermore, FastEnsemble is designed for use with any clustering method,
and we show results using \ourmethod with Leiden optimizing modularity or the
Constant Potts model. FastEnsemble is available in Github at
https://github.com/ytabatabaee/fast-ensemble