A Weighted Consensus Function Based on Information-Theoretic Principles to Combine Soft Clusterings

Yan Gao, Shiwen Gu, Jianhua Li, Zhining Liao
{"title":"A Weighted Consensus Function Based on Information-Theoretic Principles to Combine Soft Clusterings","authors":"Yan Gao, Shiwen Gu, Jianhua Li, Zhining Liao","doi":"10.1109/GrC.2007.156","DOIUrl":null,"url":null,"abstract":"How to combine multiple clusterings into a single clustering solution of better quality is a critical problem in cluster ensemble. In this paper, we extend Strehl's consensus function based on information- theoretic principles and propose a novel weighted consensus function to combine multiple \"soft\" clusterings. In our consensus function, we use mutual information to measure the sharing information between two \"soft\" clusterings and emphasize the clustering which is much different from the others. We use the algorithm similar to sequential k-means to obtain the solution of this consensus function and conduct experiments on four real-world datasets to compare our algorithm with other four consensus function, including CSPA, HGPA, MCLA, QMI. The results indicate that our consensus function provides solutions of better quality than CSPA, HGPA, MCLA, QMI and when the distribution of diversity in cluster ensembles is uneven, considering the influence of diversity can improve the quality of clustering ensemble.","PeriodicalId":259430,"journal":{"name":"2007 IEEE International Conference on Granular Computing (GRC 2007)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 IEEE International Conference on Granular Computing (GRC 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GrC.2007.156","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

How to combine multiple clusterings into a single clustering solution of better quality is a critical problem in cluster ensemble. In this paper, we extend Strehl's consensus function based on information- theoretic principles and propose a novel weighted consensus function to combine multiple "soft" clusterings. In our consensus function, we use mutual information to measure the sharing information between two "soft" clusterings and emphasize the clustering which is much different from the others. We use the algorithm similar to sequential k-means to obtain the solution of this consensus function and conduct experiments on four real-world datasets to compare our algorithm with other four consensus function, including CSPA, HGPA, MCLA, QMI. The results indicate that our consensus function provides solutions of better quality than CSPA, HGPA, MCLA, QMI and when the distribution of diversity in cluster ensembles is uneven, considering the influence of diversity can improve the quality of clustering ensemble.
一种基于信息论原理的加权一致函数组合软聚类
如何将多个聚类组合成一个质量更好的聚类解是聚类集成中的一个关键问题。本文基于信息论原理,对Strehl的共识函数进行了扩展,提出了一种新的加权共识函数来组合多个“软”聚类。在我们的共识函数中,我们使用互信息来衡量两个“软”聚类之间的共享信息,并强调与其他聚类有很大不同的聚类。我们使用类似于序列k-means的算法来获得该共识函数的解,并在四个真实数据集上进行实验,将我们的算法与CSPA、HGPA、MCLA、QMI等其他四种共识函数进行比较。结果表明,我们的共识函数提供了比CSPA、HGPA、MCLA、QMI更好的解决方案,当多样性在聚类集合中分布不均匀时,考虑多样性的影响可以提高聚类集合的质量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信