基于ising的核方法组合聚类

Masahito Kumagai, K. Komatsu, Masayuki Sato, Hiroaki Kobayashi
{"title":"基于ising的核方法组合聚类","authors":"Masahito Kumagai, K. Komatsu, Masayuki Sato, Hiroaki Kobayashi","doi":"10.1109/MCSoC51149.2021.00037","DOIUrl":null,"url":null,"abstract":"Combinatorial clustering based on the Ising model is getting attention as a method to obtain high-quality clustering results. Furthermore, combinatorial clustering using the kernel method can handle any irregular data type by using a kernel trick. The kernel trick is an approach to the extension of the data to an arbitrary high-dimensional feature space by switching the kernel function. However, the conventional kernel clustering based on the Ising model can only be used in the limited case where the number of clusters is two. This is because the Ising model is composed of decision variables that represent binary values. This paper proposes Ising-based combinatorial clustering using a kernel method that can handle two or more clusters. The key idea of the proposed method is to represent clustering results using one-hot encoding. One-hot encoding represents a cluster to which a single data belongs by using bits whose number is the same as that of clusters. However, the one-hot constraint caused by the use of one-hot encoding decreases the quality of clustering. To this problem, in this paper, combinatorial clustering based on an externally defined one-hot constraint is used. The proposed kernel-based combinatorial clustering works with more than two clusters. Therefore, the proposed method is compared with Euclidean distance-based combinatorial clustering that divides the data into two or more clusters as the conventional method. Through experiments, it is clarified that the quality of the clustering results of the proposed method for irregular data is significantly better than that of the conventional method.","PeriodicalId":166811,"journal":{"name":"2021 IEEE 14th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Ising-Based Combinatorial Clustering Using the Kernel Method\",\"authors\":\"Masahito Kumagai, K. Komatsu, Masayuki Sato, Hiroaki Kobayashi\",\"doi\":\"10.1109/MCSoC51149.2021.00037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Combinatorial clustering based on the Ising model is getting attention as a method to obtain high-quality clustering results. Furthermore, combinatorial clustering using the kernel method can handle any irregular data type by using a kernel trick. The kernel trick is an approach to the extension of the data to an arbitrary high-dimensional feature space by switching the kernel function. However, the conventional kernel clustering based on the Ising model can only be used in the limited case where the number of clusters is two. This is because the Ising model is composed of decision variables that represent binary values. This paper proposes Ising-based combinatorial clustering using a kernel method that can handle two or more clusters. The key idea of the proposed method is to represent clustering results using one-hot encoding. One-hot encoding represents a cluster to which a single data belongs by using bits whose number is the same as that of clusters. However, the one-hot constraint caused by the use of one-hot encoding decreases the quality of clustering. To this problem, in this paper, combinatorial clustering based on an externally defined one-hot constraint is used. The proposed kernel-based combinatorial clustering works with more than two clusters. Therefore, the proposed method is compared with Euclidean distance-based combinatorial clustering that divides the data into two or more clusters as the conventional method. Through experiments, it is clarified that the quality of the clustering results of the proposed method for irregular data is significantly better than that of the conventional method.\",\"PeriodicalId\":166811,\"journal\":{\"name\":\"2021 IEEE 14th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)\",\"volume\":\"55 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 14th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MCSoC51149.2021.00037\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 14th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MCSoC51149.2021.00037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

基于伊辛模型的组合聚类作为一种获得高质量聚类结果的方法越来越受到关注。此外,使用核方法的组合聚类可以通过使用核技巧处理任何不规则数据类型。核技巧是一种通过切换核函数将数据扩展到任意高维特征空间的方法。然而,传统的基于Ising模型的核聚类只能在聚类数量为2的有限情况下使用。这是因为Ising模型由表示二值的决策变量组成。本文提出了一种基于ising的组合聚类方法,该方法可以处理两个或多个聚类。该方法的关键思想是用单热编码表示聚类结果。单热编码通过使用与簇数相同的位数来表示单个数据所属的簇。然而,使用单热编码导致的单热约束降低了聚类的质量。针对这一问题,本文采用了基于外部定义的单热约束的组合聚类方法。所提出的基于核的组合聚类适用于两个以上的聚类。因此,将该方法与传统的基于欧几里得距离的组合聚类方法(将数据分成两个或多个聚类)进行了比较。通过实验表明,该方法对不规则数据的聚类结果质量明显优于常规方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Ising-Based Combinatorial Clustering Using the Kernel Method
Combinatorial clustering based on the Ising model is getting attention as a method to obtain high-quality clustering results. Furthermore, combinatorial clustering using the kernel method can handle any irregular data type by using a kernel trick. The kernel trick is an approach to the extension of the data to an arbitrary high-dimensional feature space by switching the kernel function. However, the conventional kernel clustering based on the Ising model can only be used in the limited case where the number of clusters is two. This is because the Ising model is composed of decision variables that represent binary values. This paper proposes Ising-based combinatorial clustering using a kernel method that can handle two or more clusters. The key idea of the proposed method is to represent clustering results using one-hot encoding. One-hot encoding represents a cluster to which a single data belongs by using bits whose number is the same as that of clusters. However, the one-hot constraint caused by the use of one-hot encoding decreases the quality of clustering. To this problem, in this paper, combinatorial clustering based on an externally defined one-hot constraint is used. The proposed kernel-based combinatorial clustering works with more than two clusters. Therefore, the proposed method is compared with Euclidean distance-based combinatorial clustering that divides the data into two or more clusters as the conventional method. Through experiments, it is clarified that the quality of the clustering results of the proposed method for irregular data is significantly better than that of the conventional method.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信