Y-Graph: A Max-Ascent-Angle Graph for Detecting Clusters

IF 8.9 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Junyi Guan;Sheng Li;Xiongxiong He;Jiajia Chen;Yangyang Zhao;Yuxuan Zhang
{"title":"Y-Graph: A Max-Ascent-Angle Graph for Detecting Clusters","authors":"Junyi Guan;Sheng Li;Xiongxiong He;Jiajia Chen;Yangyang Zhao;Yuxuan Zhang","doi":"10.1109/TKDE.2024.3486221","DOIUrl":null,"url":null,"abstract":"Graph clustering technique is highly effective in detecting complex-shaped clusters, in which graph building is a crucial step. Nevertheless, building a reasonable graph that can exhibit high connectivity within clusters and low connectivity across clusters is challenging. Herein, we design a max-ascent-angle graph called the “Y-graph”, a high-sparse graph that automatically allocates dense edges within clusters and sparse edges across clusters, regardless of their shapes or dimensionality. In the graph, every point \n<inline-formula><tex-math>$x$</tex-math></inline-formula>\n is allowed to connect its nearest higher-density neighbor \n<inline-formula><tex-math>$\\delta$</tex-math></inline-formula>\n, and another higher-density neighbor \n<inline-formula><tex-math>$\\gamma$</tex-math></inline-formula>\n, satisfying that the angle \n<inline-formula><tex-math>$\\angle \\delta x\\gamma$</tex-math></inline-formula>\n is the largest, called “max-ascent-angle”. By seeking the max-ascent-angle, points are automatically connected as the Y-graph, which is a reasonable graph that can effectively balance inter-cluster connectivity and intra-cluster non-connectivity. Besides, an edge weight function is designed to capture the similarity of the neighbor probability distribution, which effectively represents the density connectivity between points. By employing the Normalized-Cut (Ncut) technique, a Ncut-Y algorithm is proposed. Benefiting from the excellent performance of Y-graph, Ncut-Y can fast seek and cut the edges located in the low-density boundaries between clusters, thereby, capturing clusters effectively. Experimental results on both synthetic and real datasets demonstrate the effectiveness of Y-graph and Ncut-Y.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"37 1","pages":"542-556"},"PeriodicalIF":8.9000,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Knowledge and Data Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10734073/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Graph clustering technique is highly effective in detecting complex-shaped clusters, in which graph building is a crucial step. Nevertheless, building a reasonable graph that can exhibit high connectivity within clusters and low connectivity across clusters is challenging. Herein, we design a max-ascent-angle graph called the “Y-graph”, a high-sparse graph that automatically allocates dense edges within clusters and sparse edges across clusters, regardless of their shapes or dimensionality. In the graph, every point $x$ is allowed to connect its nearest higher-density neighbor $\delta$ , and another higher-density neighbor $\gamma$ , satisfying that the angle $\angle \delta x\gamma$ is the largest, called “max-ascent-angle”. By seeking the max-ascent-angle, points are automatically connected as the Y-graph, which is a reasonable graph that can effectively balance inter-cluster connectivity and intra-cluster non-connectivity. Besides, an edge weight function is designed to capture the similarity of the neighbor probability distribution, which effectively represents the density connectivity between points. By employing the Normalized-Cut (Ncut) technique, a Ncut-Y algorithm is proposed. Benefiting from the excellent performance of Y-graph, Ncut-Y can fast seek and cut the edges located in the low-density boundaries between clusters, thereby, capturing clusters effectively. Experimental results on both synthetic and real datasets demonstrate the effectiveness of Y-graph and Ncut-Y.
y图:用于检测聚类的最大上升角图
图聚类技术是一种高效的复杂形状聚类检测技术,其中图的构建是关键步骤。然而,构建一个合理的图来显示集群内部的高连通性和集群之间的低连通性是具有挑战性的。在这里,我们设计了一个称为“y图”的最大上升角图,这是一个高度稀疏的图,可以自动分配集群内的密集边缘和集群间的稀疏边缘,而不管它们的形状或维度如何。在图中,每个点$x$被允许连接其最近的高密度邻居$\delta$和另一个高密度邻居$\gamma$,满足角度$\angle \delta x\gamma$是最大的,称为“最大上升角”。通过寻找最大上升角,将点自动连接成y图,这是一种合理的图,可以有效地平衡簇间连通性和簇内非连通性。此外,设计了一个边缘权函数来捕捉相邻概率分布的相似性,有效地表示了点之间的密度连通性。采用归一化切割(Ncut)技术,提出了一种Ncut- y算法。得益于y图的优异性能,Ncut-Y可以快速寻找和切割位于簇之间低密度边界的边,从而有效地捕获簇。在合成数据集和真实数据集上的实验结果都证明了Y-graph和Ncut-Y的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Transactions on Knowledge and Data Engineering
IEEE Transactions on Knowledge and Data Engineering 工程技术-工程:电子与电气
CiteScore
11.70
自引率
3.40%
发文量
515
审稿时长
6 months
期刊介绍: The IEEE Transactions on Knowledge and Data Engineering encompasses knowledge and data engineering aspects within computer science, artificial intelligence, electrical engineering, computer engineering, and related fields. It provides an interdisciplinary platform for disseminating new developments in knowledge and data engineering and explores the practicality of these concepts in both hardware and software. Specific areas covered include knowledge-based and expert systems, AI techniques for knowledge and data management, tools, and methodologies, distributed processing, real-time systems, architectures, data management practices, database design, query languages, security, fault tolerance, statistical databases, algorithms, performance evaluation, and applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信