A new graph-based clustering method with dual-feature regularization and Laplacian rank constraint

IF 7.6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Knowledge-Based Systems Pub Date : 2024-11-23 DOI:10.1016/j.knosys.2024.112738

Hengdong Zhu , Yingshan Shen , Choujun Zhan , Fu Lee Wang , Heng Weng , Tianyong Hao

{"title":"A new graph-based clustering method with dual-feature regularization and Laplacian rank constraint","authors":"Hengdong Zhu , Yingshan Shen , Choujun Zhan , Fu Lee Wang , Heng Weng , Tianyong Hao","doi":"10.1016/j.knosys.2024.112738","DOIUrl":null,"url":null,"abstract":"<div><div>The performance of graph-based clustering is commonly limited by two-stage processing (Constructing and dividing similarity graph) and the quality of similar graphs. To this end, we propose a new graph-based clustering method with dual-feature regularization and Laplacian rank constraint. Specifically, our method reveals the clustering structure and unifies the two-stage process. It imposes a Laplacian rank constraint on the similarity graph to ensure that it has <span><math><mi>C</mi></math></span> connected components. In addition, a method based on dual-feature regularization is designed to capture local data feature information from both feature extraction and adaptive regression, and is applied to an accurate distance metric learning. A reweighting optimization is integrated to learn a high-quality robust similarity graph. Comprehensive experiments on Ecoli, Yale and Yeast datasets show that our method outperforms the existing graph-based clustering methods with an average improvement of about 4%, 5% and 7% on the evaluation metrics ACC, NMI and RI, respectively.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"309 ","pages":"Article 112738"},"PeriodicalIF":7.6000,"publicationDate":"2024-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705124013728","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

The performance of graph-based clustering is commonly limited by two-stage processing (Constructing and dividing similarity graph) and the quality of similar graphs. To this end, we propose a new graph-based clustering method with dual-feature regularization and Laplacian rank constraint. Specifically, our method reveals the clustering structure and unifies the two-stage process. It imposes a Laplacian rank constraint on the similarity graph to ensure that it has

C

connected components. In addition, a method based on dual-feature regularization is designed to capture local data feature information from both feature extraction and adaptive regression, and is applied to an accurate distance metric learning. A reweighting optimization is integrated to learn a high-quality robust similarity graph. Comprehensive experiments on Ecoli, Yale and Yeast datasets show that our method outperforms the existing graph-based clustering methods with an average improvement of about 4%, 5% and 7% on the evaluation metrics ACC, NMI and RI, respectively.

查看原文本刊更多论文

基于双特征正则化和拉普拉斯秩约束的基于图的聚类方法

基于图的聚类算法的性能通常受到两阶段处理（构造和划分相似图）和相似图质量的限制。为此，我们提出了一种新的基于图的双特征正则化和拉普拉斯秩约束聚类方法。具体而言，我们的方法揭示了聚类结构并统一了两阶段过程。它对相似图施加一个拉普拉斯秩约束，以确保它有C个连通的分量。此外，设计了一种基于双特征正则化的方法，从特征提取和自适应回归两方面捕获局部数据特征信息，并将其应用于精确的距离度量学习。采用加权优化方法学习高质量的鲁棒相似图。在Ecoli、Yale和Yeast数据集上的综合实验表明，我们的方法在评价指标ACC、NMI和RI上分别平均提高了4%、5%和7%，优于现有的基于图的聚类方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Knowledge-Based Systems 工程技术-计算机：人工智能

CiteScore

14.80

自引率

12.50%

发文量

1245

审稿时长

7.8 months

期刊介绍： Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.