{"title":"A novel study of kernel graph regularized semi-non-negative matrix factorization with orthogonal subspace for clustering","authors":"Yasong Chen , Wen Li, Junjian Zhao","doi":"10.1016/j.bdr.2025.100531","DOIUrl":null,"url":null,"abstract":"<div><div>As a nonlinear extension of Non-negative Matrix Factorization (NMF), Kernel Non-negative Matrix Factorization (KNMF) has demonstrated greater effectiveness in revealing latent features from raw data. Building on this, this paper introduces kernel theory and effectively combines the advantages of semi-nonnegative constraints, graph regularization, and orthogonal subspace constraints to propose a novel model-Kernel Graph Regularized Semi-Negative Matrix Factorization with Orthogonal Subspaces and Auxiliary Variables (semi-KGNMFOSV). This model introduces auxiliary variables and reformulates the optimization problem, successfully overcoming the convergence proof challenges typically associated with orthogonal subspace-constrained methods. Furthermore, the model utilizes kernel methods to effectively capture complex nonlinear structures in the data. The semi-nonnegative constraint, along with orthogonal subspace constraints incorporating auxiliary variables, enhances optimization efficiency, while graph regularization preserves the local geometric structure of the data. We develop an efficient optimization algorithm to solve the proposed model and conduct extensive experiments on multiple real-world datasets. Additionally, we investigate the impact of three different initialization strategies on the performance of the proposed algorithm. Experimental results demonstrate that, compared to classical and state-of-the-art methods, the proposed model exhibits superior performance across all three initialization strategies.</div></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"40 ","pages":"Article 100531"},"PeriodicalIF":3.5000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Big Data Research","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214579625000267","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
As a nonlinear extension of Non-negative Matrix Factorization (NMF), Kernel Non-negative Matrix Factorization (KNMF) has demonstrated greater effectiveness in revealing latent features from raw data. Building on this, this paper introduces kernel theory and effectively combines the advantages of semi-nonnegative constraints, graph regularization, and orthogonal subspace constraints to propose a novel model-Kernel Graph Regularized Semi-Negative Matrix Factorization with Orthogonal Subspaces and Auxiliary Variables (semi-KGNMFOSV). This model introduces auxiliary variables and reformulates the optimization problem, successfully overcoming the convergence proof challenges typically associated with orthogonal subspace-constrained methods. Furthermore, the model utilizes kernel methods to effectively capture complex nonlinear structures in the data. The semi-nonnegative constraint, along with orthogonal subspace constraints incorporating auxiliary variables, enhances optimization efficiency, while graph regularization preserves the local geometric structure of the data. We develop an efficient optimization algorithm to solve the proposed model and conduct extensive experiments on multiple real-world datasets. Additionally, we investigate the impact of three different initialization strategies on the performance of the proposed algorithm. Experimental results demonstrate that, compared to classical and state-of-the-art methods, the proposed model exhibits superior performance across all three initialization strategies.
期刊介绍:
The journal aims to promote and communicate advances in big data research by providing a fast and high quality forum for researchers, practitioners and policy makers from the very many different communities working on, and with, this topic.
The journal will accept papers on foundational aspects in dealing with big data, as well as papers on specific Platforms and Technologies used to deal with big data. To promote Data Science and interdisciplinary collaboration between fields, and to showcase the benefits of data driven research, papers demonstrating applications of big data in domains as diverse as Geoscience, Social Web, Finance, e-Commerce, Health Care, Environment and Climate, Physics and Astronomy, Chemistry, life sciences and drug discovery, digital libraries and scientific publications, security and government will also be considered. Occasionally the journal may publish whitepapers on policies, standards and best practices.