Songtao Li , Qiancheng Wang , MengJie Luo , Yang Li , Chang Tang
{"title":"嵌入非负矩阵因式分解的自监督星图优化","authors":"Songtao Li , Qiancheng Wang , MengJie Luo , Yang Li , Chang Tang","doi":"10.1016/j.ipm.2024.103969","DOIUrl":null,"url":null,"abstract":"<div><div>Labeling expensive and graph structure fuzziness are recognized as indispensable prerequisites for solving practical problems in semi-supervised graph learning. This paper proposes a novel approach: a non-negative matrix factorization algorithm based on self-supervised star graph optimal embedding, utilizing the progressive spontaneous strategy of anchor graphs. The model considers the feature assignment rules in unlabeled samples and constructs a corresponding probabilistic extension model to extract pseudo-labeled information from the samples. It also constructs self-supervised hard constraints accordingly to enhance the learning process. In addition, inspired by the graph structure filter, we propose a star graph optimization method. It smooths the association relationships between nodes in the graph structure and improves the accuracy of the graph regularization term in describing the association relationships of the original data. Finally, we give the objective function of the model with the multiplicative update rule and analyze the convergence of the algorithm under this rule. Clustering experiments on several standard image datasets and electroencephalography datasets show that the proposed algorithm improves over the current state-of-the-art benchmark algorithms by 6.9% on average. This indicates that the proposed model has excellent self-supervised label discovery and data representation capabilities.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 2","pages":"Article 103969"},"PeriodicalIF":7.4000,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Self-supervised star graph optimization embedding non-negative matrix factorization\",\"authors\":\"Songtao Li , Qiancheng Wang , MengJie Luo , Yang Li , Chang Tang\",\"doi\":\"10.1016/j.ipm.2024.103969\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Labeling expensive and graph structure fuzziness are recognized as indispensable prerequisites for solving practical problems in semi-supervised graph learning. This paper proposes a novel approach: a non-negative matrix factorization algorithm based on self-supervised star graph optimal embedding, utilizing the progressive spontaneous strategy of anchor graphs. The model considers the feature assignment rules in unlabeled samples and constructs a corresponding probabilistic extension model to extract pseudo-labeled information from the samples. It also constructs self-supervised hard constraints accordingly to enhance the learning process. In addition, inspired by the graph structure filter, we propose a star graph optimization method. It smooths the association relationships between nodes in the graph structure and improves the accuracy of the graph regularization term in describing the association relationships of the original data. Finally, we give the objective function of the model with the multiplicative update rule and analyze the convergence of the algorithm under this rule. Clustering experiments on several standard image datasets and electroencephalography datasets show that the proposed algorithm improves over the current state-of-the-art benchmark algorithms by 6.9% on average. This indicates that the proposed model has excellent self-supervised label discovery and data representation capabilities.</div></div>\",\"PeriodicalId\":50365,\"journal\":{\"name\":\"Information Processing & Management\",\"volume\":\"62 2\",\"pages\":\"Article 103969\"},\"PeriodicalIF\":7.4000,\"publicationDate\":\"2024-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Processing & Management\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0306457324003285\",\"RegionNum\":1,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457324003285","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Self-supervised star graph optimization embedding non-negative matrix factorization
Labeling expensive and graph structure fuzziness are recognized as indispensable prerequisites for solving practical problems in semi-supervised graph learning. This paper proposes a novel approach: a non-negative matrix factorization algorithm based on self-supervised star graph optimal embedding, utilizing the progressive spontaneous strategy of anchor graphs. The model considers the feature assignment rules in unlabeled samples and constructs a corresponding probabilistic extension model to extract pseudo-labeled information from the samples. It also constructs self-supervised hard constraints accordingly to enhance the learning process. In addition, inspired by the graph structure filter, we propose a star graph optimization method. It smooths the association relationships between nodes in the graph structure and improves the accuracy of the graph regularization term in describing the association relationships of the original data. Finally, we give the objective function of the model with the multiplicative update rule and analyze the convergence of the algorithm under this rule. Clustering experiments on several standard image datasets and electroencephalography datasets show that the proposed algorithm improves over the current state-of-the-art benchmark algorithms by 6.9% on average. This indicates that the proposed model has excellent self-supervised label discovery and data representation capabilities.
期刊介绍:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.