Dawei Zhao , Hong Li , Yixiang Lu , De Zhu , Qingwei Gao
{"title":"基于锚点的图嵌入与软标签学习的缺失标签多标签分类","authors":"Dawei Zhao , Hong Li , Yixiang Lu , De Zhu , Qingwei Gao","doi":"10.1016/j.eswa.2025.129019","DOIUrl":null,"url":null,"abstract":"<div><div>Despite progress in multi-label learning, handling missing labels in weakly annotated data remains challenging. The graph-based method utilizes sample structure information to promote semantic reconstruction to recover missing labels, and it has achieved satisfactory performance. However, the existing graph-based soft label semantic reconstruction methods still have the following problems with missing label classification learning. Existing methods face two limitations: (a) the separate construction of instance similarity graphs and execution of classification tasks results in suboptimal similarity matrices for classification; (b) the high computational cost incurred by some adaptive <span><math><mi>k</mi></math></span>-connected graph learning methods when constructing <span><math><mi>k</mi></math></span>-nearest neighbor similarity matrices hinders their practical application. In addition, the complex label correlations further increase the difficulty of accurately predicting all possible labels. In this paper, we propose a joint bipartite graph and soft label learning for the multi-label missing label classification method. First, we obtain the anchor of the samples based on <span><math><mi>k</mi></math></span>-means clustering and construct a sparse bipartite graph similarity matrix with <span><math><mi>k</mi></math></span>-nearest neighbor connected components by using the anchor and the original samples as the weight of the bipartite graph. Second, sparse high-rank and high-order label correlation learning based on landmark selection is provided to obtain soft labels for the missing labels. Finally, learn multi-label classification and label correlations jointly in a unified framework with kernel techniques to solve the linear inseparability of the data. Experimental results on several real-world benchmark multi-label datasets demonstrate the competitiveness and effectiveness of our proposed method compared with the state-of-the-art approach.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"296 ","pages":"Article 129019"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Anchor-based graph embedding and soft label learning for multi-label classification with missing label\",\"authors\":\"Dawei Zhao , Hong Li , Yixiang Lu , De Zhu , Qingwei Gao\",\"doi\":\"10.1016/j.eswa.2025.129019\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Despite progress in multi-label learning, handling missing labels in weakly annotated data remains challenging. The graph-based method utilizes sample structure information to promote semantic reconstruction to recover missing labels, and it has achieved satisfactory performance. However, the existing graph-based soft label semantic reconstruction methods still have the following problems with missing label classification learning. Existing methods face two limitations: (a) the separate construction of instance similarity graphs and execution of classification tasks results in suboptimal similarity matrices for classification; (b) the high computational cost incurred by some adaptive <span><math><mi>k</mi></math></span>-connected graph learning methods when constructing <span><math><mi>k</mi></math></span>-nearest neighbor similarity matrices hinders their practical application. In addition, the complex label correlations further increase the difficulty of accurately predicting all possible labels. In this paper, we propose a joint bipartite graph and soft label learning for the multi-label missing label classification method. First, we obtain the anchor of the samples based on <span><math><mi>k</mi></math></span>-means clustering and construct a sparse bipartite graph similarity matrix with <span><math><mi>k</mi></math></span>-nearest neighbor connected components by using the anchor and the original samples as the weight of the bipartite graph. Second, sparse high-rank and high-order label correlation learning based on landmark selection is provided to obtain soft labels for the missing labels. Finally, learn multi-label classification and label correlations jointly in a unified framework with kernel techniques to solve the linear inseparability of the data. Experimental results on several real-world benchmark multi-label datasets demonstrate the competitiveness and effectiveness of our proposed method compared with the state-of-the-art approach.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"296 \",\"pages\":\"Article 129019\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425026363\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425026363","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Anchor-based graph embedding and soft label learning for multi-label classification with missing label
Despite progress in multi-label learning, handling missing labels in weakly annotated data remains challenging. The graph-based method utilizes sample structure information to promote semantic reconstruction to recover missing labels, and it has achieved satisfactory performance. However, the existing graph-based soft label semantic reconstruction methods still have the following problems with missing label classification learning. Existing methods face two limitations: (a) the separate construction of instance similarity graphs and execution of classification tasks results in suboptimal similarity matrices for classification; (b) the high computational cost incurred by some adaptive -connected graph learning methods when constructing -nearest neighbor similarity matrices hinders their practical application. In addition, the complex label correlations further increase the difficulty of accurately predicting all possible labels. In this paper, we propose a joint bipartite graph and soft label learning for the multi-label missing label classification method. First, we obtain the anchor of the samples based on -means clustering and construct a sparse bipartite graph similarity matrix with -nearest neighbor connected components by using the anchor and the original samples as the weight of the bipartite graph. Second, sparse high-rank and high-order label correlation learning based on landmark selection is provided to obtain soft labels for the missing labels. Finally, learn multi-label classification and label correlations jointly in a unified framework with kernel techniques to solve the linear inseparability of the data. Experimental results on several real-world benchmark multi-label datasets demonstrate the competitiveness and effectiveness of our proposed method compared with the state-of-the-art approach.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.