基于锚点的图嵌入与软标签学习的缺失标签多标签分类

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Dawei Zhao , Hong Li , Yixiang Lu , De Zhu , Qingwei Gao
{"title":"基于锚点的图嵌入与软标签学习的缺失标签多标签分类","authors":"Dawei Zhao ,&nbsp;Hong Li ,&nbsp;Yixiang Lu ,&nbsp;De Zhu ,&nbsp;Qingwei Gao","doi":"10.1016/j.eswa.2025.129019","DOIUrl":null,"url":null,"abstract":"<div><div>Despite progress in multi-label learning, handling missing labels in weakly annotated data remains challenging. The graph-based method utilizes sample structure information to promote semantic reconstruction to recover missing labels, and it has achieved satisfactory performance. However, the existing graph-based soft label semantic reconstruction methods still have the following problems with missing label classification learning. Existing methods face two limitations: (a) the separate construction of instance similarity graphs and execution of classification tasks results in suboptimal similarity matrices for classification; (b) the high computational cost incurred by some adaptive <span><math><mi>k</mi></math></span>-connected graph learning methods when constructing <span><math><mi>k</mi></math></span>-nearest neighbor similarity matrices hinders their practical application. In addition, the complex label correlations further increase the difficulty of accurately predicting all possible labels. In this paper, we propose a joint bipartite graph and soft label learning for the multi-label missing label classification method. First, we obtain the anchor of the samples based on <span><math><mi>k</mi></math></span>-means clustering and construct a sparse bipartite graph similarity matrix with <span><math><mi>k</mi></math></span>-nearest neighbor connected components by using the anchor and the original samples as the weight of the bipartite graph. Second, sparse high-rank and high-order label correlation learning based on landmark selection is provided to obtain soft labels for the missing labels. Finally, learn multi-label classification and label correlations jointly in a unified framework with kernel techniques to solve the linear inseparability of the data. Experimental results on several real-world benchmark multi-label datasets demonstrate the competitiveness and effectiveness of our proposed method compared with the state-of-the-art approach.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"296 ","pages":"Article 129019"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Anchor-based graph embedding and soft label learning for multi-label classification with missing label\",\"authors\":\"Dawei Zhao ,&nbsp;Hong Li ,&nbsp;Yixiang Lu ,&nbsp;De Zhu ,&nbsp;Qingwei Gao\",\"doi\":\"10.1016/j.eswa.2025.129019\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Despite progress in multi-label learning, handling missing labels in weakly annotated data remains challenging. The graph-based method utilizes sample structure information to promote semantic reconstruction to recover missing labels, and it has achieved satisfactory performance. However, the existing graph-based soft label semantic reconstruction methods still have the following problems with missing label classification learning. Existing methods face two limitations: (a) the separate construction of instance similarity graphs and execution of classification tasks results in suboptimal similarity matrices for classification; (b) the high computational cost incurred by some adaptive <span><math><mi>k</mi></math></span>-connected graph learning methods when constructing <span><math><mi>k</mi></math></span>-nearest neighbor similarity matrices hinders their practical application. In addition, the complex label correlations further increase the difficulty of accurately predicting all possible labels. In this paper, we propose a joint bipartite graph and soft label learning for the multi-label missing label classification method. First, we obtain the anchor of the samples based on <span><math><mi>k</mi></math></span>-means clustering and construct a sparse bipartite graph similarity matrix with <span><math><mi>k</mi></math></span>-nearest neighbor connected components by using the anchor and the original samples as the weight of the bipartite graph. Second, sparse high-rank and high-order label correlation learning based on landmark selection is provided to obtain soft labels for the missing labels. Finally, learn multi-label classification and label correlations jointly in a unified framework with kernel techniques to solve the linear inseparability of the data. Experimental results on several real-world benchmark multi-label datasets demonstrate the competitiveness and effectiveness of our proposed method compared with the state-of-the-art approach.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"296 \",\"pages\":\"Article 129019\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425026363\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425026363","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

尽管在多标签学习方面取得了进展,但处理弱注释数据中的缺失标签仍然具有挑战性。基于图的方法利用样本结构信息促进语义重构来恢复缺失标签,取得了令人满意的效果。然而,现有的基于图的软标签语义重构方法仍然存在缺少标签分类学习的问题。现有方法存在两个局限性:(a)实例相似图的构建和分类任务的执行分开导致分类相似矩阵的次优;(b)一些自适应k连通图学习方法在构造k近邻相似矩阵时产生的高计算成本阻碍了它们的实际应用。此外,复杂的标签相关性进一步增加了准确预测所有可能标签的难度。本文提出了一种结合二部图和软标签学习的多标签缺失标签分类方法。首先,我们基于k-means聚类获得样本的锚点,并以锚点和原始样本作为二部图的权值,构造具有k个最近邻连通分量的稀疏二部图相似矩阵。其次,提供基于地标选择的稀疏高阶高阶标签相关学习,对缺失标签进行软标签获取;最后,结合核技术在统一框架下学习多标签分类和标签关联,解决数据的线性不可分性问题。在几个真实世界的基准多标签数据集上的实验结果表明,与最先进的方法相比,我们提出的方法具有竞争力和有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Anchor-based graph embedding and soft label learning for multi-label classification with missing label
Despite progress in multi-label learning, handling missing labels in weakly annotated data remains challenging. The graph-based method utilizes sample structure information to promote semantic reconstruction to recover missing labels, and it has achieved satisfactory performance. However, the existing graph-based soft label semantic reconstruction methods still have the following problems with missing label classification learning. Existing methods face two limitations: (a) the separate construction of instance similarity graphs and execution of classification tasks results in suboptimal similarity matrices for classification; (b) the high computational cost incurred by some adaptive k-connected graph learning methods when constructing k-nearest neighbor similarity matrices hinders their practical application. In addition, the complex label correlations further increase the difficulty of accurately predicting all possible labels. In this paper, we propose a joint bipartite graph and soft label learning for the multi-label missing label classification method. First, we obtain the anchor of the samples based on k-means clustering and construct a sparse bipartite graph similarity matrix with k-nearest neighbor connected components by using the anchor and the original samples as the weight of the bipartite graph. Second, sparse high-rank and high-order label correlation learning based on landmark selection is provided to obtain soft labels for the missing labels. Finally, learn multi-label classification and label correlations jointly in a unified framework with kernel techniques to solve the linear inseparability of the data. Experimental results on several real-world benchmark multi-label datasets demonstrate the competitiveness and effectiveness of our proposed method compared with the state-of-the-art approach.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Expert Systems with Applications
Expert Systems with Applications 工程技术-工程:电子与电气
CiteScore
13.80
自引率
10.60%
发文量
2045
审稿时长
8.7 months
期刊介绍: Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信