具有特权信息的半监督流形正则化多任务学习

IF 8.1 1区 计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS
Bo Liu , Baoqing Li , Yanshan Xiao , Zhitong Wang , Boxu Zhou , Shengxin He , Chenlong Ye , Fan Cao
{"title":"具有特权信息的半监督流形正则化多任务学习","authors":"Bo Liu ,&nbsp;Baoqing Li ,&nbsp;Yanshan Xiao ,&nbsp;Zhitong Wang ,&nbsp;Boxu Zhou ,&nbsp;Shengxin He ,&nbsp;Chenlong Ye ,&nbsp;Fan Cao","doi":"10.1016/j.ins.2025.122112","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-task learning (MTL) represents an advanced learning paradigm that improves the generalization ability and learning efficiency of a model by learning multiple related tasks simultaneously. The fundamental principle of multi-task learning is the transfer of information between tasks. Nevertheless where data is limited in quantity, effectively modeling inter-task correlations is a significant challenge. We propose a novel method, semi-supervised manifold regularized multi-task learning with privileged information (MSMTL-PI), that effectively leverages the intrinsic geometric structure of data by enforcing manifold regularization and subspace learning techniques. Specifically, a similarity graph is constructed over both labeled and unlabeled samples, ensuring the preservation of local geometric relationships between data points, and manifold regularization is applied as a constraint. Concurrently, information sharing on low-dimensional subspace makes the relationship modeling between tasks more reasonable. Furthermore, a significant amount of privileged information is incorporated into the training phase, thereby optimizing the decision boundary and reducing the impact of insufficient labeled samples on the model. There is substantial experimental evidence that MSMTL-PI markedly enhances the performance of image and text classification tasks, achieving superior classification accuracy with minimal labeled data. Across 15 benchmark datasets, MSMTL-PI consistently outperforms existing methods, achieving an average F1-scores improvement of 1.92% compared to the best baseline, with a maximum gain of 4.17%.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"711 ","pages":"Article 122112"},"PeriodicalIF":8.1000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Semi-supervised manifold regularized multi-task learning with privileged information\",\"authors\":\"Bo Liu ,&nbsp;Baoqing Li ,&nbsp;Yanshan Xiao ,&nbsp;Zhitong Wang ,&nbsp;Boxu Zhou ,&nbsp;Shengxin He ,&nbsp;Chenlong Ye ,&nbsp;Fan Cao\",\"doi\":\"10.1016/j.ins.2025.122112\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Multi-task learning (MTL) represents an advanced learning paradigm that improves the generalization ability and learning efficiency of a model by learning multiple related tasks simultaneously. The fundamental principle of multi-task learning is the transfer of information between tasks. Nevertheless where data is limited in quantity, effectively modeling inter-task correlations is a significant challenge. We propose a novel method, semi-supervised manifold regularized multi-task learning with privileged information (MSMTL-PI), that effectively leverages the intrinsic geometric structure of data by enforcing manifold regularization and subspace learning techniques. Specifically, a similarity graph is constructed over both labeled and unlabeled samples, ensuring the preservation of local geometric relationships between data points, and manifold regularization is applied as a constraint. Concurrently, information sharing on low-dimensional subspace makes the relationship modeling between tasks more reasonable. Furthermore, a significant amount of privileged information is incorporated into the training phase, thereby optimizing the decision boundary and reducing the impact of insufficient labeled samples on the model. There is substantial experimental evidence that MSMTL-PI markedly enhances the performance of image and text classification tasks, achieving superior classification accuracy with minimal labeled data. Across 15 benchmark datasets, MSMTL-PI consistently outperforms existing methods, achieving an average F1-scores improvement of 1.92% compared to the best baseline, with a maximum gain of 4.17%.</div></div>\",\"PeriodicalId\":51063,\"journal\":{\"name\":\"Information Sciences\",\"volume\":\"711 \",\"pages\":\"Article 122112\"},\"PeriodicalIF\":8.1000,\"publicationDate\":\"2025-03-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Sciences\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0020025525002440\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025525002440","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

多任务学习(Multi-task learning, MTL)是一种通过同时学习多个相关任务来提高模型泛化能力和学习效率的高级学习范式。多任务学习的基本原理是任务之间的信息传递。然而,在数据数量有限的情况下,有效地建模任务间相关性是一个重大挑战。本文提出了一种基于特权信息的半监督流形正则化多任务学习方法(MSMTL-PI),该方法通过强化流形正则化和子空间学习技术,有效地利用了数据的内在几何结构。具体而言,在标记和未标记的样本上构建相似图,确保数据点之间局部几何关系的保留,并应用流形正则化作为约束。同时,在低维子空间上的信息共享使得任务间的关系建模更加合理。此外,在训练阶段引入了大量的特权信息,从而优化了决策边界,减少了标记样本不足对模型的影响。大量实验证据表明,MSMTL-PI显著提高了图像和文本分类任务的性能,以最少的标记数据实现了优异的分类精度。在15个基准数据集中,MSMTL-PI始终优于现有方法,与最佳基线相比,平均f1分数提高了1.92%,最大增益为4.17%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Semi-supervised manifold regularized multi-task learning with privileged information
Multi-task learning (MTL) represents an advanced learning paradigm that improves the generalization ability and learning efficiency of a model by learning multiple related tasks simultaneously. The fundamental principle of multi-task learning is the transfer of information between tasks. Nevertheless where data is limited in quantity, effectively modeling inter-task correlations is a significant challenge. We propose a novel method, semi-supervised manifold regularized multi-task learning with privileged information (MSMTL-PI), that effectively leverages the intrinsic geometric structure of data by enforcing manifold regularization and subspace learning techniques. Specifically, a similarity graph is constructed over both labeled and unlabeled samples, ensuring the preservation of local geometric relationships between data points, and manifold regularization is applied as a constraint. Concurrently, information sharing on low-dimensional subspace makes the relationship modeling between tasks more reasonable. Furthermore, a significant amount of privileged information is incorporated into the training phase, thereby optimizing the decision boundary and reducing the impact of insufficient labeled samples on the model. There is substantial experimental evidence that MSMTL-PI markedly enhances the performance of image and text classification tasks, achieving superior classification accuracy with minimal labeled data. Across 15 benchmark datasets, MSMTL-PI consistently outperforms existing methods, achieving an average F1-scores improvement of 1.92% compared to the best baseline, with a maximum gain of 4.17%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Information Sciences
Information Sciences 工程技术-计算机:信息系统
CiteScore
14.00
自引率
17.30%
发文量
1322
审稿时长
10.4 months
期刊介绍: Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions. Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信