基于模糊粗糙集的缺失标签多标签学习矩阵因式分解算法

IF 3.2 1区 数学 Q2 COMPUTER SCIENCE, THEORY & METHODS
Jiang Deng , Degang Chen , Hui Wang , Ruifeng Shi
{"title":"基于模糊粗糙集的缺失标签多标签学习矩阵因式分解算法","authors":"Jiang Deng ,&nbsp;Degang Chen ,&nbsp;Hui Wang ,&nbsp;Ruifeng Shi","doi":"10.1016/j.fss.2024.109143","DOIUrl":null,"url":null,"abstract":"<div><div>In multi-label learning, samples of practical classification task may associated with multiple labels, it is challenging to acquire all labels of the training samples, the rapid expansion of the label space and the significant increase in annotation costs have exacerbated the issue of missing labels in multi-label learning. By utilizing the low rank structure of the label matrix, missing labels can be effectively recovered with matrix factorization technique. Nevertheless, current approaches disregard the potential correlation between the feature space and the multi-dimensional label data. In this paper, key features associated to different labels are extracted and then a non-negative matrix factorization algorithm is proposed for recover missing labels. Firstly, the fuzzy rough set theory is used to analyze the consistency between label matrix and feature space, potential feature information is employed to determine the latent variable dimension in non-negative matrix decomposition and the symbolic label matrix is also converted to a numerical one by means of the lower approximation operator. Then, feature-based manifold regularization and local label correlations are used to model the multi-label completion algorithm. In order to verify the effectiveness in dealing incomplete label data, comparison experiments with varying levels of missing values are designed, the experiments show that compared with the state-of-the-art algorithm, the proposed algorithm is effective in the completion of missing labels. In addition, the sensitivity analysis experiments also show that the proposed method has good stability.</div></div>","PeriodicalId":55130,"journal":{"name":"Fuzzy Sets and Systems","volume":null,"pages":null},"PeriodicalIF":3.2000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Matrix factorization algorithm for multi-label learning with missing labels based on fuzzy rough set\",\"authors\":\"Jiang Deng ,&nbsp;Degang Chen ,&nbsp;Hui Wang ,&nbsp;Ruifeng Shi\",\"doi\":\"10.1016/j.fss.2024.109143\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In multi-label learning, samples of practical classification task may associated with multiple labels, it is challenging to acquire all labels of the training samples, the rapid expansion of the label space and the significant increase in annotation costs have exacerbated the issue of missing labels in multi-label learning. By utilizing the low rank structure of the label matrix, missing labels can be effectively recovered with matrix factorization technique. Nevertheless, current approaches disregard the potential correlation between the feature space and the multi-dimensional label data. In this paper, key features associated to different labels are extracted and then a non-negative matrix factorization algorithm is proposed for recover missing labels. Firstly, the fuzzy rough set theory is used to analyze the consistency between label matrix and feature space, potential feature information is employed to determine the latent variable dimension in non-negative matrix decomposition and the symbolic label matrix is also converted to a numerical one by means of the lower approximation operator. Then, feature-based manifold regularization and local label correlations are used to model the multi-label completion algorithm. In order to verify the effectiveness in dealing incomplete label data, comparison experiments with varying levels of missing values are designed, the experiments show that compared with the state-of-the-art algorithm, the proposed algorithm is effective in the completion of missing labels. In addition, the sensitivity analysis experiments also show that the proposed method has good stability.</div></div>\",\"PeriodicalId\":55130,\"journal\":{\"name\":\"Fuzzy Sets and Systems\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Fuzzy Sets and Systems\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0165011424002896\",\"RegionNum\":1,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fuzzy Sets and Systems","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165011424002896","RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0

摘要

在多标签学习中,实际分类任务的样本可能与多个标签相关联,获取训练样本的所有标签具有挑战性,标签空间的快速扩展和注释成本的大幅增加加剧了多标签学习中的标签缺失问题。利用标签矩阵的低秩结构,可以通过矩阵因式分解技术有效地恢复缺失标签。然而,目前的方法忽略了特征空间与多维标签数据之间的潜在相关性。本文提取了与不同标签相关的关键特征,然后提出了一种非负矩阵因式分解算法来恢复丢失的标签。首先,利用模糊粗糙集理论分析标签矩阵与特征空间之间的一致性,利用潜在特征信息确定非负矩阵分解中的潜变量维度,并通过下近似算子将符号标签矩阵转换为数值矩阵。然后,基于特征的流形正则化和局部标签相关性被用于多标签完成算法的建模。为了验证处理不完整标签数据的有效性,设计了不同缺失值水平的对比实验,实验结果表明,与最先进的算法相比,所提出的算法在完成缺失标签方面是有效的。此外,灵敏度分析实验还表明,所提出的方法具有良好的稳定性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Matrix factorization algorithm for multi-label learning with missing labels based on fuzzy rough set
In multi-label learning, samples of practical classification task may associated with multiple labels, it is challenging to acquire all labels of the training samples, the rapid expansion of the label space and the significant increase in annotation costs have exacerbated the issue of missing labels in multi-label learning. By utilizing the low rank structure of the label matrix, missing labels can be effectively recovered with matrix factorization technique. Nevertheless, current approaches disregard the potential correlation between the feature space and the multi-dimensional label data. In this paper, key features associated to different labels are extracted and then a non-negative matrix factorization algorithm is proposed for recover missing labels. Firstly, the fuzzy rough set theory is used to analyze the consistency between label matrix and feature space, potential feature information is employed to determine the latent variable dimension in non-negative matrix decomposition and the symbolic label matrix is also converted to a numerical one by means of the lower approximation operator. Then, feature-based manifold regularization and local label correlations are used to model the multi-label completion algorithm. In order to verify the effectiveness in dealing incomplete label data, comparison experiments with varying levels of missing values are designed, the experiments show that compared with the state-of-the-art algorithm, the proposed algorithm is effective in the completion of missing labels. In addition, the sensitivity analysis experiments also show that the proposed method has good stability.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Fuzzy Sets and Systems
Fuzzy Sets and Systems 数学-计算机:理论方法
CiteScore
6.50
自引率
17.90%
发文量
321
审稿时长
6.1 months
期刊介绍: Since its launching in 1978, the journal Fuzzy Sets and Systems has been devoted to the international advancement of the theory and application of fuzzy sets and systems. The theory of fuzzy sets now encompasses a well organized corpus of basic notions including (and not restricted to) aggregation operations, a generalized theory of relations, specific measures of information content, a calculus of fuzzy numbers. Fuzzy sets are also the cornerstone of a non-additive uncertainty theory, namely possibility theory, and of a versatile tool for both linguistic and numerical modeling: fuzzy rule-based systems. Numerous works now combine fuzzy concepts with other scientific disciplines as well as modern technologies. In mathematics fuzzy sets have triggered new research topics in connection with category theory, topology, algebra, analysis. Fuzzy sets are also part of a recent trend in the study of generalized measures and integrals, and are combined with statistical methods. Furthermore, fuzzy sets have strong logical underpinnings in the tradition of many-valued logics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信