{"title":"Learning embedded label-specific features for partial multi-label learning","authors":"Xiaohan Xu , Hao Wang , Jialu Yao , Zan Zhang","doi":"10.1016/j.patcog.2025.112005","DOIUrl":null,"url":null,"abstract":"<div><div>Partial multi-label learning (PML) aims to learn from instances with weak supervision, where each instance is associated with a set of candidate labels, among which only a subset is valid. Most existing approaches rely on identical feature representations to distinguish all class labels, overlooking the inherent distinctiveness of different labels, which leads to suboptimal model performance. Although recent studies have attempted to address this limitation by tailoring label-specific features, critical shortcomings remain: (1) isolated processing of feature tailoring and label disambiguation fails to leverage their synergistic relationship, and (2) direct extraction of label-specific features from the original feature space tends to yield unreliable results due to inherent noise and disturbances. This paper proposes a unified PML framework that jointly performs label disambiguation, embedded label-specific feature learning, and model induction. Within this framework, identifying ground-truth labels and generating label-specific features mutually reinforce each other, leading to continuous refinement. By customizing features from a compact and noise-free embedded space, the framework further ensures robustness and reliability in learning. Specifically, low-rank and sparse decomposition is employed to separate ground-truth labels from noisy ones, while a linear embedding discriminant model simultaneously generates embedded label-specific features and induces the model. Moreover, we enhance the classifier’s accuracy by assuming that the input and output spaces share local geometric structures, encouraging similar instances to have similar label sets. Extensive experiments on sixty-six real-world and synthetic datasets demonstrate that the proposed approach significantly outperforms state-of-the-art methods.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"170 ","pages":"Article 112005"},"PeriodicalIF":7.5000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S003132032500665X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Partial multi-label learning (PML) aims to learn from instances with weak supervision, where each instance is associated with a set of candidate labels, among which only a subset is valid. Most existing approaches rely on identical feature representations to distinguish all class labels, overlooking the inherent distinctiveness of different labels, which leads to suboptimal model performance. Although recent studies have attempted to address this limitation by tailoring label-specific features, critical shortcomings remain: (1) isolated processing of feature tailoring and label disambiguation fails to leverage their synergistic relationship, and (2) direct extraction of label-specific features from the original feature space tends to yield unreliable results due to inherent noise and disturbances. This paper proposes a unified PML framework that jointly performs label disambiguation, embedded label-specific feature learning, and model induction. Within this framework, identifying ground-truth labels and generating label-specific features mutually reinforce each other, leading to continuous refinement. By customizing features from a compact and noise-free embedded space, the framework further ensures robustness and reliability in learning. Specifically, low-rank and sparse decomposition is employed to separate ground-truth labels from noisy ones, while a linear embedding discriminant model simultaneously generates embedded label-specific features and induces the model. Moreover, we enhance the classifier’s accuracy by assuming that the input and output spaces share local geometric structures, encouraging similar instances to have similar label sets. Extensive experiments on sixty-six real-world and synthetic datasets demonstrate that the proposed approach significantly outperforms state-of-the-art methods.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.