{"title":"Semi-supervised label distribution learning via global factorization and local constrain","authors":"Peiqiu Yu, Xiuyi Jia","doi":"10.1016/j.neucom.2025.131024","DOIUrl":null,"url":null,"abstract":"<div><div>In label distribution learning, properly handling samples with missing label distributions is a particularly challenging task. When dealing with unlabeled samples, leveraging correlation is especially crucial as it reveals the intrinsic patterns of data distribution and effectively reduces the model’s hypothesis space. Currently, semi-supervised label distribution learning follows the same correlation mining methods as those used under complete supervision. However, due to the lack of supervision information for some samples, these methods designed for complete supervision are insufficient in a semi-supervised context. On one hand, the absence of labels for some samples makes it difficult to mine label correlations; on the other hand, label correlations mined solely based on samples are biased, leading to imprecise label correlations due to the missing labels. To address these issues, this paper innovatively proposes two strategies for mining label correlations in semi-supervised label distribution learning: first, exploring the common correlations between known and unknown label distributions; second, using the information of known label distributions to reveal the correlations of unknown label distributions. Specifically, globally, we employ independent component analysis for matrix completion of missing sample labels, and locally, we improve the <span><math><mi>k</mi></math></span>-NN framework to utilize the label constraints of known label distributions to restrict the label distribution values of unknown label distributions. Based on these mined correlations, we designed a semi-supervised label distribution learning algorithm. The algorithm outperforms existing methods in 67.27 % of cases, achieving outstanding performance, and demonstrates significant statistical significance in two-sample <span><math><mi>t</mi></math></span>-tests.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"652 ","pages":"Article 131024"},"PeriodicalIF":5.5000,"publicationDate":"2025-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225016960","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
In label distribution learning, properly handling samples with missing label distributions is a particularly challenging task. When dealing with unlabeled samples, leveraging correlation is especially crucial as it reveals the intrinsic patterns of data distribution and effectively reduces the model’s hypothesis space. Currently, semi-supervised label distribution learning follows the same correlation mining methods as those used under complete supervision. However, due to the lack of supervision information for some samples, these methods designed for complete supervision are insufficient in a semi-supervised context. On one hand, the absence of labels for some samples makes it difficult to mine label correlations; on the other hand, label correlations mined solely based on samples are biased, leading to imprecise label correlations due to the missing labels. To address these issues, this paper innovatively proposes two strategies for mining label correlations in semi-supervised label distribution learning: first, exploring the common correlations between known and unknown label distributions; second, using the information of known label distributions to reveal the correlations of unknown label distributions. Specifically, globally, we employ independent component analysis for matrix completion of missing sample labels, and locally, we improve the -NN framework to utilize the label constraints of known label distributions to restrict the label distribution values of unknown label distributions. Based on these mined correlations, we designed a semi-supervised label distribution learning algorithm. The algorithm outperforms existing methods in 67.27 % of cases, achieving outstanding performance, and demonstrates significant statistical significance in two-sample -tests.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.