Coordinate descent for top-k multi-label feature selection with pseudo-label learning and manifold learning

Ruijia Li, Yingcang Ma, Hong Chen, Xiaofei Yang, Zhiwei Xing

Neurocomputing, Volume 658 (2025), Article 131640. DOI: 10.1016/j.neucom.2025.131640
Citations: 0
Abstract
Multi-label learning plays an increasingly important role in handling complex problems where data instances are associated with multiple labels. However, current methods face significant limitations when dealing with high-dimensional feature spaces: they struggle to preserve the geometric structure among features and fail to fully exploit the latent correlations between labels. To address these key challenges, this paper proposes a novel feature selection method called coordinate descent for top-k multi-label feature selection with pseudo-label learning and manifold learning (CD-MPL), which integrates manifold learning with pseudo-label learning techniques. First, by constructing a feature graph Laplacian matrix, we establish a mathematical representation of the feature manifold structure, effectively preserving the local geometric properties of the feature space. Second, we introduce a pseudo-label learning mechanism that converts discrete binary labels into continuous representations to better model complex label correlations. Notably, to tackle the non-convex optimization problem caused by the ℓ2,0-norm constraint, we transform the original problem into the joint optimization of a continuous matrix and a discrete selection matrix. We then employ a coordinate descent (CD) method to efficiently solve for the selection matrix, overcoming the non-convexity issue while enhancing model performance, interpretability, and practicality. Experimental results on ten multi-label datasets demonstrate that CD-MPL significantly outperforms existing methods across multiple key evaluation metrics, achieving an average performance improvement of 3.31%. The algorithm maintains stable performance even with reduced feature subsets and converges rapidly within 10 iterations, validating its efficiency and effectiveness in multi-label classification tasks.
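To make the ideas in the abstract more concrete, the sketch below illustrates two of its ingredients in plain NumPy: a graph Laplacian built over features (the columns of the data matrix) and a coordinate-descent-style search over a binary top-k feature support. This is a minimal illustration under assumed choices, not the paper's method: the squared loss, the ridge term, the tr(WᵀLW) feature-manifold penalty, the swap-based update, and all names (X, Y, k, alpha, n_neighbors) are expository assumptions, and in CD-MPL the target Y would be a learned continuous pseudo-label matrix rather than the raw binary labels.

```python
import numpy as np

def feature_graph_laplacian(X, n_neighbors=5):
    """Laplacian over features: nodes are the columns of X, edges are kNN cosine similarities."""
    Xn = X / (np.linalg.norm(X, axis=0, keepdims=True) + 1e-12)
    S = Xn.T @ Xn                                   # (d, d) feature-feature cosine similarity
    np.fill_diagonal(S, 0.0)
    idx = np.argsort(-S, axis=1)[:, :n_neighbors]   # keep each feature's nearest neighbours
    W = np.zeros_like(S)
    rows = np.repeat(np.arange(S.shape[0]), n_neighbors)
    W[rows, idx.ravel()] = S[rows, idx.ravel()]
    W = np.maximum(W, W.T)                          # symmetrise the kNN graph
    return np.diag(W.sum(axis=1)) - W               # L = D - W

def selection_objective(X, Y, selected, L_feat, alpha=0.1, ridge=1e-3):
    """Ridge fit of Y from the selected columns plus an illustrative manifold penalty tr(W^T L W)."""
    Xs = X[:, selected]
    W = np.linalg.solve(Xs.T @ Xs + ridge * np.eye(len(selected)), Xs.T @ Y)
    Ls = L_feat[np.ix_(selected, selected)]
    return np.linalg.norm(Y - Xs @ W) ** 2 + alpha * np.trace(W.T @ Ls @ W)

def top_k_coordinate_descent(X, Y, k, n_iter=10, n_neighbors=5):
    """Coordinate-descent-style swaps over a binary support of size exactly k."""
    L_feat = feature_graph_laplacian(X, n_neighbors)
    d = X.shape[1]
    selected = list(range(k))                       # arbitrary initial top-k support
    best = selection_objective(X, Y, selected, L_feat)
    for _ in range(n_iter):
        improved = False
        for i in list(selected):                    # try swapping each kept feature ...
            for j in range(d):                      # ... for each discarded one
                if j in selected:
                    continue
                cand = [j if f == i else f for f in selected]
                val = selection_objective(X, Y, cand, L_feat)
                if val < best:
                    best, selected, improved = val, cand, True
                    break
        if not improved:                            # no swap helped: a local optimum
            break
    return sorted(selected), best

# Toy usage: 60 instances, 20 features, 4 labels stored as a float-valued matrix.
rng = np.random.default_rng(0)
X = rng.standard_normal((60, 20))
Y = (rng.standard_normal((60, 4)) > 0.5).astype(float)
print(top_k_coordinate_descent(X, Y, k=5))
```

The swap move keeps the support size fixed at exactly k, which mirrors the role the abstract assigns to the discrete selection matrix under the ℓ2,0 constraint: the search moves over supports of size k directly instead of relaxing the constraint into a continuous penalty.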
About the journal
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Theory, practice, and applications of neurocomputing are the essential topics covered.