基于点模糊互信息的特征低秩正则化多标签特征选择

IF 6.6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Soft Computing Pub Date : 2025-06-01 DOI:10.1016/j.asoc.2025.113301

Qingwei Jia , Tingquan Deng , Ziang Zhang , Yan Wang , Changzhong Wang

{"title":"基于点模糊互信息的特征低秩正则化多标签特征选择","authors":"Qingwei Jia , Tingquan Deng , Ziang Zhang , Yan Wang , Changzhong Wang","doi":"10.1016/j.asoc.2025.113301","DOIUrl":null,"url":null,"abstract":"<div><div>Feature selection is an effective solution to the dimensionality explosion of multi-label data. To assess the classification capability of features, many techniques have been developed from the view of granular computing. These methods can address the uncertainty in data but confront two primary challenges. Firstly, they rely on the predefined evaluation function and heuristic search strategies often converge to local optima. Secondly, redundant features remain difficult to be identified and eliminated. To tackle these challenges, we propose an embedded multi-label feature selection model by integrating pointwise fuzzy mutual information and feature low-rank regularization (PMILR). Redundant features tend to be highly correlated with other features, but contribute minimally to the labels. Highly correlated features exist in the same subspace, making the feature space low-rank. In this study, the low-rank structure of features is revealed to recognize potentially redundant features. Simultaneously, the pointwise fuzzy mutual information is formulated to capture the feature-label correlation. With the guidance of feature representation coefficients and feature-label correlation, a regularizer is properly designed to eliminate the effect of redundant features to labels. Theoretical analysis and experimental results validate the superiority of the developed method.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"179 ","pages":"Article 113301"},"PeriodicalIF":6.6000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Pointwise fuzzy mutual information based multi-label feature selection via feature low-rank regularization\",\"authors\":\"Qingwei Jia , Tingquan Deng , Ziang Zhang , Yan Wang , Changzhong Wang\",\"doi\":\"10.1016/j.asoc.2025.113301\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Feature selection is an effective solution to the dimensionality explosion of multi-label data. To assess the classification capability of features, many techniques have been developed from the view of granular computing. These methods can address the uncertainty in data but confront two primary challenges. Firstly, they rely on the predefined evaluation function and heuristic search strategies often converge to local optima. Secondly, redundant features remain difficult to be identified and eliminated. To tackle these challenges, we propose an embedded multi-label feature selection model by integrating pointwise fuzzy mutual information and feature low-rank regularization (PMILR). Redundant features tend to be highly correlated with other features, but contribute minimally to the labels. Highly correlated features exist in the same subspace, making the feature space low-rank. In this study, the low-rank structure of features is revealed to recognize potentially redundant features. Simultaneously, the pointwise fuzzy mutual information is formulated to capture the feature-label correlation. With the guidance of feature representation coefficients and feature-label correlation, a regularizer is properly designed to eliminate the effect of redundant features to labels. Theoretical analysis and experimental results validate the superiority of the developed method.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"179 \",\"pages\":\"Article 113301\"},\"PeriodicalIF\":6.6000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S156849462500612X\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S156849462500612X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

特征选择是解决多标签数据维数爆炸问题的有效方法。为了评估特征的分类能力，从颗粒计算的角度开发了许多技术。这些方法可以解决数据的不确定性，但面临两个主要挑战。首先，它们依赖于预定义的评价函数，启发式搜索策略往往收敛于局部最优。其次，冗余特征仍然难以识别和消除。为了解决这些问题，我们提出了一种集成点模糊互信息和特征低秩正则化（PMILR）的嵌入式多标签特征选择模型。冗余特征往往与其他特征高度相关，但对标签的贡献最小。高度相关的特征存在于同一子空间中，使得特征空间低秩。在本研究中，揭示了特征的低阶结构，以识别潜在的冗余特征。同时，建立了逐点模糊互信息来捕捉特征标签的相关性。在特征表示系数和特征-标签相关性的指导下，合理设计正则化器，消除冗余特征对标签的影响。理论分析和实验结果验证了该方法的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Pointwise fuzzy mutual information based multi-label feature selection via feature low-rank regularization

Feature selection is an effective solution to the dimensionality explosion of multi-label data. To assess the classification capability of features, many techniques have been developed from the view of granular computing. These methods can address the uncertainty in data but confront two primary challenges. Firstly, they rely on the predefined evaluation function and heuristic search strategies often converge to local optima. Secondly, redundant features remain difficult to be identified and eliminated. To tackle these challenges, we propose an embedded multi-label feature selection model by integrating pointwise fuzzy mutual information and feature low-rank regularization (PMILR). Redundant features tend to be highly correlated with other features, but contribute minimally to the labels. Highly correlated features exist in the same subspace, making the feature space low-rank. In this study, the low-rank structure of features is revealed to recognize potentially redundant features. Simultaneously, the pointwise fuzzy mutual information is formulated to capture the feature-label correlation. With the guidance of feature representation coefficients and feature-label correlation, a regularizer is properly designed to eliminate the effect of redundant features to labels. Theoretical analysis and experimental results validate the superiority of the developed method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Applied Soft Computing 工程技术-计算机：跨学科应用

CiteScore

15.80

自引率

6.90%

发文量

874

审稿时长

10.9 months

期刊介绍： Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.