Xiangguang Dai, Mingyu Guan, Facheng Dai, Wei Zhang, Tingji Zhang, Hangjun Che, Xiangqin Dai
{"title":"Unsupervised feature selection based on generalized regression model with linear discriminant constraints","authors":"Xiangguang Dai, Mingyu Guan, Facheng Dai, Wei Zhang, Tingji Zhang, Hangjun Che, Xiangqin Dai","doi":"10.1007/s40747-025-01873-1","DOIUrl":null,"url":null,"abstract":"<p>Unsupervised feature selection (UFS) methods play a crucial role in improving the efficiency of extracting relevant information and reducing computational complexity in the context of big data analysis. Despite notable advancements in the field of unsupervised feature selection for large-scale datasets, many UFS methods still remain redundant and irrelevant features during the feature selection process. To tackle these challenges, we present a novel unsupervised feature selection method that leverages the generalized regression model with linear discriminant constraints to learn discriminant and effective features from the data. Benefited from this, the relationships and patterns within the high-dimensional data are retained in the reduced-dimensional feature space. We reformulate our proposed method as a multi-variable optimization problem that incorporates equality constraints. To efficiently solve this problem, we develop an algorithm that updates each variable alternately. Extensive experiments on six datasets among nine state-of-the-art methods on the clustering task are conducted to demonstrate the effectiveness of the proposed method.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"15 1","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Complex & Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s40747-025-01873-1","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Unsupervised feature selection (UFS) methods play a crucial role in improving the efficiency of extracting relevant information and reducing computational complexity in the context of big data analysis. Despite notable advancements in the field of unsupervised feature selection for large-scale datasets, many UFS methods still remain redundant and irrelevant features during the feature selection process. To tackle these challenges, we present a novel unsupervised feature selection method that leverages the generalized regression model with linear discriminant constraints to learn discriminant and effective features from the data. Benefited from this, the relationships and patterns within the high-dimensional data are retained in the reduced-dimensional feature space. We reformulate our proposed method as a multi-variable optimization problem that incorporates equality constraints. To efficiently solve this problem, we develop an algorithm that updates each variable alternately. Extensive experiments on six datasets among nine state-of-the-art methods on the clustering task are conducted to demonstrate the effectiveness of the proposed method.
期刊介绍:
Complex & Intelligent Systems aims to provide a forum for presenting and discussing novel approaches, tools and techniques meant for attaining a cross-fertilization between the broad fields of complex systems, computational simulation, and intelligent analytics and visualization. The transdisciplinary research that the journal focuses on will expand the boundaries of our understanding by investigating the principles and processes that underlie many of the most profound problems facing society today.