Zhonghua Liu , Fa Zhu , Athanasios V. Vasilakos , Xingchi Chen , Qiang Zhao , David Camacho
{"title":"Discriminative approximate regression projection for feature extraction","authors":"Zhonghua Liu , Fa Zhu , Athanasios V. Vasilakos , Xingchi Chen , Qiang Zhao , David Camacho","doi":"10.1016/j.inffus.2025.103088","DOIUrl":null,"url":null,"abstract":"<div><div>Dimension reduction has attracted much attention in pattern recognition and computer vision to mitigate the ‘curse of dimensionality’. As a classical dimension reduction method, least squares regression (LSR) shows its powerful ability in fitting data and extracting discriminative features. However, LSR still faces the following issues. First, the retained feature dimension is strictly equal to the number of classes, which lacks flexibility. Especially, when the categories are few, LSR is insufficient to extract enough features. Second, the local structure relation is neglected. The manifold in LSR only preserves the neighborhood relationship of high-dimensional data, but cannot cluster these data to a single point in a low-dimensional space. Third, LSR considers dimensionality reduction, but neglects data reconstruction. In fact, if the extracted features can reconstruct the original data well, it indicates that the extracted features retain most information of the original data, which will be beneficial for the further classification. To this end, this paper proposes Discriminative Approximate Regression Project (DARP) to address the above issues. In DARP, two matrices are introduced for feature extraction and label fitting to ensure the flexibility of the extracted feature dimension. Using the label matrix as a weight matrix, the samples from each class are projected onto their corresponding class centroid point, which can make the samples from each class to gather in a low-dimensional space. Furthermore, a data reconstruction regularization term is introduced to enable that the feature extraction matrix can reconstruct the original data to preserve most information of the original data. Comprehensive experimental results on widely tested benchmarks demonstrate the competitive performance of DARP.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"120 ","pages":"Article 103088"},"PeriodicalIF":14.7000,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253525001617","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Dimension reduction has attracted much attention in pattern recognition and computer vision to mitigate the ‘curse of dimensionality’. As a classical dimension reduction method, least squares regression (LSR) shows its powerful ability in fitting data and extracting discriminative features. However, LSR still faces the following issues. First, the retained feature dimension is strictly equal to the number of classes, which lacks flexibility. Especially, when the categories are few, LSR is insufficient to extract enough features. Second, the local structure relation is neglected. The manifold in LSR only preserves the neighborhood relationship of high-dimensional data, but cannot cluster these data to a single point in a low-dimensional space. Third, LSR considers dimensionality reduction, but neglects data reconstruction. In fact, if the extracted features can reconstruct the original data well, it indicates that the extracted features retain most information of the original data, which will be beneficial for the further classification. To this end, this paper proposes Discriminative Approximate Regression Project (DARP) to address the above issues. In DARP, two matrices are introduced for feature extraction and label fitting to ensure the flexibility of the extracted feature dimension. Using the label matrix as a weight matrix, the samples from each class are projected onto their corresponding class centroid point, which can make the samples from each class to gather in a low-dimensional space. Furthermore, a data reconstruction regularization term is introduced to enable that the feature extraction matrix can reconstruct the original data to preserve most information of the original data. Comprehensive experimental results on widely tested benchmarks demonstrate the competitive performance of DARP.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.