{"title":"Random projection-based response best-subset selector for ultra-high dimensional multivariate data","authors":"Jianhua Hu , Tao Li , Xiaoqian Liu , Xu Liu","doi":"10.1016/j.jmva.2025.105465","DOIUrl":null,"url":null,"abstract":"<div><div>In this article, we propose a random projection-based response best-subset selector to perform response variable selection in ultra-high dimensional multivariate data, where both the dimensions of response and predictor variables are substantially greater than the sample size. This method is developed by integrating the response best-subset selector and random projection technique which is applied to reduce dimensionality of predictors. Under a multivariate tail eigenvalue condition, such a random projection-based dimensionality reduction of predictors only leads to an ignorable error between the original and dimension-reduced models. A computational procedure is presented. The proposed method exhibits model consistency under some certain conditions. The efficiency and merit of the proposed method are strongly supported by extensive finite-sample simulation studies. A real breast cancer dataset spanning 22 chromosomes are analyzed to demonstrate the proposed method.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"210 ","pages":"Article 105465"},"PeriodicalIF":1.4000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Multivariate Analysis","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0047259X25000600","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
In this article, we propose a random projection-based response best-subset selector to perform response variable selection in ultra-high dimensional multivariate data, where both the dimensions of response and predictor variables are substantially greater than the sample size. This method is developed by integrating the response best-subset selector and random projection technique which is applied to reduce dimensionality of predictors. Under a multivariate tail eigenvalue condition, such a random projection-based dimensionality reduction of predictors only leads to an ignorable error between the original and dimension-reduced models. A computational procedure is presented. The proposed method exhibits model consistency under some certain conditions. The efficiency and merit of the proposed method are strongly supported by extensive finite-sample simulation studies. A real breast cancer dataset spanning 22 chromosomes are analyzed to demonstrate the proposed method.
期刊介绍:
Founded in 1971, the Journal of Multivariate Analysis (JMVA) is the central venue for the publication of new, relevant methodology and particularly innovative applications pertaining to the analysis and interpretation of multidimensional data.
The journal welcomes contributions to all aspects of multivariate data analysis and modeling, including cluster analysis, discriminant analysis, factor analysis, and multidimensional continuous or discrete distribution theory. Topics of current interest include, but are not limited to, inferential aspects of
Copula modeling
Functional data analysis
Graphical modeling
High-dimensional data analysis
Image analysis
Multivariate extreme-value theory
Sparse modeling
Spatial statistics.