{"title":"一种基于p值的高维均值降维检验","authors":"Hongyan Fang, Chunyu Yao, Wenzhi Yang, Xuejun Wang, Huang Xu","doi":"10.1080/02331888.2023.2179627","DOIUrl":null,"url":null,"abstract":"With the rapid development of modern computing techniques, high-dimensional data are increasingly encountered in many studies. In this paper, we propose a three-step method to study the mean testing problem. The proposed test is based on the p-values calculated from the univariate tests and the dimension reduction method. Since it does not require explicit conditions of data dimension and sample size, we can use it to solve the mean testing problem of high-dimensional data, especially when the data dimension is much larger than the sample size. The new method can be implemented for the normal and non-normal distribution, which has a wide application. Various simulations are conducted to compare the testing power of the new method and the existing tests. The comparison shows that the new method has higher testing power. We also apply the proposed method to a real example of gene expression data.","PeriodicalId":54358,"journal":{"name":"Statistics","volume":"58 1","pages":"282 - 299"},"PeriodicalIF":1.2000,"publicationDate":"2023-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A p-value based dimensionality reduction test for high dimensional means\",\"authors\":\"Hongyan Fang, Chunyu Yao, Wenzhi Yang, Xuejun Wang, Huang Xu\",\"doi\":\"10.1080/02331888.2023.2179627\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rapid development of modern computing techniques, high-dimensional data are increasingly encountered in many studies. In this paper, we propose a three-step method to study the mean testing problem. The proposed test is based on the p-values calculated from the univariate tests and the dimension reduction method. Since it does not require explicit conditions of data dimension and sample size, we can use it to solve the mean testing problem of high-dimensional data, especially when the data dimension is much larger than the sample size. The new method can be implemented for the normal and non-normal distribution, which has a wide application. Various simulations are conducted to compare the testing power of the new method and the existing tests. The comparison shows that the new method has higher testing power. We also apply the proposed method to a real example of gene expression data.\",\"PeriodicalId\":54358,\"journal\":{\"name\":\"Statistics\",\"volume\":\"58 1\",\"pages\":\"282 - 299\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2023-02-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Statistics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1080/02331888.2023.2179627\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1080/02331888.2023.2179627","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
A p-value based dimensionality reduction test for high dimensional means
With the rapid development of modern computing techniques, high-dimensional data are increasingly encountered in many studies. In this paper, we propose a three-step method to study the mean testing problem. The proposed test is based on the p-values calculated from the univariate tests and the dimension reduction method. Since it does not require explicit conditions of data dimension and sample size, we can use it to solve the mean testing problem of high-dimensional data, especially when the data dimension is much larger than the sample size. The new method can be implemented for the normal and non-normal distribution, which has a wide application. Various simulations are conducted to compare the testing power of the new method and the existing tests. The comparison shows that the new method has higher testing power. We also apply the proposed method to a real example of gene expression data.
期刊介绍:
Statistics publishes papers developing and analysing new methods for any active field of statistics, motivated by real-life problems. Papers submitted for consideration should provide interesting and novel contributions to statistical theory and its applications with rigorous mathematical results and proofs. Moreover, numerical simulations and application to real data sets can improve the quality of papers, and should be included where appropriate. Statistics does not publish papers which represent mere application of existing procedures to case studies, and papers are required to contain methodological or theoretical innovation. Topics of interest include, for example, nonparametric statistics, time series, analysis of topological or functional data. Furthermore the journal also welcomes submissions in the field of theoretical econometrics and its links to mathematical statistics.