Mean test for high-dimensional data based on covariance matrix with linear structures

IF 0.9 4区 数学 Q3 STATISTICS & PROBABILITY
Metrika Pub Date : 2024-07-02 DOI:10.1007/s00184-024-00971-3
Guanpeng Wang, Yuyuan Wang, Hengjian Cui
{"title":"Mean test for high-dimensional data based on covariance matrix with linear structures","authors":"Guanpeng Wang, Yuyuan Wang, Hengjian Cui","doi":"10.1007/s00184-024-00971-3","DOIUrl":null,"url":null,"abstract":"<p>In this work, the mean test is considered under the condition that the number of dimensions <i>p</i> is much larger than the sample size <i>n</i> when the covariance matrix is represented as a linear structure as possible. At first, the estimator of coefficients in the linear structures of the covariance matrix is constructed, and then an efficient covariance matrix estimator is naturally given. Next, a new test statistic similar to the classical Hotelling’s <span>\\(T^2\\)</span> test is proposed by replacing the sample covariance matrix with the given estimator of covariance matrix. Then the asymptotic normality of the estimator of coefficients and that of a new statistic for the mean test are separately obtained under some mild conditions. Simulation results show that the performance of the proposed test statistic is almost the same as the Hotelling’s <span>\\(T^2\\)</span> test statistic for which the covariance matrix is known. Our new test statistic can not only control reasonably the nominal level; it also gains greater empirical powers than competing tests. It is found that the power of mean test has great improvement when considering the structure information of the covariance matrix, especially for high-dimensional cases. Moreover, an example with real data is provided to show the application of our approach.</p>","PeriodicalId":49821,"journal":{"name":"Metrika","volume":"156 1","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Metrika","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s00184-024-00971-3","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0

Abstract

In this work, the mean test is considered under the condition that the number of dimensions p is much larger than the sample size n when the covariance matrix is represented as a linear structure as possible. At first, the estimator of coefficients in the linear structures of the covariance matrix is constructed, and then an efficient covariance matrix estimator is naturally given. Next, a new test statistic similar to the classical Hotelling’s \(T^2\) test is proposed by replacing the sample covariance matrix with the given estimator of covariance matrix. Then the asymptotic normality of the estimator of coefficients and that of a new statistic for the mean test are separately obtained under some mild conditions. Simulation results show that the performance of the proposed test statistic is almost the same as the Hotelling’s \(T^2\) test statistic for which the covariance matrix is known. Our new test statistic can not only control reasonably the nominal level; it also gains greater empirical powers than competing tests. It is found that the power of mean test has great improvement when considering the structure information of the covariance matrix, especially for high-dimensional cases. Moreover, an example with real data is provided to show the application of our approach.

Abstract Image

基于线性结构协方差矩阵的高维数据均值检验
在本研究中,当协方差矩阵尽可能表示为线性结构时,均值检验是在维数 p 远大于样本量 n 的条件下考虑的。首先,构建协方差矩阵线性结构中系数的估计器,然后自然给出有效的协方差矩阵估计器。接着,通过用给定的协方差矩阵估计器代替样本协方差矩阵,提出了一种类似于经典霍特林(T^2\)检验的新检验统计量。然后,在一些温和的条件下,分别得到了系数估计值的渐近正态性和均值检验的新统计量的渐近正态性。仿真结果表明,所提出的检验统计量与已知协方差矩阵的 Hotelling's \(T^2\) 检验统计量的性能几乎相同。我们的新检验统计量不仅能合理地控制名义水平,还能获得比其他竞争检验更大的经验力量。研究发现,在考虑协方差矩阵的结构信息时,均值检验的功率有很大提高,尤其是在高维情况下。此外,我们还提供了一个真实数据的例子来说明我们的方法的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Metrika
Metrika 数学-统计学与概率论
CiteScore
1.50
自引率
14.30%
发文量
39
审稿时长
6-12 weeks
期刊介绍: Metrika is an international journal for theoretical and applied statistics. Metrika publishes original research papers in the field of mathematical statistics and statistical methods. Great importance is attached to new developments in theoretical statistics, statistical modeling and to actual innovative applicability of the proposed statistical methods and results. Topics of interest include, without being limited to, multivariate analysis, high dimensional statistics and nonparametric statistics; categorical data analysis and latent variable models; reliability, lifetime data analysis and statistics in engineering sciences.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信