{"title":"Nonuniformity of P-values Can Occur Early in Diverging Dimensions.","authors":"Yingying Fan, Emre Demirkaya, Jinchi Lv","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>Evaluating the joint significance of covariates is of fundamental importance in a wide range of applications. To this end, p-values are frequently employed and produced by algorithms that are powered by classical large-sample asymptotic theory. It is well known that the conventional p-values in Gaussian linear model are valid even when the dimensionality is a non-vanishing fraction of the sample size, but can break down when the design matrix becomes singular in higher dimensions or when the error distribution deviates from Gaussianity. A natural question is when the conventional p-values in generalized linear models become invalid in diverging dimensions. We establish that such a breakdown can occur early in nonlinear models. Our theoretical characterizations are confirmed by simulation studies.</p>","PeriodicalId":50161,"journal":{"name":"Journal of Machine Learning Research","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7079742/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Machine Learning Research","FirstCategoryId":"94","ListUrlMain":"","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Evaluating the joint significance of covariates is of fundamental importance in a wide range of applications. To this end, p-values are frequently employed and produced by algorithms that are powered by classical large-sample asymptotic theory. It is well known that the conventional p-values in the Gaussian linear model are valid even when the dimensionality is a non-vanishing fraction of the sample size, but can break down when the design matrix becomes singular in higher dimensions or when the error distribution deviates from Gaussianity. A natural question is when the conventional p-values in generalized linear models become invalid in diverging dimensions. We establish that such a breakdown can occur early in nonlinear models. Our theoretical characterizations are confirmed by simulation studies.
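The phenomenon described in the abstract can be probed with a small simulation. The sketch below is not the paper's simulation design; it is a minimal illustration under assumed settings (sample size n, dimensionality p set to a fraction of n, number of repetitions, and use of statsmodels' Wald p-values are all illustrative choices). It fits a logistic regression under the global null with p growing proportionally to n, collects the conventional p-value for one null covariate across repetitions, and checks how far it is from uniform on [0, 1].

```python
# Minimal sketch (assumed settings, not the paper's design): check whether
# conventional Wald p-values in logistic regression remain uniform when the
# dimensionality p is a non-vanishing fraction of the sample size n.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)

def null_pvalues(n=400, frac=0.2, n_rep=200):
    """Collect conventional p-values for one null covariate across repetitions."""
    p = int(frac * n)                      # diverging dimensionality: p grows with n
    pvals = []
    for _ in range(n_rep):
        X = rng.standard_normal((n, p))
        # Global null: all regression coefficients are zero, so every
        # response has success probability 0.5.
        y = rng.binomial(1, 0.5, size=n)
        fit = sm.Logit(y, X).fit(disp=0)
        pvals.append(fit.pvalues[0])       # conventional Wald p-value, first covariate
    return np.array(pvals)

pv = null_pvalues()
# Under exact uniformity, roughly 5% of the p-values should fall below 0.05;
# a markedly larger fraction indicates the nonuniformity discussed in the paper.
print("Fraction of p-values below 0.05:", np.mean(pv < 0.05))
```

Rerunning the sketch with a smaller fraction (e.g. frac=0.02) versus a larger one makes the contrast visible: the empirical rejection rate at the 0.05 level tends to drift away from 0.05 as p/n grows.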
Journal Description
The Journal of Machine Learning Research (JMLR) provides an international forum for the electronic and paper publication of high-quality scholarly articles in all areas of machine learning. All published papers are freely available online.
JMLR has a commitment to rigorous yet rapid reviewing.
JMLR seeks previously unpublished papers on machine learning that contain:
new principled algorithms with sound empirical validation, and with justification of theoretical, psychological, or biological nature;
experimental and/or theoretical studies yielding new insight into the design and behavior of learning in intelligent systems;
accounts of applications of existing techniques that shed light on the strengths and weaknesses of the methods;
formalization of new learning tasks (e.g., in the context of new applications) and of methods for assessing performance on those tasks;
development of new analytical frameworks that advance theoretical studies of practical learning methods;
computational models of data from natural learning systems at the behavioral or neural level; or
extremely well-written surveys of existing work.