具有非多项式维度滋扰参数的可能错误定义的广义线性模型的推理

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS

ACS Applied Bio Materials Pub Date : 2024-05-24 DOI:10.1093/biomet/asae024

Shaoxin Hong, Jiancheng Jiang, Xuejun Jiang, Haofeng Wang

{"title":"具有非多项式维度滋扰参数的可能错误定义的广义线性模型的推理","authors":"Shaoxin Hong, Jiancheng Jiang, Xuejun Jiang, Haofeng Wang","doi":"10.1093/biomet/asae024","DOIUrl":null,"url":null,"abstract":"\n It is routine practice in statistical modelling to first select variables and then make inference for the selected model as in stepwise regression. Such inference is made upon the assumption that the selected model is true. However, without this assumption, one would not know the validity of the inference. Similar problems also exist in high dimensional regression with regularization. To address these problems, we propose a dimension-reduced generalized likelihood ratio test for generalized linear models with nonpolynomial dimensionality, based on the quasilikelihood estimation which allows for misspecification of the conditional variance. The test has nearly oracle performance when using the correct amount of shrinkage and has robust performance against the choice of regularization parameter across a large range. We further develop an adaptive data-driven dimension-reduced generalized likelihood ratio test and prove that with probability going to one it is an oracle generalized likelihood ratio test. However, in ultrahigh-dimensional models the penalized estimation may produce spuriously important variables which deteriorate the performance of test. To tackle this problem, we introduce a cross-fitted dimension-reduced generalized likelihood ratio test, which is not only free of spurious effects but robust against the choice of regularization parameter. We establish limiting distributions of the proposed tests. Their advantages are highlighted via theoretical and empirical comparisons to some competitive tests. An application to breast cancer data illustrates the use of our proposed methodology.","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":"14 1","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Inference for possibly misspecified generalized linear models with nonpolynomial-dimensional nuisance parameters\",\"authors\":\"Shaoxin Hong, Jiancheng Jiang, Xuejun Jiang, Haofeng Wang\",\"doi\":\"10.1093/biomet/asae024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n It is routine practice in statistical modelling to first select variables and then make inference for the selected model as in stepwise regression. Such inference is made upon the assumption that the selected model is true. However, without this assumption, one would not know the validity of the inference. Similar problems also exist in high dimensional regression with regularization. To address these problems, we propose a dimension-reduced generalized likelihood ratio test for generalized linear models with nonpolynomial dimensionality, based on the quasilikelihood estimation which allows for misspecification of the conditional variance. The test has nearly oracle performance when using the correct amount of shrinkage and has robust performance against the choice of regularization parameter across a large range. We further develop an adaptive data-driven dimension-reduced generalized likelihood ratio test and prove that with probability going to one it is an oracle generalized likelihood ratio test. However, in ultrahigh-dimensional models the penalized estimation may produce spuriously important variables which deteriorate the performance of test. To tackle this problem, we introduce a cross-fitted dimension-reduced generalized likelihood ratio test, which is not only free of spurious effects but robust against the choice of regularization parameter. We establish limiting distributions of the proposed tests. Their advantages are highlighted via theoretical and empirical comparisons to some competitive tests. An application to breast cancer data illustrates the use of our proposed methodology.\",\"PeriodicalId\":2,\"journal\":{\"name\":\"ACS Applied Bio Materials\",\"volume\":\"14 1\",\"pages\":\"\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-05-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Applied Bio Materials\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1093/biomet/asae024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATERIALS SCIENCE, BIOMATERIALS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1093/biomet/asae024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}

引用次数: 0

摘要

统计建模的常规做法是先选择变量，然后对所选模型进行推理，如逐步回归法。这种推论是在假设所选模型为真的基础上进行的。然而，如果没有这个假设，我们就无法知道推论的有效性。带正则化的高维回归也存在类似的问题。为了解决这些问题，我们针对非多项式维度的广义线性模型提出了一种降维的广义似然比检验，它基于准似然估计，允许条件方差的错误规范。当使用正确的收缩量时，该检验具有近乎神谕的性能，并且在很大范围内对正则化参数的选择具有稳健的性能。我们进一步开发了一种自适应数据驱动的降维广义似然比检验，并证明它是一种概率为 1 的神谕广义似然比检验。然而，在超高维模型中，惩罚估计可能会产生虚假的重要变量，从而降低检验的性能。为了解决这个问题，我们引入了一种交叉拟合的降维广义似然比检验，它不仅没有虚假效应，而且对正则化参数的选择具有鲁棒性。我们建立了拟议检验的极限分布。通过与一些有竞争力的检验方法进行理论和实证比较，凸显了它们的优势。对乳腺癌数据的应用说明了我们提出的方法的用途。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Inference for possibly misspecified generalized linear models with nonpolynomial-dimensional nuisance parameters

It is routine practice in statistical modelling to first select variables and then make inference for the selected model as in stepwise regression. Such inference is made upon the assumption that the selected model is true. However, without this assumption, one would not know the validity of the inference. Similar problems also exist in high dimensional regression with regularization. To address these problems, we propose a dimension-reduced generalized likelihood ratio test for generalized linear models with nonpolynomial dimensionality, based on the quasilikelihood estimation which allows for misspecification of the conditional variance. The test has nearly oracle performance when using the correct amount of shrinkage and has robust performance against the choice of regularization parameter across a large range. We further develop an adaptive data-driven dimension-reduced generalized likelihood ratio test and prove that with probability going to one it is an oracle generalized likelihood ratio test. However, in ultrahigh-dimensional models the penalized estimation may produce spuriously important variables which deteriorate the performance of test. To tackle this problem, we introduce a cross-fitted dimension-reduced generalized likelihood ratio test, which is not only free of spurious effects but robust against the choice of regularization parameter. We establish limiting distributions of the proposed tests. Their advantages are highlighted via theoretical and empirical comparisons to some competitive tests. An application to breast cancer data illustrates the use of our proposed methodology.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

ACS Applied Bio Materials Chemistry-Chemistry (all)

CiteScore

9.40

自引率

2.10%

发文量

464

期刊介绍： ACS Applied Bio Materials is an interdisciplinary journal publishing original research covering all aspects of biomaterials and biointerfaces including and beyond the traditional biosensing, biomedical and therapeutic applications. The journal is devoted to reports of new and original experimental and theoretical research of an applied nature that integrates knowledge in the areas of materials, engineering, physics, bioscience, and chemistry into important bio applications. The journal is specifically interested in work that addresses the relationship between structure and function and assesses the stability and degradation of materials under relevant environmental and biological conditions.