涉及多个分类预测因子及其交互作用的随机效应回归方法

Hanmei Sun, Jiangshan Zhang, Jiming Jiang
{"title":"涉及多个分类预测因子及其交互作用的随机效应回归方法","authors":"Hanmei Sun, Jiangshan Zhang, Jiming Jiang","doi":"arxiv-2409.09355","DOIUrl":null,"url":null,"abstract":"Linear model prediction with a large number of potential predictors is both\nstatistically and computationally challenging. The traditional approaches are\nlargely based on shrinkage selection/estimation methods, which are applicable\neven when the number of potential predictors is (much) larger than the sample\nsize. A situation of the latter scenario occurs when the candidate predictors\ninvolve many binary indicators corresponding to categories of some categorical\npredictors as well as their interactions. We propose an alternative approach to\nthe shrinkage prediction methods in such a case based on mixed model\nprediction, which effectively treats combinations of the categorical effects as\nrandom effects. We establish theoretical validity of the proposed method, and\ndemonstrate empirically its advantage over the shrinkage methods. We also\ndevelop measures of uncertainty for the proposed method and evaluate their\nperformance empirically. A real-data example is considered.","PeriodicalId":501379,"journal":{"name":"arXiv - STAT - Statistics Theory","volume":"105 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Random-effects Approach to Regression Involving Many Categorical Predictors and Their Interactions\",\"authors\":\"Hanmei Sun, Jiangshan Zhang, Jiming Jiang\",\"doi\":\"arxiv-2409.09355\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Linear model prediction with a large number of potential predictors is both\\nstatistically and computationally challenging. The traditional approaches are\\nlargely based on shrinkage selection/estimation methods, which are applicable\\neven when the number of potential predictors is (much) larger than the sample\\nsize. A situation of the latter scenario occurs when the candidate predictors\\ninvolve many binary indicators corresponding to categories of some categorical\\npredictors as well as their interactions. We propose an alternative approach to\\nthe shrinkage prediction methods in such a case based on mixed model\\nprediction, which effectively treats combinations of the categorical effects as\\nrandom effects. We establish theoretical validity of the proposed method, and\\ndemonstrate empirically its advantage over the shrinkage methods. We also\\ndevelop measures of uncertainty for the proposed method and evaluate their\\nperformance empirically. A real-data example is considered.\",\"PeriodicalId\":501379,\"journal\":{\"name\":\"arXiv - STAT - Statistics Theory\",\"volume\":\"105 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Statistics Theory\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.09355\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Statistics Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09355","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

使用大量潜在预测因子进行线性模型预测,在统计和计算上都具有挑战性。传统方法主要基于收缩选择/估计方法,这些方法即使在潜在预测因子数量(远远)大于样本数量时也适用。当候选预测因子涉及许多与某些分类预测因子类别相对应的二进制指标以及它们之间的交互作用时,就会出现后一种情况。在这种情况下,我们提出了一种基于混合模型预测的收缩预测方法的替代方法,它能有效地将分类效应的组合视为随机效应。我们建立了所提方法的理论有效性,并通过实证证明了它相对于收缩预测方法的优势。我们还为所提出的方法制定了不确定性度量,并对其性能进行了实证评估。我们还考虑了一个真实数据示例。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Random-effects Approach to Regression Involving Many Categorical Predictors and Their Interactions
Linear model prediction with a large number of potential predictors is both statistically and computationally challenging. The traditional approaches are largely based on shrinkage selection/estimation methods, which are applicable even when the number of potential predictors is (much) larger than the sample size. A situation of the latter scenario occurs when the candidate predictors involve many binary indicators corresponding to categories of some categorical predictors as well as their interactions. We propose an alternative approach to the shrinkage prediction methods in such a case based on mixed model prediction, which effectively treats combinations of the categorical effects as random effects. We establish theoretical validity of the proposed method, and demonstrate empirically its advantage over the shrinkage methods. We also develop measures of uncertainty for the proposed method and evaluate their performance empirically. A real-data example is considered.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信