{"title":"涉及多个分类预测因子及其交互作用的随机效应回归方法","authors":"Hanmei Sun, Jiangshan Zhang, Jiming Jiang","doi":"arxiv-2409.09355","DOIUrl":null,"url":null,"abstract":"Linear model prediction with a large number of potential predictors is both\nstatistically and computationally challenging. The traditional approaches are\nlargely based on shrinkage selection/estimation methods, which are applicable\neven when the number of potential predictors is (much) larger than the sample\nsize. A situation of the latter scenario occurs when the candidate predictors\ninvolve many binary indicators corresponding to categories of some categorical\npredictors as well as their interactions. We propose an alternative approach to\nthe shrinkage prediction methods in such a case based on mixed model\nprediction, which effectively treats combinations of the categorical effects as\nrandom effects. We establish theoretical validity of the proposed method, and\ndemonstrate empirically its advantage over the shrinkage methods. We also\ndevelop measures of uncertainty for the proposed method and evaluate their\nperformance empirically. A real-data example is considered.","PeriodicalId":501379,"journal":{"name":"arXiv - STAT - Statistics Theory","volume":"105 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Random-effects Approach to Regression Involving Many Categorical Predictors and Their Interactions\",\"authors\":\"Hanmei Sun, Jiangshan Zhang, Jiming Jiang\",\"doi\":\"arxiv-2409.09355\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Linear model prediction with a large number of potential predictors is both\\nstatistically and computationally challenging. The traditional approaches are\\nlargely based on shrinkage selection/estimation methods, which are applicable\\neven when the number of potential predictors is (much) larger than the sample\\nsize. A situation of the latter scenario occurs when the candidate predictors\\ninvolve many binary indicators corresponding to categories of some categorical\\npredictors as well as their interactions. We propose an alternative approach to\\nthe shrinkage prediction methods in such a case based on mixed model\\nprediction, which effectively treats combinations of the categorical effects as\\nrandom effects. We establish theoretical validity of the proposed method, and\\ndemonstrate empirically its advantage over the shrinkage methods. We also\\ndevelop measures of uncertainty for the proposed method and evaluate their\\nperformance empirically. A real-data example is considered.\",\"PeriodicalId\":501379,\"journal\":{\"name\":\"arXiv - STAT - Statistics Theory\",\"volume\":\"105 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Statistics Theory\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.09355\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Statistics Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09355","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Random-effects Approach to Regression Involving Many Categorical Predictors and Their Interactions
Linear model prediction with a large number of potential predictors is both
statistically and computationally challenging. The traditional approaches are
largely based on shrinkage selection/estimation methods, which are applicable
even when the number of potential predictors is (much) larger than the sample
size. A situation of the latter scenario occurs when the candidate predictors
involve many binary indicators corresponding to categories of some categorical
predictors as well as their interactions. We propose an alternative approach to
the shrinkage prediction methods in such a case based on mixed model
prediction, which effectively treats combinations of the categorical effects as
random effects. We establish theoretical validity of the proposed method, and
demonstrate empirically its advantage over the shrinkage methods. We also
develop measures of uncertainty for the proposed method and evaluate their
performance empirically. A real-data example is considered.