{"title":"遗传互作选择的改进最小角回归算法","authors":"Woosung Kim, Seonghyeon Kim, M. Na, Yongdai Kim","doi":"10.1002/sam.11577","DOIUrl":null,"url":null,"abstract":"In many practical problems, the main effects alone may not be enough to capture the relationship between the response and predictors, and the interaction effects are often of interest to scientific researchers. In considering a regression model with main effects and all possible two‐way interaction effects, which we call the two‐way interaction model, there is an important challenge—computational burden. One way to reduce the aforementioned problems is to consider the heredity constraint between the main and interaction effects. The heredity constraint assumes that a given interaction effect is significant only when the corresponding main effects are significant. Various sparse penalized methods to reflect the heredity constraint have been proposed, but those algorithms are still computationally demanding and can be applied to data where the dimension of the main effects is only few hundreds. In this paper, we propose a modification of the LARS algorithm for selecting interaction effects under the heredity constraint, which can be applied to high‐dimensional data. Our numerical studies confirm that the proposed modified LARS algorithm is much faster and spends less memory than its competitors but has comparable prediction accuracies when the dimension of covariates is large.","PeriodicalId":342679,"journal":{"name":"Statistical Analysis and Data Mining: The ASA Data Science Journal","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A modified least angle regression algorithm for interaction selection with heredity\",\"authors\":\"Woosung Kim, Seonghyeon Kim, M. Na, Yongdai Kim\",\"doi\":\"10.1002/sam.11577\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In many practical problems, the main effects alone may not be enough to capture the relationship between the response and predictors, and the interaction effects are often of interest to scientific researchers. In considering a regression model with main effects and all possible two‐way interaction effects, which we call the two‐way interaction model, there is an important challenge—computational burden. One way to reduce the aforementioned problems is to consider the heredity constraint between the main and interaction effects. The heredity constraint assumes that a given interaction effect is significant only when the corresponding main effects are significant. Various sparse penalized methods to reflect the heredity constraint have been proposed, but those algorithms are still computationally demanding and can be applied to data where the dimension of the main effects is only few hundreds. In this paper, we propose a modification of the LARS algorithm for selecting interaction effects under the heredity constraint, which can be applied to high‐dimensional data. Our numerical studies confirm that the proposed modified LARS algorithm is much faster and spends less memory than its competitors but has comparable prediction accuracies when the dimension of covariates is large.\",\"PeriodicalId\":342679,\"journal\":{\"name\":\"Statistical Analysis and Data Mining: The ASA Data Science Journal\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Statistical Analysis and Data Mining: The ASA Data Science Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/sam.11577\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistical Analysis and Data Mining: The ASA Data Science Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/sam.11577","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A modified least angle regression algorithm for interaction selection with heredity
In many practical problems, the main effects alone may not be enough to capture the relationship between the response and predictors, and the interaction effects are often of interest to scientific researchers. In considering a regression model with main effects and all possible two‐way interaction effects, which we call the two‐way interaction model, there is an important challenge—computational burden. One way to reduce the aforementioned problems is to consider the heredity constraint between the main and interaction effects. The heredity constraint assumes that a given interaction effect is significant only when the corresponding main effects are significant. Various sparse penalized methods to reflect the heredity constraint have been proposed, but those algorithms are still computationally demanding and can be applied to data where the dimension of the main effects is only few hundreds. In this paper, we propose a modification of the LARS algorithm for selecting interaction effects under the heredity constraint, which can be applied to high‐dimensional data. Our numerical studies confirm that the proposed modified LARS algorithm is much faster and spends less memory than its competitors but has comparable prediction accuracies when the dimension of covariates is large.