多重共线性下Ridge、Bridge和Lasso回归模型的比较分析

Kelachi P. Enwere, E. Nduka, U. Ogoke
{"title":"多重共线性下Ridge、Bridge和Lasso回归模型的比较分析","authors":"Kelachi P. Enwere, E. Nduka, U. Ogoke","doi":"10.54117/iimj.v3i1.5","DOIUrl":null,"url":null,"abstract":"Aims: This research work investigated the best regression technique in handling multicollinearity using the Ridge, Least Absolute Shrinkage and Selection Operator (LASSO) and Bridge regression models in comparison to Analysis and Prediction. Study Design: Two sets of secondary data on Body Size and Heart Rate gotten from the Lulu Briggs Health Center, University of Port Harcourt were used for comparison for model fit and in handling multicollinearity between the regression techniques. Tables were used to present Comparisons made using MSE, RMSE, VIF, AIC and BIC for efficiency. Scatter plots were employed to show fitted regression models. R Software was used to perform data analysis. Methodology: The data were tested for the presence of Multicollinearity using VIF respectively, before proceeding to apply Ridge, LASSO and Bridge regression techniques to solve the problem of multicollinearity. Then comparison was made in analysis and prediction between the regression techniques. Results: The results from the study show that, for analysis on body size, we found that none of the Regression Techniques handled the problem of multicollinearity, even though the degree of multicollinearity present in the data set reduces, with VIF values of 11.36762 for Ridge, 10.8042 for LASSO, and Bridge which are 10.95578, 11.24945, 12.22628 and 12.14645 respectively. For Heart Rate analysis, we see that all the regularized regression techniques handled the problem of multicollinearity. The results show that the Bridge regression technique performed better with a VIF of 1.744461 when","PeriodicalId":301364,"journal":{"name":"IPS Intelligentsia Multidisciplinary Journal","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparative Analysis of Ridge, Bridge and Lasso Regression Models In the Presence of Multicollinearity\",\"authors\":\"Kelachi P. Enwere, E. Nduka, U. Ogoke\",\"doi\":\"10.54117/iimj.v3i1.5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aims: This research work investigated the best regression technique in handling multicollinearity using the Ridge, Least Absolute Shrinkage and Selection Operator (LASSO) and Bridge regression models in comparison to Analysis and Prediction. Study Design: Two sets of secondary data on Body Size and Heart Rate gotten from the Lulu Briggs Health Center, University of Port Harcourt were used for comparison for model fit and in handling multicollinearity between the regression techniques. Tables were used to present Comparisons made using MSE, RMSE, VIF, AIC and BIC for efficiency. Scatter plots were employed to show fitted regression models. R Software was used to perform data analysis. Methodology: The data were tested for the presence of Multicollinearity using VIF respectively, before proceeding to apply Ridge, LASSO and Bridge regression techniques to solve the problem of multicollinearity. Then comparison was made in analysis and prediction between the regression techniques. Results: The results from the study show that, for analysis on body size, we found that none of the Regression Techniques handled the problem of multicollinearity, even though the degree of multicollinearity present in the data set reduces, with VIF values of 11.36762 for Ridge, 10.8042 for LASSO, and Bridge which are 10.95578, 11.24945, 12.22628 and 12.14645 respectively. For Heart Rate analysis, we see that all the regularized regression techniques handled the problem of multicollinearity. The results show that the Bridge regression technique performed better with a VIF of 1.744461 when\",\"PeriodicalId\":301364,\"journal\":{\"name\":\"IPS Intelligentsia Multidisciplinary Journal\",\"volume\":\"54 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IPS Intelligentsia Multidisciplinary Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.54117/iimj.v3i1.5\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IPS Intelligentsia Multidisciplinary Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54117/iimj.v3i1.5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

目的:本研究工作探讨了处理多重共线性的最佳回归技术,使用岭、最小绝对收缩和选择算子(LASSO)和桥回归模型与分析和预测模型进行比较。研究设计:采用哈科特港大学Lulu Briggs健康中心的两组关于体型和心率的二手数据进行模型拟合比较,并处理回归技术之间的多重共线性。表格用于展示使用MSE, RMSE, VIF, AIC和BIC进行效率比较。散点图表示拟合的回归模型。使用R软件进行数据分析。方法:在应用Ridge, LASSO和Bridge回归技术解决多重共线性问题之前,分别使用VIF测试数据是否存在多重共线性。在此基础上,对两种回归方法在分析和预测方面进行了比较。结果:研究结果表明,对于体型的分析,我们发现所有回归技术都没有处理多重共线性问题,尽管数据集中存在多重共线性的程度有所降低,Ridge的VIF值为11.36762,LASSO的VIF值为10.8042,Bridge的VIF值分别为10.95578,11.24945,12.22628和12.14645。对于心率分析,我们看到所有的正则化回归技术都处理多重共线性问题。结果表明,当VIF为1.744461时,桥式回归技术表现较好
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Comparative Analysis of Ridge, Bridge and Lasso Regression Models In the Presence of Multicollinearity
Aims: This research work investigated the best regression technique in handling multicollinearity using the Ridge, Least Absolute Shrinkage and Selection Operator (LASSO) and Bridge regression models in comparison to Analysis and Prediction. Study Design: Two sets of secondary data on Body Size and Heart Rate gotten from the Lulu Briggs Health Center, University of Port Harcourt were used for comparison for model fit and in handling multicollinearity between the regression techniques. Tables were used to present Comparisons made using MSE, RMSE, VIF, AIC and BIC for efficiency. Scatter plots were employed to show fitted regression models. R Software was used to perform data analysis. Methodology: The data were tested for the presence of Multicollinearity using VIF respectively, before proceeding to apply Ridge, LASSO and Bridge regression techniques to solve the problem of multicollinearity. Then comparison was made in analysis and prediction between the regression techniques. Results: The results from the study show that, for analysis on body size, we found that none of the Regression Techniques handled the problem of multicollinearity, even though the degree of multicollinearity present in the data set reduces, with VIF values of 11.36762 for Ridge, 10.8042 for LASSO, and Bridge which are 10.95578, 11.24945, 12.22628 and 12.14645 respectively. For Heart Rate analysis, we see that all the regularized regression techniques handled the problem of multicollinearity. The results show that the Bridge regression technique performed better with a VIF of 1.744461 when
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信