Bagging-based heteroscedasticity-adjusted ridge estimators in the linear regression model
Authors: Maha Shabbir, Sohail Chand, Irum Sajjad Dar
Journal: Kuwait Journal of Science, vol. 52, no. 3, Article 100412
Publication date: 2025-03-26
Publication type: Journal Article
DOI: 10.1016/j.kjs.2025.100412
URL: https://www.sciencedirect.com/science/article/pii/S2307410825000562
Impact factor: 1.2; JCR: Q3 (Multidisciplinary Sciences)
Open access: no
Citations: 0
Abstract
Multicollinearity among the independent variables, combined with heteroscedastic errors, severely degrades the performance of the ordinary least squares (OLS) estimator and its covariance matrix. In this study, new bootstrap aggregation (i.e., bagging) based ridge estimators are introduced to circumvent multicollinearity while controlling the influence of heteroscedastic error terms. These estimators are a novel combination of bagging and heteroscedasticity-adjusted ridge (HAR) estimators. Their performance is evaluated through intensive Monte Carlo simulations that combine multicollinearity with low, high, and severe degrees of heteroscedasticity. The findings reveal that, under severe heteroscedasticity with a collinearity level of 0.80, a sample size of 25, and 5 independent variables, the proposed HAR-HK∗, HAR-LW∗, HAR-HSL∗, and HAR-KMS∗ estimators perform 26.3%, 81.5%, 90.1%, and 30.5% better than their HAR-HK, HAR-LW, HAR-HSL, and HAR-KMS counterparts, respectively. Similar improvements are observed when heteroscedasticity levels are low and moderate, across different specifications of sample size, collinearity level, and number of independent variables. Overall, the bagging-based HAR estimators are efficient and outperform the baseline HAR estimators. Real-life applications are illustrated using livestock and passenger car mileage data. The results show that the suggested HAR-LW∗ (PRESS = 10.5028) and HAR-KMS∗ (PRESS = 0.0497) perform best on the passenger car and livestock data, respectively. These improvements will help in handling the estimation problem efficiently when both issues are present in the data.
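To make the general idea of combining bagging with a ridge estimator concrete, here is a minimal sketch in plain Python. It is not the paper's method: the abstract does not give the HAR-HK, HAR-LW, HAR-HSL, or HAR-KMS formulas, so the sketch uses the ordinary ridge estimator with a fixed, user-chosen biasing parameter `k` and no heteroscedasticity adjustment. The function names (`solve`, `ridge`, `bagged_ridge`, `simulate`), the pairs bootstrap, and the data-generating scheme in `simulate` are all assumptions made for illustration.

```python
import random


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def ridge(X, y, k):
    """Ordinary ridge estimate (X'X + kI)^{-1} X'y, without an intercept."""
    p = len(X[0])
    XtX = [[sum(row[i] * row[j] for row in X) + (k if i == j else 0.0)
            for j in range(p)] for i in range(p)]
    Xty = [sum(row[i] * yi for row, yi in zip(X, y)) for i in range(p)]
    return solve(XtX, Xty)


def bagged_ridge(X, y, k, n_boot=200, seed=0):
    """Average ridge estimates over bootstrap resamples of the (x, y) rows."""
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    total = [0.0] * p
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]  # sample rows with replacement
        beta = ridge([X[i] for i in idx], [y[i] for i in idx], k)
        total = [t + b for t, b in zip(total, beta)]
    return [t / n_boot for t in total]


def simulate(n=25, p=5, rho=0.8, seed=1):
    """Hypothetical data: predictors share a common component (weight rho),
    and the error spread grows with |x_1|. True coefficients are all 1."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        shared = rng.gauss(0, 1)
        row = [(1 - rho ** 2) ** 0.5 * rng.gauss(0, 1) + rho * shared
               for _ in range(p)]
        sd = 0.5 + abs(row[0])  # heteroscedastic error standard deviation
        X.append(row)
        y.append(sum(row) + rng.gauss(0, sd))
    return X, y


if __name__ == "__main__":
    X, y = simulate()
    print("single ridge:", ridge(X, y, k=0.5))
    print("bagged ridge:", bagged_ridge(X, y, k=0.5))
```

Averaging over resamples is the bagging step; swapping `ridge` for a heteroscedasticity-adjusted estimator with a data-driven `k` would move this sketch closer to the estimators the abstract describes, but the details of that step would have to come from the paper itself.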
About the journal:
Kuwait Journal of Science (KJS) is indexed and abstracted by major services such as Chemical Abstract, Science Citation Index, Current Contents, Mathematics Abstract, and Microbiological Abstracts. KJS publishes peer-reviewed articles in various fields of science, including Mathematics, Computer Science, Physics, Statistics, Biology, Chemistry, and Earth & Environmental Sciences. It also aims to bring the results of scientific research carried out under a variety of intellectual traditions and organizations to the attention of a specialized scholarly readership. Accordingly, the publisher expects submissions of original manuscripts that offer analysis of, and solutions to, important theoretical, empirical, and normative issues.