Black hole algorithm as a heuristic approach for rare event classification problem
Elif Yıldırım
Pakistan Journal of Statistics and Operation Research, published 2023-12-06
DOI: https://doi.org/10.18187/pjsor.v19i4.4211
Citation count: 0
Abstract
Logistic regression is generally preferred when the two possible outcomes of an event occur with similar frequencies. For rare events such as wars, economic crises and natural disasters, whose occurrence frequency is small relative to that of common events, logistic regression gives biased parameter estimates and therefore underestimates the probability that the rare event occurs. In this study, we propose a black hole algorithm (BHA) approach to estimate the logistic regression parameters when one of the two outcome groups is rare, with the aim of obtaining unbiased estimates instead of relying on the classical logistic regression approach. For samples with different degrees of rareness, we obtain the parameter estimates from BHA and from logistic regression, compute their bias and root mean square error, and compare them. We also investigate the classification performance of the two methods on a real-life data set. The results show that BHA gives less biased estimates than logistic regression on both the simulated and the real-life data.
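The abstract does not state the objective function, population size, search bounds or stopping rule used in the study, so the following is only a minimal sketch of how a black hole algorithm search over logistic regression coefficients might look. It assumes the ordinary negative log-likelihood as the fitness (lower is better), a commonly described form of the algorithm (stars drift toward the best star, and stars inside the event horizon are re-drawn at random), and hypothetical names and defaults (`black_hole_fit`, `n_stars`, `n_iter`, `bound`). Because it optimizes the plain likelihood, it illustrates the search mechanics only and is not expected to reproduce the bias reduction reported above.

```python
import numpy as np


def neg_log_likelihood(beta, X, y):
    """Negative Bernoulli log-likelihood of a logistic model (lower is better)."""
    z = X @ beta
    # log(1 + exp(z)) computed stably with logaddexp
    return float(np.sum(np.logaddexp(0.0, z) - y * z))


def black_hole_fit(X, y, n_stars=30, n_iter=200, bound=10.0, seed=0):
    """Sketch of a black hole search for beta in [-bound, bound]^p (assumed setup)."""
    rng = np.random.default_rng(seed)
    p = X.shape[1]
    stars = rng.uniform(-bound, bound, size=(n_stars, p))
    fitness = np.array([neg_log_likelihood(s, X, y) for s in stars])

    for _ in range(n_iter):
        bh = int(np.argmin(fitness))  # the best star acts as the black hole
        # Each star moves a random fraction of the way toward the black hole;
        # the black hole itself does not move (its displacement is zero).
        step = rng.uniform(0.0, 1.0, size=(n_stars, 1))
        stars = stars + step * (stars[bh] - stars)
        fitness = np.array([neg_log_likelihood(s, X, y) for s in stars])

        bh = int(np.argmin(fitness))  # a star that became better takes over
        # Event horizon radius: black hole fitness over total fitness (assumption).
        radius = fitness[bh] / fitness.sum()
        dist = np.linalg.norm(stars - stars[bh], axis=1)
        swallowed = dist < radius
        swallowed[bh] = False  # never replace the black hole itself
        n_new = int(swallowed.sum())
        if n_new:
            # Swallowed stars are replaced by fresh random candidates.
            stars[swallowed] = rng.uniform(-bound, bound, size=(n_new, p))
            fitness[swallowed] = [
                neg_log_likelihood(s, X, y) for s in stars[swallowed]
            ]

    bh = int(np.argmin(fitness))
    return stars[bh], float(fitness[bh])


# Usage on simulated rare-event data (illustrative values, not from the paper).
rng = np.random.default_rng(1)
n = 2000
X = np.column_stack([np.ones(n), rng.normal(size=n)])  # intercept + one covariate
true_beta = np.array([-4.0, 1.0])  # a strongly negative intercept makes events rare
prob = 1.0 / (1.0 + np.exp(-(X @ true_beta)))
y = (rng.random(n) < prob).astype(float)

beta_hat, nll = black_hole_fit(X, y)
print("event rate:", y.mean())
print("BHA estimate:", beta_hat, "negative log-likelihood:", nll)
```

Repeating the fit over many simulated samples and averaging `beta_hat - true_beta` (and its squared value) would give bias and root mean square error figures of the kind the study compares, again under the assumptions stated here.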
About the journal:
Pakistan Journal of Statistics and Operation Research (PJSOR) is a peer-reviewed journal published four times a year. It publishes refereed research articles and studies describing the latest research and developments in statistics, operation research and actuarial statistics.