Black hole algorithm as a heuristic approach for rare event classification problem
Author: Elif Yıldırım
DOI: 10.18187/pjsor.v19i4.4211 (https://doi.org/10.18187/pjsor.v19i4.4211)
Published: 2023-12-06
Logistic regression is generally preferred when the two possible outcomes of the event under study occur with comparable frequency. However, for rare events such as wars, economic crises, and natural disasters, whose occurrence frequency is small relative to that of ordinary events, logistic regression gives biased parameter estimates and therefore underestimates the probability of the rare event. In this study, we propose a black hole algorithm (BHA) approach for estimating the logistic regression parameters when one of the two outcome groups is rare, with the aim of obtaining unbiased parameter estimates, as an alternative to the classical logistic regression approach. For samples with different degrees of rareness, we obtain the parameter estimates together with their bias and root mean square error under both BHA and logistic regression, and compare them. We also investigate the classification performance of the two methods on a real-life data set. The results show that BHA gives less biased estimates than logistic regression on both the simulated and the real-life data.
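The abstract does not state the objective function, population size, or stopping rule used in the study, so the following is only a minimal sketch of how a generic black hole algorithm could search for logistic regression coefficients. It assumes the commonly described form of the heuristic: candidate solutions ("stars") drift toward the current best one (the "black hole"), and stars that fall inside an event-horizon radius are replaced by fresh random ones. The plain negative log-likelihood is used here as a stand-in objective, and the names `neg_log_likelihood` and `black_hole_fit`, the search bound, the simulated data, and all settings are illustrative assumptions rather than the author's implementation. The sketch shows the search mechanics only and does not reproduce the bias reduction reported above.

```python
import numpy as np

def neg_log_likelihood(betas, X, y):
    """Logistic negative log-likelihood for one or many coefficient vectors.

    Returns a 1-D array with one cost per row of `betas` (lower is better).
    """
    betas = np.atleast_2d(betas)
    Z = X @ betas.T                          # shape: (n_obs, n_candidates)
    # log(1 + exp(z)) evaluated stably via logaddexp
    return np.sum(np.logaddexp(0.0, Z) - y[:, None] * Z, axis=0)

def black_hole_fit(X, y, n_stars=30, n_iter=200, bound=10.0, seed=0):
    """Generic black hole search; every setting here is an assumed default."""
    rng = np.random.default_rng(seed)
    p = X.shape[1]
    stars = rng.uniform(-bound, bound, size=(n_stars, p))
    cost = neg_log_likelihood(stars, X, y)
    for _ in range(n_iter):
        bh = np.argmin(cost)                 # current black hole (best star)
        # Pull each star toward the black hole by a random fraction.
        # The black hole itself does not move, since its offset is zero.
        step = rng.uniform(0.0, 1.0, size=(n_stars, 1))
        stars = stars + step * (stars[bh] - stars)
        cost = neg_log_likelihood(stars, X, y)
        bh = np.argmin(cost)                 # a better star takes over the role
        # Event horizon: black hole cost divided by the total cost.
        radius = cost[bh] / cost.sum()
        dist = np.linalg.norm(stars - stars[bh], axis=1)
        swallowed = dist < radius
        swallowed[bh] = False                # the black hole is never replaced
        if swallowed.any():
            stars[swallowed] = rng.uniform(-bound, bound,
                                           size=(int(swallowed.sum()), p))
            cost[swallowed] = neg_log_likelihood(stars[swallowed], X, y)
    bh = np.argmin(cost)
    return stars[bh], cost[bh]

# Simulated rare-event data (assumed setup): an intercept of -4 makes
# events occur in only a few percent of the observations.
rng = np.random.default_rng(42)
n = 2000
X = np.column_stack([np.ones(n), rng.normal(size=n)])
true_beta = np.array([-4.0, 1.0])
prob = 1.0 / (1.0 + np.exp(-(X @ true_beta)))
y = (rng.uniform(size=n) < prob).astype(float)

beta_hat, nll_hat = black_hole_fit(X, y)
print("event rate:", y.mean())
print("estimated coefficients:", beta_hat)
```

Repeating the fit over many simulated samples and averaging `beta_hat - true_beta` would give an empirical bias, and the square root of the mean squared difference would give the root mean square error, which are the two quantities the abstract compares.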