{"title":"BANKRUPTCY PREDICTION WITH MACHINE LEARNING MODELS","authors":"Yijing Wan","doi":"10.46609/ijsser.2023.v08i06.018","DOIUrl":null,"url":null,"abstract":"Individuals and businesses can eliminate their debts legally by applying for bankruptcy which can impact everyone involved from the investors to individual employees. Bankruptcy is also an indicator of the health of the economy. In this research, 6819 companies’ data was analyzed with 96 features for each company initially and the relationship between the features and whether the companies were bankrupt was investigated. Four machine learning algorithms were applied for the classification task: logistic regression, random forest, and XGBoost. Their performance was compared to find the best fit for the model. The accuracy of logistics regression was 0.66, of XGBoost was 0.80, and the random forest had the highest accuracy with a value of 0.93.","PeriodicalId":500023,"journal":{"name":"International journal of social science and economic research","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of social science and economic research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.46609/ijsser.2023.v08i06.018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Individuals and businesses can eliminate their debts legally by applying for bankruptcy which can impact everyone involved from the investors to individual employees. Bankruptcy is also an indicator of the health of the economy. In this research, 6819 companies’ data was analyzed with 96 features for each company initially and the relationship between the features and whether the companies were bankrupt was investigated. Four machine learning algorithms were applied for the classification task: logistic regression, random forest, and XGBoost. Their performance was compared to find the best fit for the model. The accuracy of logistics regression was 0.66, of XGBoost was 0.80, and the random forest had the highest accuracy with a value of 0.93.