{"title":"An Empirical Study on Bankruptcy Prediction using Ensemble Learning","authors":"Hoang Luu Quang Tien, L. Tran, Trong-Hop Do","doi":"10.1109/RIVF55975.2022.10013848","DOIUrl":null,"url":null,"abstract":"Bankruptcy prediction helps to assess the financial condition of a company and its future perspectives within the context of long-term operation on the market. Using machine learning to solve this problem can be a time-efficient and cost-effective approach. This paper introduces an approach using Ensemble learning methods to tackle the bankruptcy classification problem, which achieved the fourth position on the leader board of The 3rd Annual International Data Science & AI Competition 2022 - Structured Data Track. The data set given by the committee of the competition has a lot of challenges so we perform some preprocessing and feature engineering techniques to make the data set become cleaner for modelling. We use three ensemble algorithms, namely Random Forest, Catboost, and LightGBM to compare the performance of three algorithms on the bankruptcy classification problem and find the best result to submit to the competition. After experimenting, we achieve the best result at 98.21% Accuracy on the private leader board of the competition. The result comes from the LightGBM model trained on the data set which is enhanced through feature engineering techniques.","PeriodicalId":171525,"journal":{"name":"Conference on Research, Innovation and Vision for the Future in Computing & Communication Technologies","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Research, Innovation and Vision for the Future in Computing & Communication Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RIVF55975.2022.10013848","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Bankruptcy prediction helps to assess the financial condition of a company and its future perspectives within the context of long-term operation on the market. Using machine learning to solve this problem can be a time-efficient and cost-effective approach. This paper introduces an approach using Ensemble learning methods to tackle the bankruptcy classification problem, which achieved the fourth position on the leader board of The 3rd Annual International Data Science & AI Competition 2022 - Structured Data Track. The data set given by the committee of the competition has a lot of challenges so we perform some preprocessing and feature engineering techniques to make the data set become cleaner for modelling. We use three ensemble algorithms, namely Random Forest, Catboost, and LightGBM to compare the performance of three algorithms on the bankruptcy classification problem and find the best result to submit to the competition. After experimenting, we achieve the best result at 98.21% Accuracy on the private leader board of the competition. The result comes from the LightGBM model trained on the data set which is enhanced through feature engineering techniques.