{"title":"Forecasting Credit Card Defaults Using Light Gradient Boosting Machine with Dart Algorithm","authors":"Haoming Wang","doi":"10.1145/3582099.3582130","DOIUrl":null,"url":null,"abstract":"With the rapid development of financial services and technologies, credit cards have been increasingly used for personal daily consumption and small loans. However, bad debts caused by credit card defaults remarkably affect the healthy development of financial markets. Therefore, forecasting potential credit card defaults is of great significance with respect to financial stability and economic order. For this purpose, we propose a machine learning method based on Light Gradient Boosting Machine to detect credit card defaults in this paper. DART algorithm is utilized in our model instead of the traditional gradient boosting tree. The model is trained and evaluated using the dataset provided by American Express in the Kaggle competition American Express - Default Prediction. Based on feature analysis and engineering, raw data with 190 descriptors are transformed into data with 2358 descriptors, and are used to train 3 LightGBM models with different hyper-parameters. By applying the model ensemble and pseudo-label technique, the competition metric of our method reaches 0.80029/0.80767 on the public/private test set. This score ranks 106/4874 (top 2.2%), and can get a silver medal in the Kaggle competition.","PeriodicalId":222372,"journal":{"name":"Proceedings of the 2022 5th Artificial Intelligence and Cloud Computing Conference","volume":"777 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th Artificial Intelligence and Cloud Computing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3582099.3582130","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the rapid development of financial services and technologies, credit cards have been increasingly used for personal daily consumption and small loans. However, bad debts caused by credit card defaults remarkably affect the healthy development of financial markets. Therefore, forecasting potential credit card defaults is of great significance with respect to financial stability and economic order. For this purpose, we propose a machine learning method based on Light Gradient Boosting Machine to detect credit card defaults in this paper. DART algorithm is utilized in our model instead of the traditional gradient boosting tree. The model is trained and evaluated using the dataset provided by American Express in the Kaggle competition American Express - Default Prediction. Based on feature analysis and engineering, raw data with 190 descriptors are transformed into data with 2358 descriptors, and are used to train 3 LightGBM models with different hyper-parameters. By applying the model ensemble and pseudo-label technique, the competition metric of our method reaches 0.80029/0.80767 on the public/private test set. This score ranks 106/4874 (top 2.2%), and can get a silver medal in the Kaggle competition.