{"title":"基于Dart算法的光梯度增强机预测信用卡违约","authors":"Haoming Wang","doi":"10.1145/3582099.3582130","DOIUrl":null,"url":null,"abstract":"With the rapid development of financial services and technologies, credit cards have been increasingly used for personal daily consumption and small loans. However, bad debts caused by credit card defaults remarkably affect the healthy development of financial markets. Therefore, forecasting potential credit card defaults is of great significance with respect to financial stability and economic order. For this purpose, we propose a machine learning method based on Light Gradient Boosting Machine to detect credit card defaults in this paper. DART algorithm is utilized in our model instead of the traditional gradient boosting tree. The model is trained and evaluated using the dataset provided by American Express in the Kaggle competition American Express - Default Prediction. Based on feature analysis and engineering, raw data with 190 descriptors are transformed into data with 2358 descriptors, and are used to train 3 LightGBM models with different hyper-parameters. By applying the model ensemble and pseudo-label technique, the competition metric of our method reaches 0.80029/0.80767 on the public/private test set. This score ranks 106/4874 (top 2.2%), and can get a silver medal in the Kaggle competition.","PeriodicalId":222372,"journal":{"name":"Proceedings of the 2022 5th Artificial Intelligence and Cloud Computing Conference","volume":"777 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Forecasting Credit Card Defaults Using Light Gradient Boosting Machine with Dart Algorithm\",\"authors\":\"Haoming Wang\",\"doi\":\"10.1145/3582099.3582130\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rapid development of financial services and technologies, credit cards have been increasingly used for personal daily consumption and small loans. However, bad debts caused by credit card defaults remarkably affect the healthy development of financial markets. Therefore, forecasting potential credit card defaults is of great significance with respect to financial stability and economic order. For this purpose, we propose a machine learning method based on Light Gradient Boosting Machine to detect credit card defaults in this paper. DART algorithm is utilized in our model instead of the traditional gradient boosting tree. The model is trained and evaluated using the dataset provided by American Express in the Kaggle competition American Express - Default Prediction. Based on feature analysis and engineering, raw data with 190 descriptors are transformed into data with 2358 descriptors, and are used to train 3 LightGBM models with different hyper-parameters. By applying the model ensemble and pseudo-label technique, the competition metric of our method reaches 0.80029/0.80767 on the public/private test set. This score ranks 106/4874 (top 2.2%), and can get a silver medal in the Kaggle competition.\",\"PeriodicalId\":222372,\"journal\":{\"name\":\"Proceedings of the 2022 5th Artificial Intelligence and Cloud Computing Conference\",\"volume\":\"777 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 5th Artificial Intelligence and Cloud Computing Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3582099.3582130\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th Artificial Intelligence and Cloud Computing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3582099.3582130","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Forecasting Credit Card Defaults Using Light Gradient Boosting Machine with Dart Algorithm
With the rapid development of financial services and technologies, credit cards have been increasingly used for personal daily consumption and small loans. However, bad debts caused by credit card defaults remarkably affect the healthy development of financial markets. Therefore, forecasting potential credit card defaults is of great significance with respect to financial stability and economic order. For this purpose, we propose a machine learning method based on Light Gradient Boosting Machine to detect credit card defaults in this paper. DART algorithm is utilized in our model instead of the traditional gradient boosting tree. The model is trained and evaluated using the dataset provided by American Express in the Kaggle competition American Express - Default Prediction. Based on feature analysis and engineering, raw data with 190 descriptors are transformed into data with 2358 descriptors, and are used to train 3 LightGBM models with different hyper-parameters. By applying the model ensemble and pseudo-label technique, the competition metric of our method reaches 0.80029/0.80767 on the public/private test set. This score ranks 106/4874 (top 2.2%), and can get a silver medal in the Kaggle competition.