{"title":"Credit risk analysis using boosting methods","authors":"S. Coşkun, M. Turanli","doi":"10.2478/jamsi-2023-0001","DOIUrl":null,"url":null,"abstract":"Abstract The use of credit for various occasions has become a routine in our lives. In return, banking and financial institutions require to determine whether the loan demands from them contain any risk. Accordingly, these institutions have been increased their activities in determining whether credit rating models from past credit records of the person applying for the loan works properly. Machine learning-based technologies have opened a new era in this field. AI and machine learning based methods for credit scoring are currently implemented by banking or non-banking financial institutions. Employed models are to extract meaningful features from the required data in which wide variety of information available. In this study, credit risk assessment is conducted using boosting methods such as CatBoost, XGBoost and Light GBM. To this aim, Kaggle Home Credit Default Risk dataset is used and the effect of crediting tendency on the results is also considered. The results have shown that gradient boosting methods provide results that are close to each other, and crediting tendency produces better AUC score in CatBoost while it causes a small decrement in AUC score of XGBoost and LightGBM.","PeriodicalId":43016,"journal":{"name":"Journal of Applied Mathematics Statistics and Informatics","volume":"19 1","pages":"5 - 18"},"PeriodicalIF":0.3000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Mathematics Statistics and Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/jamsi-2023-0001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract The use of credit for various occasions has become a routine in our lives. In return, banking and financial institutions require to determine whether the loan demands from them contain any risk. Accordingly, these institutions have been increased their activities in determining whether credit rating models from past credit records of the person applying for the loan works properly. Machine learning-based technologies have opened a new era in this field. AI and machine learning based methods for credit scoring are currently implemented by banking or non-banking financial institutions. Employed models are to extract meaningful features from the required data in which wide variety of information available. In this study, credit risk assessment is conducted using boosting methods such as CatBoost, XGBoost and Light GBM. To this aim, Kaggle Home Credit Default Risk dataset is used and the effect of crediting tendency on the results is also considered. The results have shown that gradient boosting methods provide results that are close to each other, and crediting tendency produces better AUC score in CatBoost while it causes a small decrement in AUC score of XGBoost and LightGBM.