Credit risk analysis using boosting methods

IF 0.5 Q4 MATHEMATICS, APPLIED

Journal of Applied Mathematics Statistics and Informatics Pub Date : 2023-05-01 DOI:10.2478/jamsi-2023-0001

S. Coşkun, M. Turanli

{"title":"Credit risk analysis using boosting methods","authors":"S. Coşkun, M. Turanli","doi":"10.2478/jamsi-2023-0001","DOIUrl":null,"url":null,"abstract":"Abstract The use of credit for various occasions has become a routine in our lives. In return, banking and financial institutions require to determine whether the loan demands from them contain any risk. Accordingly, these institutions have been increased their activities in determining whether credit rating models from past credit records of the person applying for the loan works properly. Machine learning-based technologies have opened a new era in this field. AI and machine learning based methods for credit scoring are currently implemented by banking or non-banking financial institutions. Employed models are to extract meaningful features from the required data in which wide variety of information available. In this study, credit risk assessment is conducted using boosting methods such as CatBoost, XGBoost and Light GBM. To this aim, Kaggle Home Credit Default Risk dataset is used and the effect of crediting tendency on the results is also considered. The results have shown that gradient boosting methods provide results that are close to each other, and crediting tendency produces better AUC score in CatBoost while it causes a small decrement in AUC score of XGBoost and LightGBM.","PeriodicalId":43016,"journal":{"name":"Journal of Applied Mathematics Statistics and Informatics","volume":"19 1","pages":"5 - 18"},"PeriodicalIF":0.5000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Mathematics Statistics and Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/jamsi-2023-0001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract The use of credit for various occasions has become a routine in our lives. In return, banking and financial institutions require to determine whether the loan demands from them contain any risk. Accordingly, these institutions have been increased their activities in determining whether credit rating models from past credit records of the person applying for the loan works properly. Machine learning-based technologies have opened a new era in this field. AI and machine learning based methods for credit scoring are currently implemented by banking or non-banking financial institutions. Employed models are to extract meaningful features from the required data in which wide variety of information available. In this study, credit risk assessment is conducted using boosting methods such as CatBoost, XGBoost and Light GBM. To this aim, Kaggle Home Credit Default Risk dataset is used and the effect of crediting tendency on the results is also considered. The results have shown that gradient boosting methods provide results that are close to each other, and crediting tendency produces better AUC score in CatBoost while it causes a small decrement in AUC score of XGBoost and LightGBM.

查看原文本刊更多论文

使用助推方法进行信用风险分析

在各种场合使用信用已经成为我们生活中的一种惯例。作为回报，银行和金融机构需要确定来自它们的贷款要求是否包含任何风险。因此，这些机构加大了以贷款申请者的信用记录为基础的信用等级模型是否正常运行的调查力度。基于机器学习的技术开启了这一领域的新时代。基于人工智能和机器学习的信用评分方法目前由银行或非银行金融机构实施。所使用的模型是从所需的数据中提取有意义的特征，其中有各种各样的信息可用。本研究采用CatBoost、XGBoost、Light GBM等助推方法进行信用风险评估。为此，使用Kaggle家庭信用违约风险数据集，并考虑了信贷倾向对结果的影响。结果表明，梯度增强方法提供的结果彼此接近，credit倾向在CatBoost中产生更好的AUC分数，而在XGBoost和LightGBM中产生较小的AUC分数下降。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Applied Mathematics Statistics and Informatics MATHEMATICS, APPLIED-

自引率

0.00%

发文量

审稿时长

20 weeks