A machine learning approach for predicting bank credit worthiness

Regina Esi Turkson, E. Baagyere, Gideon Evans Wenya
{"title":"A machine learning approach for predicting bank credit worthiness","authors":"Regina Esi Turkson, E. Baagyere, Gideon Evans Wenya","doi":"10.1109/ICAIPR.2016.7585216","DOIUrl":null,"url":null,"abstract":"Machine learning is an emerging technique for building analytic models for machines to \"learn\" from data and be able to do predictive analysis. The ability of machines to \"learn\" and do predictive analysis is very important in this era of big data and it has a wide range of application areas. For instance, banks and financial institutions are sometimes faced with the challenge of what risk factors to consider when advancing credit/loans to customers. For several features/attributes of the customers are normally taken into consideration, but most of these features have little predictive effect on the credit worthiness or otherwise of the customer. Furthermore, a robust and effective automated bank credit risk score that can aid in the prediction of customer credit worthiness very accurately is still a major challenge facing many banks. In this paper, we examine a real bank credit data and conduct several machine learning algorithms on the data for comparative analysis and to choose which algorithms are the best fit for learning bank credit data. The algorithms gave over 80% accuracy in prediction. Furthermore, the most important features that determine whether a customer will default or otherwise in paying his/her credit the next month are extracted from a total of 23 features. We then applied these most important features on some selected machine learning algorithms and compare their predictive accuracy with the other algorithms that used all the 23 features. The results show no significant di erence, signifying that these features can accurately determine the credit worthiness of the customers. Finally, we formulate a predictive model using the most important features to predict the credit worthiness of a given customer.","PeriodicalId":127231,"journal":{"name":"2016 Third International Conference on Artificial Intelligence and Pattern Recognition (AIPR)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"33","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Third International Conference on Artificial Intelligence and Pattern Recognition (AIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIPR.2016.7585216","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 33

Abstract

Machine learning is an emerging technique for building analytic models for machines to "learn" from data and be able to do predictive analysis. The ability of machines to "learn" and do predictive analysis is very important in this era of big data and it has a wide range of application areas. For instance, banks and financial institutions are sometimes faced with the challenge of what risk factors to consider when advancing credit/loans to customers. For several features/attributes of the customers are normally taken into consideration, but most of these features have little predictive effect on the credit worthiness or otherwise of the customer. Furthermore, a robust and effective automated bank credit risk score that can aid in the prediction of customer credit worthiness very accurately is still a major challenge facing many banks. In this paper, we examine a real bank credit data and conduct several machine learning algorithms on the data for comparative analysis and to choose which algorithms are the best fit for learning bank credit data. The algorithms gave over 80% accuracy in prediction. Furthermore, the most important features that determine whether a customer will default or otherwise in paying his/her credit the next month are extracted from a total of 23 features. We then applied these most important features on some selected machine learning algorithms and compare their predictive accuracy with the other algorithms that used all the 23 features. The results show no significant di erence, signifying that these features can accurately determine the credit worthiness of the customers. Finally, we formulate a predictive model using the most important features to predict the credit worthiness of a given customer.
预测银行信用价值的机器学习方法
机器学习是一种新兴技术,用于为机器构建分析模型,以便从数据中“学习”并能够进行预测分析。在这个大数据时代,机器“学习”和预测分析的能力非常重要,并且具有广泛的应用领域。例如,银行和金融机构有时面临的挑战是,在向客户提供信贷/贷款时要考虑哪些风险因素。通常会考虑客户的一些特征/属性,但大多数这些特征对客户的信用价值或其他方面几乎没有预测作用。此外,一个强大而有效的自动化银行信用风险评分,可以帮助非常准确地预测客户的信用价值,仍然是许多银行面临的主要挑战。在本文中,我们研究了一个真实的银行信贷数据,并对数据进行了几种机器学习算法进行比较分析,并选择哪些算法最适合学习银行信贷数据。该算法的预测准确率在80%以上。此外,从总共23个特征中提取出决定客户是否会在下个月违约或以其他方式支付其信用的最重要特征。然后,我们将这些最重要的特征应用于一些选定的机器学习算法,并将它们的预测准确性与使用所有23个特征的其他算法进行比较。结果显示没有显著差异,表明这些特征可以准确地判断客户的信用价值。最后,我们制定了一个预测模型,使用最重要的特征来预测给定客户的信用价值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信