An ensemble machine learning based bank loan approval predictions system with a smart application

Nazim Uddin , Md. Khabir Uddin Ahamed , Md Ashraf Uddin , Md. Manwarul Islam , Md. Alamin Talukder , Sunil Aryal
{"title":"An ensemble machine learning based bank loan approval predictions system with a smart application","authors":"Nazim Uddin ,&nbsp;Md. Khabir Uddin Ahamed ,&nbsp;Md Ashraf Uddin ,&nbsp;Md. Manwarul Islam ,&nbsp;Md. Alamin Talukder ,&nbsp;Sunil Aryal","doi":"10.1016/j.ijcce.2023.09.001","DOIUrl":null,"url":null,"abstract":"<div><p>Banks rely heavily on loans as a primary source of revenue; however, distinguishing deserving applicants who will reliably repay loans presents an ongoing challenge. Conventional selection processes often struggle to identify the most suitable candidates from a pool of loan applicants. In response to this challenge, we present an innovative machine learning (ML) based loan prediction system designed to identify qualified loan applicants autonomously. This comprehensive study encompasses data preprocessing, effective data balancing using SMOTE, and the implementation of diverse ML models, including Logistic Regression, Decision Tree, Random Forest, Extra Trees, Support Vector Machine, K-Nearest Neighbors, Gaussian Naive Bayes, AdaBoost, Gradient Boosting, and advanced deep learning models such as deep neural networks, recurrent neural networks, and long short-term memory models. The model's performance is rigorously assessed in terms of accuracy, recall, and F1_score. Our experimental analysis reveals that the Extra Trees outperforms its counterparts. Furthermore, we successfully predict bank loan defaulters through an ensemble voting model, which includes the top three ML models, achieving a remarkable 0.62% increase in accuracy compared to the Extra Trees. To facilitate user interaction, we have developed a user-friendly desktop-based application. Notably, our findings demonstrate that the voting-based ensemble model surpasses both individual ML models, including Extra Trees, and existing state-of-the-art approaches, achieving an impressive accuracy of 87.26%. This innovative system has the potential to significantly streamline and enhance the efficiency of bank loan approval processes, ultimately benefiting both financial institutions and loan applicants alike.</p></div>","PeriodicalId":100694,"journal":{"name":"International Journal of Cognitive Computing in Engineering","volume":"4 ","pages":"Pages 327-339"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Cognitive Computing in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666307423000293","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Banks rely heavily on loans as a primary source of revenue; however, distinguishing deserving applicants who will reliably repay loans presents an ongoing challenge. Conventional selection processes often struggle to identify the most suitable candidates from a pool of loan applicants. In response to this challenge, we present an innovative machine learning (ML) based loan prediction system designed to identify qualified loan applicants autonomously. This comprehensive study encompasses data preprocessing, effective data balancing using SMOTE, and the implementation of diverse ML models, including Logistic Regression, Decision Tree, Random Forest, Extra Trees, Support Vector Machine, K-Nearest Neighbors, Gaussian Naive Bayes, AdaBoost, Gradient Boosting, and advanced deep learning models such as deep neural networks, recurrent neural networks, and long short-term memory models. The model's performance is rigorously assessed in terms of accuracy, recall, and F1_score. Our experimental analysis reveals that the Extra Trees outperforms its counterparts. Furthermore, we successfully predict bank loan defaulters through an ensemble voting model, which includes the top three ML models, achieving a remarkable 0.62% increase in accuracy compared to the Extra Trees. To facilitate user interaction, we have developed a user-friendly desktop-based application. Notably, our findings demonstrate that the voting-based ensemble model surpasses both individual ML models, including Extra Trees, and existing state-of-the-art approaches, achieving an impressive accuracy of 87.26%. This innovative system has the potential to significantly streamline and enhance the efficiency of bank loan approval processes, ultimately benefiting both financial institutions and loan applicants alike.

基于智能应用程序的集成机器学习的银行贷款审批预测系统
银行严重依赖贷款作为主要收入来源;然而,区分能够可靠偿还贷款的合格申请人是一项持续的挑战。传统的选拔过程往往很难从贷款申请人中找到最合适的候选人。为了应对这一挑战,我们提出了一种创新的基于机器学习(ML)的贷款预测系统,旨在自主识别合格的贷款申请人。这项全面的研究包括数据预处理、使用SMOTE的有效数据平衡,以及各种ML模型的实现,包括逻辑回归、决策树、随机森林、额外树、支持向量机、K近邻、高斯朴素贝叶斯、AdaBoost、梯度提升,以及高级深度学习模型,如深度神经网络、递归神经网络,以及长短期记忆模型。该模型的性能在准确性、召回率和F1_score方面得到了严格评估。我们的实验分析表明,Extra Trees的性能优于其同类产品。此外,我们通过集合投票模型成功预测了银行贷款违约者,该模型包括前三个ML模型,与Extra Trees相比,准确率显著提高了0.62%。为了方便用户交互,我们开发了一个用户友好的基于桌面的应用程序。值得注意的是,我们的研究结果表明,基于投票的集成模型超越了包括Extra Trees在内的单个ML模型和现有最先进的方法,实现了87.26%的令人印象深刻的准确率。这一创新系统有可能显著简化和提高银行贷款审批流程的效率,最终使金融机构和贷款申请人都受益。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
13.80
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信