A Novel Predictive Model for Housing Loan Default using Feature Generation and Explainable AI

M. Mahyoub, Shatha Ghareeb, J. Mustafina
{"title":"A Novel Predictive Model for Housing Loan Default using Feature Generation and Explainable AI","authors":"M. Mahyoub, Shatha Ghareeb, J. Mustafina","doi":"10.1109/DeSE58274.2023.10099796","DOIUrl":null,"url":null,"abstract":"Home Loan plays a pivotal role in today's age when one steps into purchasing their home. It has been witnessed that in many cases users are unable to pay the after taking the loan and thus the loan is slipped to NPA(Non-Performing Asset) from Standard Asset for the bank or any lending institution. The revenue generation is ceased. As the housing loan is taken against property the lenders have right to sell the property and close the dues, but the process is lengthy as judicial procedures are involved. In most cases, the property value is much less than the calculated loan amount (Principal + Interest). In this study we examined the several ML methods to identify the loan default before disbursing the loan to the applicant. This matter has been studied widely and used the predictive analytics to find out the relationship between attributes and the target variable. Predictive Analytics enables us to feed optimal set of features to the ML models. The study started with 122 attributes and ended up with around 30% of features as the ideal subset for housing loan default prediction. Then, five ML models were fit into the dataset and the champion model came up with roc score 0.94, Recall 0.90 and Precision 0.94. LIME and SHAP were applied on the champion model along with the dataset for global and local interpretability. The experimental procedure concluded that ML models along with predictive analytics can arrest the loan disbursal to the ineligible applicants and will also provide the insight of such prediction with the help of model interpretability.","PeriodicalId":346847,"journal":{"name":"2023 15th International Conference on Developments in eSystems Engineering (DeSE)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 15th International Conference on Developments in eSystems Engineering (DeSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DeSE58274.2023.10099796","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Home Loan plays a pivotal role in today's age when one steps into purchasing their home. It has been witnessed that in many cases users are unable to pay the after taking the loan and thus the loan is slipped to NPA(Non-Performing Asset) from Standard Asset for the bank or any lending institution. The revenue generation is ceased. As the housing loan is taken against property the lenders have right to sell the property and close the dues, but the process is lengthy as judicial procedures are involved. In most cases, the property value is much less than the calculated loan amount (Principal + Interest). In this study we examined the several ML methods to identify the loan default before disbursing the loan to the applicant. This matter has been studied widely and used the predictive analytics to find out the relationship between attributes and the target variable. Predictive Analytics enables us to feed optimal set of features to the ML models. The study started with 122 attributes and ended up with around 30% of features as the ideal subset for housing loan default prediction. Then, five ML models were fit into the dataset and the champion model came up with roc score 0.94, Recall 0.90 and Precision 0.94. LIME and SHAP were applied on the champion model along with the dataset for global and local interpretability. The experimental procedure concluded that ML models along with predictive analytics can arrest the loan disbursal to the ineligible applicants and will also provide the insight of such prediction with the help of model interpretability.
基于特征生成和可解释人工智能的住房贷款违约预测模型
住房贷款起着举足轻重的作用,在今天的时代,当一个人进入购买自己的房子。在许多情况下,用户在获得贷款后无法偿还贷款,因此贷款从银行或任何贷款机构的标准资产滑入NPA(不良资产)。创收停止了。由于住房贷款是以财产为抵押的,贷款人有权出售财产并结清欠款,但由于涉及司法程序,这一过程很漫长。在大多数情况下,物业价值远低于计算的贷款金额(本金+利息)。在本研究中,我们检查了几种ML方法,以便在向申请人支付贷款之前识别贷款违约。这一问题得到了广泛的研究,并利用预测分析来找出属性与目标变量之间的关系。预测分析使我们能够为ML模型提供最优的特征集。这项研究从122个属性开始,最终得到了大约30%的特征,作为住房贷款违约预测的理想子集。然后,将5个ML模型拟合到数据集中,冠军模型的roc得分为0.94,召回率为0.90,精度为0.94。LIME和SHAP与数据集一起应用于冠军模型,以获得全局和局部可解释性。实验过程得出的结论是,机器学习模型和预测分析可以阻止贷款发放给不合格的申请人,并且还将在模型可解释性的帮助下提供这种预测的洞察力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信