Loan Eligibility Prediction using Machine Learning based on Personal Information

M. Meenaakumari, P. Jayasuriya, Nasa Dhanraj, Seema Sharma, Geetha Manoharan, M. Tiwari
{"title":"Loan Eligibility Prediction using Machine Learning based on Personal Information","authors":"M. Meenaakumari, P. Jayasuriya, Nasa Dhanraj, Seema Sharma, Geetha Manoharan, M. Tiwari","doi":"10.1109/IC3I56241.2022.10073318","DOIUrl":null,"url":null,"abstract":"Banks serves the basic necessities of everyone next to hospitals and schools. People reach out to banks for various purposes. But one of the most common services offered by banks is loans. However, many common people are not completely aware of the banking procedures and eligibility criteria for loans. This study aims to develop a Machine Learning (ML) model which is capable of predicting whether the person is eligible for a health loan or not by analyzing some basic values entered by the user. For this process, a dataset consisting of all necessary parameters for a loan application is collected from Kaggle. The collected dataset is then preprocessed by two methods namely the null value elimination method and encoding. Simultaneously, three ML models were developed using three different algorithms. They are the Random Forest (RF), Naive Bayes (NB), and Linear Regression (LR). The preprocessed data will next be used to train the models. Following that, a comparison of a few parameters will be used to assess the models' effectiveness. The results of the analysis prove that the RF algorithm is the best in terms of both accuracy and error. The accuracy of the RF algorithm is 91% and it also predicts loan eligibility with lesser error values. The LR model has the lowest accuracy values and the highest error value making it the least efficient algorithm that can be used in loan prediction.","PeriodicalId":274660,"journal":{"name":"2022 5th International Conference on Contemporary Computing and Informatics (IC3I)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th International Conference on Contemporary Computing and Informatics (IC3I)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3I56241.2022.10073318","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Banks serves the basic necessities of everyone next to hospitals and schools. People reach out to banks for various purposes. But one of the most common services offered by banks is loans. However, many common people are not completely aware of the banking procedures and eligibility criteria for loans. This study aims to develop a Machine Learning (ML) model which is capable of predicting whether the person is eligible for a health loan or not by analyzing some basic values entered by the user. For this process, a dataset consisting of all necessary parameters for a loan application is collected from Kaggle. The collected dataset is then preprocessed by two methods namely the null value elimination method and encoding. Simultaneously, three ML models were developed using three different algorithms. They are the Random Forest (RF), Naive Bayes (NB), and Linear Regression (LR). The preprocessed data will next be used to train the models. Following that, a comparison of a few parameters will be used to assess the models' effectiveness. The results of the analysis prove that the RF algorithm is the best in terms of both accuracy and error. The accuracy of the RF algorithm is 91% and it also predicts loan eligibility with lesser error values. The LR model has the lowest accuracy values and the highest error value making it the least efficient algorithm that can be used in loan prediction.
基于个人信息的机器学习贷款资格预测
银行为医院和学校附近的每个人提供基本必需品。人们出于各种目的向银行求助。但银行提供的最常见的服务之一是贷款。然而,许多普通人并不完全了解银行贷款的程序和资格标准。本研究旨在开发一个机器学习(ML)模型,该模型能够通过分析用户输入的一些基本值来预测该人是否有资格获得医疗贷款。对于这个过程,将从Kaggle收集一个包含贷款申请所有必要参数的数据集。然后对收集到的数据集进行空值消除法和编码两种方法的预处理。同时,使用三种不同的算法开发了三个ML模型。它们是随机森林(RF)、朴素贝叶斯(NB)和线性回归(LR)。预处理后的数据将用于训练模型。接下来,将使用几个参数的比较来评估模型的有效性。分析结果表明,射频算法在精度和误差方面都是最好的。RF算法的准确率为91%,并且它还能以较小的误差值预测贷款资格。LR模型具有最低的准确度值和最高的误差值,使其成为可用于贷款预测的效率最低的算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信