{"title":"用机器学习模型预测安全银行的贷款行为","authors":"Mayank Anand, Arun Velu, P. Whig","doi":"10.36596/jcse.v3i1.237","DOIUrl":null,"url":null,"abstract":"Given loan default prediction has such a large impact on earnings, it is one of the most influential factor on credit score that banks and other financial organisations face. There have been several traditional methods for mining information about a loan application and some new machine learning methods of which, most of these methods appear to be failing, as the number of defaults in loans has increased. For loan default prediction, a variety of techniques such as Multiple Logistic Regression, Decision Tree, Random Forests, Gaussian Naive Bayes, Support Vector Machines, and other ensemble methods are presented in this research work. The prediction is based on loan data from multiple internet sources such as Kaggle, as well as data sets from the applicant's loan application. Significant evaluation measures including Confusion Matrix, Accuracy, Recall, Precision, F1- Score, ROC analysis area and Feature Importance has been calculated and shown in the results section. It is found that Extra Trees Classifier and Random Forest has highest Accuracy of using predictive modelling, this research concludes effectual results for loan credit disapproval on vulnerable consumers from a large number of loan applications","PeriodicalId":354823,"journal":{"name":"Journal of Computer Science and Engineering (JCSE)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Prediction of Loan Behaviour with Machine Learning Models for Secure Banking\",\"authors\":\"Mayank Anand, Arun Velu, P. Whig\",\"doi\":\"10.36596/jcse.v3i1.237\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Given loan default prediction has such a large impact on earnings, it is one of the most influential factor on credit score that banks and other financial organisations face. There have been several traditional methods for mining information about a loan application and some new machine learning methods of which, most of these methods appear to be failing, as the number of defaults in loans has increased. For loan default prediction, a variety of techniques such as Multiple Logistic Regression, Decision Tree, Random Forests, Gaussian Naive Bayes, Support Vector Machines, and other ensemble methods are presented in this research work. The prediction is based on loan data from multiple internet sources such as Kaggle, as well as data sets from the applicant's loan application. Significant evaluation measures including Confusion Matrix, Accuracy, Recall, Precision, F1- Score, ROC analysis area and Feature Importance has been calculated and shown in the results section. It is found that Extra Trees Classifier and Random Forest has highest Accuracy of using predictive modelling, this research concludes effectual results for loan credit disapproval on vulnerable consumers from a large number of loan applications\",\"PeriodicalId\":354823,\"journal\":{\"name\":\"Journal of Computer Science and Engineering (JCSE)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computer Science and Engineering (JCSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.36596/jcse.v3i1.237\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computer Science and Engineering (JCSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36596/jcse.v3i1.237","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
摘要
鉴于贷款违约预测对收益的影响如此之大,它是银行和其他金融机构面临的影响信用评分的最重要因素之一。有几种传统的方法来挖掘贷款申请的信息和一些新的机器学习方法,其中大多数方法似乎都失败了,因为贷款违约的数量增加了。对于贷款违约预测,本研究提出了多种技术,如多元逻辑回归、决策树、随机森林、高斯朴素贝叶斯、支持向量机和其他集成方法。该预测基于来自多个互联网来源(如Kaggle)的贷款数据,以及来自申请人贷款申请的数据集。计算了包括混淆矩阵(Confusion Matrix)、准确率(Accuracy)、召回率(Recall)、精度(Precision)、F1- Score、ROC分析面积(ROC analysis area)和特征重要性(Feature Importance)在内的重要评价指标,并显示在结果部分。研究发现,Extra Trees Classifier和Random Forest在使用预测建模时准确率最高,本研究从大量贷款申请中得出弱势消费者贷款信用不批准的有效结果
Prediction of Loan Behaviour with Machine Learning Models for Secure Banking
Given loan default prediction has such a large impact on earnings, it is one of the most influential factor on credit score that banks and other financial organisations face. There have been several traditional methods for mining information about a loan application and some new machine learning methods of which, most of these methods appear to be failing, as the number of defaults in loans has increased. For loan default prediction, a variety of techniques such as Multiple Logistic Regression, Decision Tree, Random Forests, Gaussian Naive Bayes, Support Vector Machines, and other ensemble methods are presented in this research work. The prediction is based on loan data from multiple internet sources such as Kaggle, as well as data sets from the applicant's loan application. Significant evaluation measures including Confusion Matrix, Accuracy, Recall, Precision, F1- Score, ROC analysis area and Feature Importance has been calculated and shown in the results section. It is found that Extra Trees Classifier and Random Forest has highest Accuracy of using predictive modelling, this research concludes effectual results for loan credit disapproval on vulnerable consumers from a large number of loan applications