A novel credit scoring system in financial institutions using artificial intelligence technology

自主智能(英文) Pub Date : 2023-08-22 DOI:10.32629/jai.v6i2.824

Geethamanikanta Jakka, Amrutanshu Panigrahi, Abhilash Pati, M. N. Das, Jyotsnarani Tripathy

{"title":"A novel credit scoring system in financial institutions using artificial intelligence technology","authors":"Geethamanikanta Jakka, Amrutanshu Panigrahi, Abhilash Pati, M. N. Das, Jyotsnarani Tripathy","doi":"10.32629/jai.v6i2.824","DOIUrl":null,"url":null,"abstract":"In order to evaluate a person’s or a company’s creditworthiness, financial institutions must use credit scoring. Traditional credit scoring algorithms frequently rely on manual and rule-based methods, which can be tedious and inaccurate. Recent developments in artificial intelligence (AI) technology have opened up possibilities for creating more reliable and effective credit rating systems. The data are pre-processed, including scaling using the 0–1 normalization method and resolving missing values by imputation. Information gain (IG), gain ratio (GR), and chi-square are three feature selection methodologies covered in the study. While GR normalizes IG by dividing it by the total entropy of the feature, IG quantifies the reduction in total entropy by adding a new feature. Based on chi-squared statistics, the most vital traits are determined using chi-square. This research employs different ML models to develop a hybrid model for credit score prediction. The ML algorithms support vector machine (SVM), neural networks (NNs), decision trees (DTs), random forest (RF), and logistic regression (LR) classifiers are employed here for experiments along with IG, GR, and chi-square feature selection methodologies for credit prediction over Australian and German datasets. The study offers an understanding of the decision-making process for informative characteristics and the functionality of machine learning (ML) in credit prediction tasks. The empirical analysis shows that in the case of the German dataset, the DT with GR feature selection and hyperparameter optimization outperforms SVM and NN with an accuracy of 99.78%. For the Australian dataset, SVM with GR feature selection outperforms NN and DT with an accuracy of 99.98%.","PeriodicalId":70721,"journal":{"name":"自主智能(英文)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"自主智能(英文)","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.32629/jai.v6i2.824","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

In order to evaluate a person’s or a company’s creditworthiness, financial institutions must use credit scoring. Traditional credit scoring algorithms frequently rely on manual and rule-based methods, which can be tedious and inaccurate. Recent developments in artificial intelligence (AI) technology have opened up possibilities for creating more reliable and effective credit rating systems. The data are pre-processed, including scaling using the 0–1 normalization method and resolving missing values by imputation. Information gain (IG), gain ratio (GR), and chi-square are three feature selection methodologies covered in the study. While GR normalizes IG by dividing it by the total entropy of the feature, IG quantifies the reduction in total entropy by adding a new feature. Based on chi-squared statistics, the most vital traits are determined using chi-square. This research employs different ML models to develop a hybrid model for credit score prediction. The ML algorithms support vector machine (SVM), neural networks (NNs), decision trees (DTs), random forest (RF), and logistic regression (LR) classifiers are employed here for experiments along with IG, GR, and chi-square feature selection methodologies for credit prediction over Australian and German datasets. The study offers an understanding of the decision-making process for informative characteristics and the functionality of machine learning (ML) in credit prediction tasks. The empirical analysis shows that in the case of the German dataset, the DT with GR feature selection and hyperparameter optimization outperforms SVM and NN with an accuracy of 99.78%. For the Australian dataset, SVM with GR feature selection outperforms NN and DT with an accuracy of 99.98%.

查看原文本刊更多论文

一种基于人工智能技术的金融机构信用评分系统

为了评估一个人或一家公司的信誉，金融机构必须使用信用评分。传统的信用评分算法经常依赖于手动和基于规则的方法，这可能是乏味和不准确的。人工智能技术的最新发展为创建更可靠、更有效的信用评级系统开辟了可能性。数据经过预处理，包括使用0–1归一化方法进行缩放和通过插补解决缺失值。信息增益（IG）、增益比（GR）和卡方是本研究涵盖的三种特征选择方法。GR通过将IG除以特征的总熵来归一化IG，而IG通过添加新特征来量化总熵的减少。基于卡方统计，最重要的特征是使用卡方来确定的。本研究采用不同的ML模型来开发信用评分预测的混合模型。ML算法支持向量机（SVM）、神经网络（NNs）、决策树（DTs）、随机森林（RF）和逻辑回归（LR）分类器与IG、GR和卡方特征选择方法一起用于澳大利亚和德国数据集上的信用预测实验。该研究提供了对信息特征的决策过程以及机器学习（ML）在信用预测任务中的功能的理解。实证分析表明，在德国数据集的情况下，具有GR特征选择和超参数优化的DT优于SVM和NN，准确率为99.78%。在澳大利亚数据集，带有GR特征选择的SVM优于NN和DT，准确率达99.98%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

自主智能(英文)

CiteScore

0.40

自引率

0.00%

发文量