Prediction of Bank Performance Using Machine Learning Classifiers Optimized by Genetic Algorithm

Ummey Hany Ainan, Md. Nur-E-Arefin
{"title":"Prediction of Bank Performance Using Machine Learning Classifiers Optimized by Genetic Algorithm","authors":"Ummey Hany Ainan, Md. Nur-E-Arefin","doi":"10.1109/icaeee54957.2022.9836523","DOIUrl":null,"url":null,"abstract":"Bank performance is defined as the reflection of the way by which the assets of the bank are utilized in a form which enables it to accomplice its targets. Economic development highly depends on the functionalities of the banks. In past statistical approach is used to predict bank performance. Nowadays Machine Learning (ML) approaches are used in banking sector for better accuracy. In this work three famous Machine Learning classifiers named Random Forest (RF), Support Vector Machine (SVM) and Logistic Regression (LR) are used to find out the bank performance. The dataset used in this work are consist of 50 Turkish banks, 30 American banks and 20 European banks. The data have 24 performance indicators that measures performance from the year of 2010 to 2020. CAMEL technique is applied in this dataset in order to find ratings of the banks. In this study Genetic Algorithm (GA) plays a vital role. GA is used as optimizer and feature selector. At the end the models are evaluated with and without feature selection as well as with and without optimization. In this study SVM with optimization but without feature selection provides best accuracy among all the models which is 97.06% test accuracy. On the other hand, LR provides 80.21% test accuracy with feature selection but without optimization which is lowest in the whole study.","PeriodicalId":383872,"journal":{"name":"2022 International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icaeee54957.2022.9836523","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Bank performance is defined as the reflection of the way by which the assets of the bank are utilized in a form which enables it to accomplice its targets. Economic development highly depends on the functionalities of the banks. In past statistical approach is used to predict bank performance. Nowadays Machine Learning (ML) approaches are used in banking sector for better accuracy. In this work three famous Machine Learning classifiers named Random Forest (RF), Support Vector Machine (SVM) and Logistic Regression (LR) are used to find out the bank performance. The dataset used in this work are consist of 50 Turkish banks, 30 American banks and 20 European banks. The data have 24 performance indicators that measures performance from the year of 2010 to 2020. CAMEL technique is applied in this dataset in order to find ratings of the banks. In this study Genetic Algorithm (GA) plays a vital role. GA is used as optimizer and feature selector. At the end the models are evaluated with and without feature selection as well as with and without optimization. In this study SVM with optimization but without feature selection provides best accuracy among all the models which is 97.06% test accuracy. On the other hand, LR provides 80.21% test accuracy with feature selection but without optimization which is lowest in the whole study.
利用遗传算法优化的机器学习分类器预测银行业绩
银行业绩被定义为反映银行资产以一种使其能够实现其目标的形式被利用的方式。经济的发展在很大程度上取决于银行的功能。过去用统计方法来预测银行业绩。如今,机器学习(ML)方法被用于银行部门,以提高准确性。在这项工作中,使用随机森林(RF),支持向量机(SVM)和逻辑回归(LR)这三个著名的机器学习分类器来找出银行绩效。这项工作中使用的数据集包括50家土耳其银行,30家美国银行和20家欧洲银行。该数据有24项绩效指标,衡量2010年至2020年的绩效。为了找到银行的评级,在这个数据集中应用了CAMEL技术。在此研究中,遗传算法(GA)起着至关重要的作用。采用遗传算法作为优化器和特征选择器。最后对模型进行了特征选择和非特征选择以及优化和非优化的评估。在本研究中,经过优化但未进行特征选择的SVM在所有模型中准确率最高,达到97.06%的测试准确率。另一方面,LR具有特征选择但未进行优化的测试准确率为80.21%,在整个研究中最低。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信