Optimalisasi Data Tidak Seimbang Pada Data Nasabah Koperasi dalam Pemberian Pinjaman Menggunakan Random Oversampling

Richky Faizal Amir, Andreyestha Andreyestha, Imam Nawawi, Rino Ramadan
{"title":"Optimalisasi Data Tidak Seimbang Pada Data Nasabah Koperasi dalam Pemberian Pinjaman Menggunakan Random Oversampling","authors":"Richky Faizal Amir, Andreyestha Andreyestha, Imam Nawawi, Rino Ramadan","doi":"10.22441/format.2023.v12.i1.004","DOIUrl":null,"url":null,"abstract":"Cooperatives have developed from time to time, in providing services, credit cooperatives certainly have certain requirements as prospective customers to receive loans. Cooperatives need to check whether interested parties will receive loans. Loans to customers are the main source of income for cooperatives. In data mining, there are several classification algorithms that can be used for credit analysis, including the Random Forest and the C4.5 Algorithm. Data on prospective customers received from cooperatives as a condition for applying for credit is processed using Random Forest data mining and C4.5 Algorithm to support credit analysis in order to obtain accurate information on whether the prospect who applies for credit is feasible or not, this study was conducted to classify loans to prospective customers. cooperative customers using the Random Forest method and the C4.5 Algorithm which is optimized by Random Oversampling because the dataset is in an unbalanced condition. In testing the C4.5 Algorithm which is optimized with Random Oversampling, it gets an accuracy of 78.03%, where the accuracy increases by 7.89% from the previous 70.14%. Meanwhile, Random Forest with Random Oversampling has an accuracy value of 87.12%, an increase of 23.69% from the previous Random Forest test of 63.43","PeriodicalId":381291,"journal":{"name":"Format : Jurnal Ilmiah Teknik Informatika","volume":"84 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Format : Jurnal Ilmiah Teknik Informatika","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22441/format.2023.v12.i1.004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Cooperatives have developed from time to time, in providing services, credit cooperatives certainly have certain requirements as prospective customers to receive loans. Cooperatives need to check whether interested parties will receive loans. Loans to customers are the main source of income for cooperatives. In data mining, there are several classification algorithms that can be used for credit analysis, including the Random Forest and the C4.5 Algorithm. Data on prospective customers received from cooperatives as a condition for applying for credit is processed using Random Forest data mining and C4.5 Algorithm to support credit analysis in order to obtain accurate information on whether the prospect who applies for credit is feasible or not, this study was conducted to classify loans to prospective customers. cooperative customers using the Random Forest method and the C4.5 Algorithm which is optimized by Random Oversampling because the dataset is in an unbalanced condition. In testing the C4.5 Algorithm which is optimized with Random Oversampling, it gets an accuracy of 78.03%, where the accuracy increases by 7.89% from the previous 70.14%. Meanwhile, Random Forest with Random Oversampling has an accuracy value of 87.12%, an increase of 23.69% from the previous Random Forest test of 63.43
利用随机抽样的方法,对合作客户提供的贷款数据的优化不平衡
合作社不时发展起来,在提供服务时,信用社肯定有一定的要求作为准客户获得贷款。合作社需要检查利益相关方是否会获得贷款。向客户提供贷款是合作社的主要收入来源。在数据挖掘中,有几种分类算法可用于信用分析,包括随机森林和C4.5算法。本研究采用随机森林数据挖掘和C4.5算法对作为申请信贷条件的准客户数据进行处理,支持信用分析,以获得准客户申请信贷是否可行的准确信息,对准客户进行贷款分类。由于数据集处于不平衡状态,采用随机森林方法和随机过采样优化的C4.5算法对合作客户进行优化。在测试随机过采样优化后的C4.5算法时,准确率达到78.03%,比之前的70.14%提高了7.89%。同时,随机过采样随机森林的准确率为87.12%,比之前随机森林的63.43提高了23.69%
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信