A comparison study of semi-supervised SVM algorithms for small business credit prediction

Jie Zhang, Lin Li, Ge Zhu, Xiangfu Meng, Qing Xie
{"title":"A comparison study of semi-supervised SVM algorithms for small business credit prediction","authors":"Jie Zhang, Lin Li, Ge Zhu, Xiangfu Meng, Qing Xie","doi":"10.1109/BESC.2016.7804484","DOIUrl":null,"url":null,"abstract":"The small companies become increasingly important in bank's lending business. But the challenge is how bank's credit assessment is made in a small amount of time and money. Compare with the big companies, the small companies often need a small amount of cash flow. They may not provide the complete certificates or documents, so that the bank has to collect information of the companies and evaluate their credit rating especially by experts. For the bank, it is worthless to spend time and money to investigate a small company, especially just to lend several hundred thousand dollars. In the real life, credits of most the companies are good, while only small of them cannot repay for some reasons. The few number of small companies' credit data is valuable while considerable unknowing credit data of small companies is within reach. Therefore, the binary classification of the good credit and the bad credit is asymmetry. we choose supervised learning algorithm (Regularized Least Squares Classification and SVM) and semi-supervised learning algorithm (Transductive SVM and Deterministic Annealing Semi-supervised SVM) to predict the credits of small companies. In this paper, we conduct a series of experiments on credit datasets with different proportion classification and the results show that the Deterministic Annealing Semi-supervised SVM (DAS3VM) performance better when the data set is rare and asymmetry.","PeriodicalId":225942,"journal":{"name":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BESC.2016.7804484","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The small companies become increasingly important in bank's lending business. But the challenge is how bank's credit assessment is made in a small amount of time and money. Compare with the big companies, the small companies often need a small amount of cash flow. They may not provide the complete certificates or documents, so that the bank has to collect information of the companies and evaluate their credit rating especially by experts. For the bank, it is worthless to spend time and money to investigate a small company, especially just to lend several hundred thousand dollars. In the real life, credits of most the companies are good, while only small of them cannot repay for some reasons. The few number of small companies' credit data is valuable while considerable unknowing credit data of small companies is within reach. Therefore, the binary classification of the good credit and the bad credit is asymmetry. we choose supervised learning algorithm (Regularized Least Squares Classification and SVM) and semi-supervised learning algorithm (Transductive SVM and Deterministic Annealing Semi-supervised SVM) to predict the credits of small companies. In this paper, we conduct a series of experiments on credit datasets with different proportion classification and the results show that the Deterministic Annealing Semi-supervised SVM (DAS3VM) performance better when the data set is rare and asymmetry.
半监督支持向量机算法在小企业信用预测中的比较研究
小公司在银行贷款业务中变得越来越重要。但面临的挑战是如何在短时间和短资金内完成银行的信用评估。与大公司相比,小公司往往需要少量的现金流。他们可能没有提供完整的证明或文件,因此银行不得不收集公司的信息,并评估他们的信用评级,特别是由专家。对于银行来说,花费时间和金钱去调查一家小公司是毫无价值的,尤其是为了贷款几十万美元。在现实生活中,大多数公司的信用都是良好的,只有少数公司因为某些原因无法偿还。少数小公司的信用数据是有价值的,而大量不知情的小公司的信用数据是触手可及的。因此,良好信用和不良信用的二元分类是不对称的。我们选择有监督学习算法(正则化最小二乘分类和支持向量机)和半监督学习算法(传导支持向量机和确定性退火半监督支持向量机)来预测小企业的信用。本文对不同比例分类的信贷数据集进行了一系列实验,结果表明,在数据集稀少且不对称的情况下,确定性退火半监督支持向量机(DAS3VM)的性能更好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信