半监督支持向量机算法在小企业信用预测中的比较研究

2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC) Pub Date : 2016-11-01 DOI:10.1109/BESC.2016.7804484

Jie Zhang, Lin Li, Ge Zhu, Xiangfu Meng, Qing Xie

{"title":"半监督支持向量机算法在小企业信用预测中的比较研究","authors":"Jie Zhang, Lin Li, Ge Zhu, Xiangfu Meng, Qing Xie","doi":"10.1109/BESC.2016.7804484","DOIUrl":null,"url":null,"abstract":"The small companies become increasingly important in bank's lending business. But the challenge is how bank's credit assessment is made in a small amount of time and money. Compare with the big companies, the small companies often need a small amount of cash flow. They may not provide the complete certificates or documents, so that the bank has to collect information of the companies and evaluate their credit rating especially by experts. For the bank, it is worthless to spend time and money to investigate a small company, especially just to lend several hundred thousand dollars. In the real life, credits of most the companies are good, while only small of them cannot repay for some reasons. The few number of small companies' credit data is valuable while considerable unknowing credit data of small companies is within reach. Therefore, the binary classification of the good credit and the bad credit is asymmetry. we choose supervised learning algorithm (Regularized Least Squares Classification and SVM) and semi-supervised learning algorithm (Transductive SVM and Deterministic Annealing Semi-supervised SVM) to predict the credits of small companies. In this paper, we conduct a series of experiments on credit datasets with different proportion classification and the results show that the Deterministic Annealing Semi-supervised SVM (DAS3VM) performance better when the data set is rare and asymmetry.","PeriodicalId":225942,"journal":{"name":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A comparison study of semi-supervised SVM algorithms for small business credit prediction\",\"authors\":\"Jie Zhang, Lin Li, Ge Zhu, Xiangfu Meng, Qing Xie\",\"doi\":\"10.1109/BESC.2016.7804484\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The small companies become increasingly important in bank's lending business. But the challenge is how bank's credit assessment is made in a small amount of time and money. Compare with the big companies, the small companies often need a small amount of cash flow. They may not provide the complete certificates or documents, so that the bank has to collect information of the companies and evaluate their credit rating especially by experts. For the bank, it is worthless to spend time and money to investigate a small company, especially just to lend several hundred thousand dollars. In the real life, credits of most the companies are good, while only small of them cannot repay for some reasons. The few number of small companies' credit data is valuable while considerable unknowing credit data of small companies is within reach. Therefore, the binary classification of the good credit and the bad credit is asymmetry. we choose supervised learning algorithm (Regularized Least Squares Classification and SVM) and semi-supervised learning algorithm (Transductive SVM and Deterministic Annealing Semi-supervised SVM) to predict the credits of small companies. In this paper, we conduct a series of experiments on credit datasets with different proportion classification and the results show that the Deterministic Annealing Semi-supervised SVM (DAS3VM) performance better when the data set is rare and asymmetry.\",\"PeriodicalId\":225942,\"journal\":{\"name\":\"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BESC.2016.7804484\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BESC.2016.7804484","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

小公司在银行贷款业务中变得越来越重要。但面临的挑战是如何在短时间和短资金内完成银行的信用评估。与大公司相比，小公司往往需要少量的现金流。他们可能没有提供完整的证明或文件，因此银行不得不收集公司的信息，并评估他们的信用评级，特别是由专家。对于银行来说，花费时间和金钱去调查一家小公司是毫无价值的，尤其是为了贷款几十万美元。在现实生活中，大多数公司的信用都是良好的，只有少数公司因为某些原因无法偿还。少数小公司的信用数据是有价值的，而大量不知情的小公司的信用数据是触手可及的。因此，良好信用和不良信用的二元分类是不对称的。我们选择有监督学习算法(正则化最小二乘分类和支持向量机)和半监督学习算法(传导支持向量机和确定性退火半监督支持向量机)来预测小企业的信用。本文对不同比例分类的信贷数据集进行了一系列实验，结果表明，在数据集稀少且不对称的情况下，确定性退火半监督支持向量机(DAS3VM)的性能更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A comparison study of semi-supervised SVM algorithms for small business credit prediction

The small companies become increasingly important in bank's lending business. But the challenge is how bank's credit assessment is made in a small amount of time and money. Compare with the big companies, the small companies often need a small amount of cash flow. They may not provide the complete certificates or documents, so that the bank has to collect information of the companies and evaluate their credit rating especially by experts. For the bank, it is worthless to spend time and money to investigate a small company, especially just to lend several hundred thousand dollars. In the real life, credits of most the companies are good, while only small of them cannot repay for some reasons. The few number of small companies' credit data is valuable while considerable unknowing credit data of small companies is within reach. Therefore, the binary classification of the good credit and the bad credit is asymmetry. we choose supervised learning algorithm (Regularized Least Squares Classification and SVM) and semi-supervised learning algorithm (Transductive SVM and Deterministic Annealing Semi-supervised SVM) to predict the credits of small companies. In this paper, we conduct a series of experiments on credit datasets with different proportion classification and the results show that the Deterministic Annealing Semi-supervised SVM (DAS3VM) performance better when the data set is rare and asymmetry.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC)

自引率

0.00%

发文量