{"title":"个人信用风险评估数据挖掘算法比较研究","authors":"Hong Yu, Xiaolei Huang, Xiaorong Hu, Hengwen Cai","doi":"10.1109/ICMECG.2010.16","DOIUrl":null,"url":null,"abstract":"Individual credit risk evaluation is an important and challenging data mining problem in financial analysis domain. This paper compares the effectiveness of four data mining algorithms - logistic regression (LR), decision tree (C4.5), support vector machine (SVM) and neural networks (NN) by applying them to two credit data sets. Experiment results show that the LR and SVM algorithms produced the best classification accuracy, and the SVM shows the higher robustness and generalization ability compared to the other algorithms. On the contrary, the neural networks algorithm performed poor relatively on the two credit data sets in our experiments. The computer simulation shows the C4.5 algorithm is sensitive to input data, and the classification accuracy is unstable, but it has the better explanatory.","PeriodicalId":129936,"journal":{"name":"2010 International Conference on Management of e-Commerce and e-Government","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"26","resultStr":"{\"title\":\"A Comparative Study on Data Mining Algorithms for Individual Credit Risk Evaluation\",\"authors\":\"Hong Yu, Xiaolei Huang, Xiaorong Hu, Hengwen Cai\",\"doi\":\"10.1109/ICMECG.2010.16\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Individual credit risk evaluation is an important and challenging data mining problem in financial analysis domain. This paper compares the effectiveness of four data mining algorithms - logistic regression (LR), decision tree (C4.5), support vector machine (SVM) and neural networks (NN) by applying them to two credit data sets. Experiment results show that the LR and SVM algorithms produced the best classification accuracy, and the SVM shows the higher robustness and generalization ability compared to the other algorithms. On the contrary, the neural networks algorithm performed poor relatively on the two credit data sets in our experiments. The computer simulation shows the C4.5 algorithm is sensitive to input data, and the classification accuracy is unstable, but it has the better explanatory.\",\"PeriodicalId\":129936,\"journal\":{\"name\":\"2010 International Conference on Management of e-Commerce and e-Government\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-10-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"26\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 International Conference on Management of e-Commerce and e-Government\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMECG.2010.16\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference on Management of e-Commerce and e-Government","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMECG.2010.16","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Comparative Study on Data Mining Algorithms for Individual Credit Risk Evaluation
Individual credit risk evaluation is an important and challenging data mining problem in financial analysis domain. This paper compares the effectiveness of four data mining algorithms - logistic regression (LR), decision tree (C4.5), support vector machine (SVM) and neural networks (NN) by applying them to two credit data sets. Experiment results show that the LR and SVM algorithms produced the best classification accuracy, and the SVM shows the higher robustness and generalization ability compared to the other algorithms. On the contrary, the neural networks algorithm performed poor relatively on the two credit data sets in our experiments. The computer simulation shows the C4.5 algorithm is sensitive to input data, and the classification accuracy is unstable, but it has the better explanatory.